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HUMAN GENES AND GENE EXPRESSION PRODUCTS 

Cross-Reference to Related Application 

[0001] Ths application claims the benefit of earlier-filed U.S. provisional application serial no. 
60/226,326 filed August 16, 2000, which application is incorporated herein by reference in its 
entirety. 

Field of the Invention 

[0002] The present invention relates to polynucleotides of human origin, particularly in human 
colon, breast, prostate, and/or lung tissue, and the encoded gene products. 
Background of the Invention 

[0003] Identification of novel polynucleotides, particularly those that encode an expressed gene 
product, is important in the advancement of drug discovery, diagnostic technologies, and the 
understanding of the progression and nature of complex diseases such as cancer. Identification of 
genes expressed in different cell types isolated from sources that differ in disease state or stage, 
developmental stage, exposure to various environmental factors, the tissue of origin, the species 
from which the tissue was isolated, and the like is key to identifying the genetic factors that are 
responsible for the phenotypes associated with these various differences. 

[0004] This invention provides novel human polynucleotides, the polypeptides encoded by 
these polynucleotides, and the genes and proteins corresponding to these novel polynucleotides. 

Summary of the Invention 
[0005] This invention relates to novel human polynucleotides and variants thereof, their 
encoded polypeptides and variants thereof, to genes corresponding to these polynucleotides and to 
proteins expressed by the genes. The invention also relates to diagnostics and therapeutics 
comprising such novel human polynucleotides, their corresponding genes or gene products, 
including probes, antisense nucleotides, and antibodies. The polynucleotides of the invention 
correspond to a polynucleotide comprising the sequence information of at least one of SEQ ID 
NOS: 1-6010. 

[0006] Various aspects and embodiments of the invention will be readily apparent to the 
ordinarily skilled artisan upon reading the description provided herein. 

Detailed Description of the Invention 
[0007] Before the present invention is described, it is to be understood that this invention is not 
limited to particular embodiments described, as such may, of course, vary. It is also to be 
understood that the terminology used herein is for the purpose of describing particular embodiments 
only, and is not intended to be limiting. 

1 



_0214500A2_I_> 



wr/ 

r 



I . WO 02/14500 PCT/US01/25840 

Unless defined otherwise, all technical and scientific terms used herein have the same meaning 
as commonly understood by one of ordinary skill in the art to which this invention belongs. Although 
any methods and materials similar or equivalent to those described herein can be used in the practice or 
testing of the present invention, the preferred methods and materials are now described. 
5 All publications and patent applications cited in this specification are herein incorporated by 

reference as if each individual publication or patent application were specifically and individually 
indicated to be incorporated by reference. The citation of any publication is for its disclosure prior to 
the filing date and should not be construed as an admission that the present invention is not entitled to 
antedate such publication by virtue of prior invention. 
10 It must be noted that as used herein and in the appended claims, the singular forms "a," "and," 

and "the" include plural referents unless the context clearly dictates otherwise. Thus, for example, 
reference to "a polynucleotide" includes a plurality of such polynucleotides and reference to "the colon 
cancer cell" includes reference to one or more cells and equivalents thereof known to those skilled in 
the art, and so forth. 

15 The publications and applications discussed herein are provided solely for their disclosure 

prior to the filing date of the present application. Nothing herein is to be construed as an admission 
that the present invention is not entitled to antedate such publication by virtue of prior invention. 
Further, the dates of publication provided may be different from the actual publication dates which may 
need to be independently confirmed. 

20 Definitions 

The terms "polynucleotide" and "nucleic acid," used interchangeably herein, refer to a 
polymeric forms of nucleotides of any length, either ribonucleotides or deoxynucleotides. Thus, these 
terms include, but are not limited to, single-, double-, or multi-stranded DNA or RNA, genomic DNA, 
cDNA, DNA-RNA hybrids, branched nucleic acid (see, e.g., U.S. Pat. Nos. 5,124,246; 5,710,264; and 

25 5,849,48 1) , or a polymer comprising purine and pyrimidine bases or other natural, chemically or 

biochemically modified, non-natural, or derivatized nucleotide bases. These terms furhter include, but 
are not limited to, mRNA or cDNA that comprise intronic sequences (see, e.g. , Niwa et al. (1999) Cell 
99(7):69 1-702). The backbone of the polynucleotide can comprise sugars and phosphate groups (as 
may typically be found in RNA or DNA), or modified or substituted sugar or phosphate groups. 

30 Alternatively, the backbone of the polynucleotide can comprise a polymer of synthetic subunits such as 
phosphoramidites and thus can be an oligodeoxynucleoside phosphoramidate or a mixed 
phosphoramidate-phosphodiester oligomer. Peyrottes et al. (1996) Nucl. Acids Res. 24:1841-1848; 
Chaturvedi et al. (1996) Nucl. Acids Res. 24:23 18-2323. A polynuclotide may comprise modified 
nucleotides, such as methylated nucleotides and nucleotide analogs, uracyl, other sugars, and linking 
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groups such as fluororibose and thioate, and nucleotide branches. The sequence of nucleotides may be 
interrupted by non-nucleotide components. A polynucleotide may be further modified after 
polymerization, such as by conjugation with a labeling component. Other types of modifications 
included in this definition are caps, substitution of one or more of the naturally occurring nucleotides 
with an analog, and introduction of means for attaching the polynucleotide to proteins, metal ions, 
labeling components, other polynucleotides, or a solid support. 

The terms "polypeptide" and "protein," used interchangebly herein, refer to a polymeric form 
of amino acids of any length, which can include coded and non-coded amino acids, chemically or 
biochemically modified or derivatized amino acids, and polypeptides having modified peptide 
backbones. The term includes fusion proteins, including, but not limited to, fusion proteins with a 
heterologous amino acid sequence, fusions with heterologous and homologous leader sequences, with 
or without N-terminal methionine residues; immunologically tagged proteins; and the like. 

"Diagnosis" as used herein generally includes determination of a subject's susceptibility to a 
disease or disorder, determination as to whether a subject is presently affected by a disease or disorder, 
prognosis of a subject affected by a disease or disorder (e.g., identification of pre-metastatic or 
metastatic cancerous states, stages of cancer, or responsiveness of cancer to therapy), and therametrics 
(e.g., monitoring a subject's condition to provide information as to the effect or efficacy of therapy). 

"Sample" or !l biological sample" as used herein encompasses a variety of sample types, and are 
generally meant to refer to samples of biological fluids or tissues, particularly samples obtained from 
tissues, especially from cells of the type associated with a disease or condition for Which a diagnostic 
application is designed (e.g., ductal adenocarcinoma), and the like. "Sample" or "biological sample" 
are meant to encompass blood and other liquid samples of biological origin, solid tissue samples, such 
as a biopsy specimen or tissue cultures or cells derived therefrom and the progeny thereof. These terms 
encompass samples that have been manipulated in any way after their procurement as well as 
derivatives and fractions of samples, where the samples may be maniuplated by, for example, treatment 
with reagents, solubilization, or enrichment for certain components. The terms also encompass clinical 
samples, and also includes cells in cell culture, cell supematants, cell lysates, serum, plasma, biological 
fluids, and tissue samples. Where the sample is solid tissue, the cells of the tissue can be dissociated or 
tissue sections can be analyzed. 

The terms "treatment," "treating," "treat" and the like are used herein to generally refer to 
obtaining a desired pharmacologic and/or physiologic effect. The effect may be prophylactic in terms 
of completely or partially preventing a disease or symptom thereof and/or may be therapeutic in terms 
of a partial or complete stabilization or cure for a disease and/or adverse effect attributable to the 
disease. "Treatment" as used herein covers any treatment of a disease in a mammal, particularly a 
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human, and includes: (a) preventing the disease or symptom from occurring in a subject which maybe 
predisposed to the disease or symptom but has not yet been diagnosed as having it; (b) inhibiting the 
disease symptom, i.e., arresting its development; or relieving the disease symptom, i.e., causing 
regression of the disease or symptom. 
5 The terms "individual," "subject," "host" and "patient," used interchangeably herein and refer 

to any mammalian subject for whom diagnosis, treatment, or therapy is desired, particularly humans. 
Other subjects may include cattle, dogs, cats, guinea pigs, rabbits, rats, mice, horses, and so on. 

As used herein the term "isolated" refers to a polynucleotide, a polypeptide, an antibody, or a 
host cell that is in an environment different from that in which the polynucleotide, the polypeptide, the 

10 antibody, or the host cell naturally occurs. A polynucleotide, a polypeptide, an antibody, or a host cell 
which is isolated is generally substantially purified. As used herein, the term "substantially purified" 
refers to a compound (e.g., either a polynucleotide or a polypeptide or an antibody) that is removed 
from its natural environment and is at least 60% free, preferably 75% free, and most preferably 90% 
free from other components with which it is naturally associated. Thus, for example, a composition 

15 containing A is "substantially free of B when at least 85% by weight of the total A+B in the 

composition is A. Preferably, A comprises at least about 90% by weight of the total of A+B in the 
composition, more preferably at least about 95% or even 99% by weight. 

A "host cell," as used herein, refers to a microorganism or a eukaryotic cell or cell line cultured 
as a unicellular entity which can be, or has been, used as a recipient for a recombinant vector or other 

20 transfer polynucleotides, and include the progeny of the original cell which has been transfected. It is 
understood that the progeny of a single cell may not necessarily be completely identical in morphology 
or in genomic or total DNA complement as the original parent, due to natural, accidental, or deliberate 
mutation. 

The terms "cancer," "neoplasm," "tumor," and "carcinoma," are used interchangeably herein to 
25 refer to cells which exhibit relatively autonomous growth, so that they exhibit an aberrant growth 

phenotype characterized by a significant loss of control of cell proliferation. In general, cells of interest 
for detection or treatment in the present application include precancerous (e.g., benign), malignant, 
metastatic, and non-metastatic cells. Detection of cancerous cell is of particular interest. 

The use of "e", as in 10e-3, indicates that the number to the left of "e" is raised to the power of 

30 the number to the right of "e" (thus, 10e-3 is 10~ 3 ). 

The term heterologous" as used herein in the context of, for example, heterologous nucleic 
acid or amino acid sequences, heterologous polypeptides, or heterologous nucleic acid, is meant to refer 
to material that originates from a source different from that with which it is joined or associated. For 
example, two DNA sequences are heterologous to one another if the sequences are from different genes 
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or from different species. A recombinant host cell containing a sequence that is heterologous to the 
host cell can be, for example, a bacterial cell containing a sequence encoding a human polypeptide. 

The invention relates to polynucleotides comprising the disclosed nucleotide sequences, to full 
length cDNA, mRNA, genomic sequences, and genes corresponding to these sequences and degenerate 

5 variants thereof, and to polypeptides encoded by the polynucleotides of the invention and polypeptide 
variants. The following detailed description describes the polynucleotide compositions encompassed 
by the invention, methods for obtaining cDNA or genomic DNA encoding a full-length gene product, 
expression of these polynucleotides and genes, identification of structural motifs of the polynucleotides 
and genes, identification of the function of a gene product encoded by a gene corresponding to a 

10 polynucleotide of the invention, use of the provided polynucleotides as probes and in mapping and in 
tissue profiling, use of the corresponding polypeptides and other gene products to raise antibodies, and 
use of the polynucleotides and their encoded gene products for therapeutic and diagnostic purposes. 
Polynucleotide Compositions 

The scope of the invention with respect to polynucleotide compositions includes, but is not 

15 necessarily limited to, polynucleotides having a sequence set forth in any one of SEQ ID NOS: 1-6010; 
polynucleotides obtained from the biological materials described herein or other biological sources 
(particularly human sources) by hybridization under stringent conditions (particularly conditions of 
high stringency); genes corresponding to the provided polynucleotides; variants of the provided 
polynucleotides and their corresponding genes, particularly those variants that retain a biological 

20 activity of the encoded gene product (e.g. , a biological activity ascribed to a gene product 

corresponding to the provided polynucleotides as a result of the assignment of the gene product to a 
protein family(ies) and/or identification of a functional domain present in the gene product). Other 
nucleic acid compositions contemplated by and within the scope of the present invention will be readily 
apparent to one of ordinary skill in the art when provided with the disclosure here. "Polynucleotide" 

25 and "nucleic acid" as used herein with reference to nucleic acids of the composition is not intended to 
be limiting as to the length or structure of the nucleic acid unless specifically indicated. 

The invention features polynucleotides that are expressed in human tissue, especially human 
colon, prostate, breast, lung and/or endothelial tissue. Novel nucleic acid compositions of the invention 
of particular interest comprise a sequence set forth in any one of SEQ ID NOS: 1-6010 or an 

30 identifying sequence thereof. An "identifying sequence" is a contiguous sequence of residues at least 
about 10 nt to about 20 nt in length, usually at least about 50 nt to about 100 nt in length, that uniquely 
identifies a polynucleotide sequence, e.g., exhibits less than 90%, usually less than about 80% to about 
85% sequence identity to any contiguous nucleotide sequence of more than about 20 nt Thus, the 
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subject novel nucleic acid compositions include full length cDNAs or mRNAs that encompass an 
identifying sequence of contiguous nucleotides from any one of SEQ ID NOS: 1-6010. 

The polynucleotides of the invention also include polynucleotides having sequence similarity 
or sequence identity. Nucleic acids having sequence similarity are detected by hybridization under low 
5 stringency conditions, for example, at 50°C and 10XSSC (0.9 M saline/0.09 M sodium citrate) and 
remain bound when subjected to washing at 55°C in 1XSSC. Sequence identity can be determined by 
hybridization under stringent conditions, for example, at 50°C or higher and 0. 1XSSC (9 mM 
saline/0.9 mM sodium citrate). Hybridization methods and conditions are well known in the art, see, 
e.g., USPN 5,707,829. Nucleic acids that are substantially identical to the provided polynucleotide 
10 sequences, e.g. allelic variants, genetically altered versions of the gene, etc. , bind to the provided 

polynucleotide sequences ( SEQ ID NOS: 1-6010) under stringent hybridization conditions. By using 
probes, particularly labeled probes of DNA sequences, one can isolate homologous or related genes. 
The source of homologous genes can be any species, e.g. primate species, particularly human; rodents, 
such as rats and mice; canines, felines, bovines, ovines, equines, yeast, nematodes, etc. 
15 Preferably, hybridization is performed using at least 15 contiguous nucleotides (nt) of at least 

one of SEQ ID NOS:1-6010. That is, when at least 15 contiguous nt of one of the disclosed SEQ ID 
NOS. is used as a probe, the probe will preferentially hybridize with a nucleic acid comprising the 
complementary sequence, allowing the identification and retrieval of the nucleic acids that uniquely 
hybridize to the selected probe. Probes from more than one SEQ ID NO. can hybridize with the same 
20 nucleic acid if the cDNA from which they were derived corresponds to one mRNA. Probes of more 

than 15 nt can be used, e.g., probes of from about 18 nt to about 100 nt, but 15 nt represents sufficient 
sequence for unique identification. 

The polynucleotides of the invention also include naturally occurring variants of the nucleotide 
sequences {e.g., degenerate variants, allelic variants, etc.). Variants of the polynucleotides of the 
25 invention are identified by hybridization of putative variants with nucleotide sequences disclosed 

herein, preferably by hybridization under stringent conditions. For example, by using appropriate wash 
conditions, variants of the polynucleotides of the invention can be identified where the allelic variant 
exhibits at most about 25-30% base pair (bp) mismatches relative to the selected polynucleotide probe. 
In general, allelic variants contain 15-25% bp mismatches, and can contain as little as even 5-15%, or 
30 2-5%, or 1-2% bp mismatches, as well as a single bp mismatch. 

The invention also encompasses homologs corresponding to the polynucleotides of SEQ ID 
NOS: 1-6010, where the source of homologous genes can be any mammalian species, e.g., primate 
species, particularly human; rodents, such as rats; canines, felines, bovines, ovines, equines, yeast, 
nematodes, etc. Between mammalian species, e.g., human and mouse, homologs generally have 
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substantial sequence similarity, e.g., at least 75% sequence identity, usually at least 90%, more usually 
at least 95% between nucleotide sequences. Sequence similarity is calculated based on a reference 
sequence, which may be a subset of a larger sequence, such as a conserved motif, coding region, 
flanking region, etc. A reference sequence will usually be at least about 18 contiguous nt long, more 
usually at least about 30 nt long, and may extend to the complete sequence that is being compared. 
Algorithms for sequence analysis are known in the art, such as gapped BLAST, described in Altschul, 
et al. Nucleic Acids Res. (1997) 25:3389-3402. 

In general, variants of the invention have a sequence identity greater than at least about 65%, 
preferably at least about 75%, more preferably at least about 85%, and can be greater than at least 
about 90% or more as determined by the Smith-Waterman homology search algorithm as implemented 
in MPSRCH program (Oxford Molecular). For the purposes of this invention, a preferred method of 
calculating percent identity is the Smith-Waterman algorithm, using the following. Global DNA 
sequence identity must be greater than 65% as determined by the Smith-Waterman homology search 
algorithm as implemented in MPSRCH program (Oxford Molecular) using an affine gap search with 
the following search parameters: gap open penalty, 12; and gap extension penalty, 1. 

The subject nucleic acids can be cDNAs or genomic DNAs, as well as fragments thereof, 
particularly fragments that encode a biologically active gene product and/or are useful in the methods 
disclosed herein (e.g. , in diagnosis, as a unique identifier of a differentially expressed gene of interest, 
etc.). The term "cDNA" as used herein is intended to include all nucleic acids that share the 
arrangement of sequence elements found in native mature mRNA species, where sequence elements are 
exons and 3' and 5' non-coding regions. Normally mRNA species have contiguous exons, with the 
intervening introns, when present, being removed by nuclear RNA splicing, to create a continuous open 
reading frame encoding a polypeptide of the invention. 

A genomic sequence of interest comprises the nucleic acid present between the initiation codon 
and the stop codon, as defined in the listed sequences, including all of the introns that are normally 
present in a native chromosome. It can further include the 3' and 5 9 untranslated regions found in the 
mature mRNA. It can further include specific transcriptional and translational regulatory sequences, 
such as promoters, enhancers, etc., including about 1 kb, but possibly more, of flanking genomic DNA 
at either the 5 ' and 3 9 end of the transcribed region. The genomic DNA can be isolated as a fragment 
of 100 kbp or smaller; and substantially free of flanking chromosomal sequence. The genomic DNA 
flanking the coding region, either 3' and 5', or internal regulatory sequences as sometimes found in 
introns, contains sequences required for proper tissue, stage-specific, or disease-state specific 
expression. 
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The nucleic acid compositions of the subject invention can encode all or a part of the subject 
polypeptides. Double or single stranded fragments can be obtained from the DNA sequence by 
chemically synthesizing oligonucleotides in accordance with conventional methods, by restriction 
enzyme digestion, by PCR amplification, etc. Isolated polynucleotides and polynucleotide fragments 
5 of the invention comprise at least about 10, about 15, about 20, about 35, about 50, about 100, about 
150 to about 200, about 250 to about 300, or about 350 contiguous nt selected from the polynucleotide 
sequences as shown in SEQ ID NOS: 1-60 10. For the most part, fragments will be of at least 15 nt, 
usually at least 1 8 nt or 25 nt, and up to at least about 50 contiguous nt in length or more. In a 
preferred embodiment, the polynucleotide molecules comprise a contiguous sequence of at least 12 nt 

10 selected from the group consisting of the polynucleotides shown in SEQ IDNOS:1-6010. 

Probes specific to the polynucleotides of the invention can be generated using the 
polynucleotide sequences disclosed in SEQ ID NOS: 1-6010. The probes are preferably at least about 
12, 15, 16, 18, 20, 22, 24, or 25 nt fragment of a corresponding contiguous sequence of SEQ ID 
NOS: 1-60 10, and can be less than 2, 1,0.5, 0.1, or 0.05 kb in length. The probes can be synthesized 

1 5 chemically or can be generated from longer polynucleotides using restriction enzymes. The probes can 
be labeled, for example, with a radioactive, biotinylated, or fluorescent tag. Preferably, probes are 
designed based upon an identifying sequence of a polynucleotide of one of SEQ ID NOS: 1-6010. 
More preferably, probes are designed based on a contiguous sequence of one of the subject 
polynucleotides that remain unmasked following application of a masking program for masking low 

20 complexity (e.g.,XBLAST) to the sequence., i.e., one would select an unmasked region, as indicated by 
the polynucleotides outside the poly-n stretches of the masked sequence produced by the masking 
program. 

The polynucleotides of the subject invention are isolated and obtained in substantial purity, 
generally as other than an intact chromosome. Usually, the polynucleotides, either as DNA or RNA, 

25 will be obtained substantially free of other naturally-occurring nucleic acid sequences, generally being 
at least about 50%, usually at least about 90% pure and are typically "recombinant," e.g., flanked by 
one or more nucleotides with which it is not normally associated on a naturally occurring chromosome. 

The polynucleotides of the invention can be provided as a linear molecule or within a circular 
molecule, and can be provided within autonomously replicating molecules (vectors) or within 

30 molecules without replication sequences. Expression of the polynucleotides can be regulated by then- 
own or by other regulatory sequences known in the art. The polynucleotides of the invention can be 
introduced into suitable host cells using a variety of techniques available in the art, such as transferrin 
polycation-mediated DNA transfer, transfection with naked or encapsulated nucleic acids, liposome- 
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mediated DNA transfer, intracellular transportation of DNA-coated latex beads, protoplast fusion, viral 
infection, electroporation, gene gun, calcium phosphate-mediated transfection, and the like. 

The subject nucleic acid compositions can be used to, for example, produce polypeptides, as 
probes for the detection of mRNA of the invention in biological samples (e.g., extracts of human cells) 

5 to generate additional copies of the polynucleotides, to generate ribozymes or antisense 

oligonucleotides, and as single stranded DNA probes or as triple-strand forming oligonucleotides. The 
probes described herein can be used to, for example, determine the presence or absence of the 
polynucleotide sequences as shown in SEQ ID NOS : 1 -60 1 0 or variants thereof in a sample. These 
and other uses are described in more detail below. 

10 Use of Polynucleotides to Obtain Full-Length cDNA. Gene, and Promoter Region 

In one embodiment, the polynucleotides are useful as starting materials to construct larger 
molecules. In one example, the polynucleotides of the invention are used to construct polynucleotides 
that encode a largo: polypeptide {e.g. , up to the full-length native polypeptide as well as fusion proteins 
comprising all or a portion of the native polypeptide) or may be used to produce haptens of the 

15 polypeptide {e.g. , polypeptides useful to generate antibodies). 

In one particular example, the polynucleotides of the invention are used to make or isolate 
cDNA molecules encoding all or portion of a naturally-occuring polypeptide. Full-length cDNA 
molecules comprising the disclosed polynucleotides are obtained as follows. A polynucleotide having a 
sequence of one of SEQ ID NOS: 1-6010, or a portion thereof comprising at least 12, 15, 18, or 20 nt, 

20 is used as a hybridization probe to detect hybridizing members of a cDNA library using probe design 
methods, cloning methods, and clone selection techniques such as those described in USPN 5,654, 173. 
Libraries of cDNA are made from selected tissues, such as normal or tumor tissue, or from tissues of a 
mammal treated with, for example, a pharmaceutical agent Preferably, the tissue is the same as the 
tissue from which the polynucleotides of the invention were isolated, as both the polynucleotides 

25 described herein and the cDNA represent expressed genes. Most preferably, the cDNA library is made 
from the biological material described herein in the Examples. The choice of cell type for library 
construction can be made after the identity of the protein encoded by the gene corresponding to the 
polynucleotide of the invention is known. This will indicate which tissue and cell types are likely to 
express the related gene, and thus represent a suitable source for the mRNA for generating the cDNA. 

30 Where the provided polynucleotides are isolated from cDNA libraries, the libraries are prepared from 
mRNA of human colon cells, more preferably, human colon cancer cells, even more preferably, from a 
highly metastatic colon cell, Kml2L4-A. 

Techniques for producing and probing nucleic acid sequence libraries are described, for 
example, in Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Ed., (1989) Cold Spring 
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Harbor Press, Cold Spring Harbor, NY. The cDNA can be prepared by using primers based on 
polynucleotides comprising a sequence of SEQ ID NOS: 1-60 10. In one embodiment, the cDNA 
library can be made from only poly-adenylated mRNA. Thus, poly-T primers can be used to prepare 
cDNA from the mRNA. 

Members of the library that are larger than the provided polynucleotides, and preferably that 
encompass the complete coding sequence of the native message, are obtained. In order to confirm that 
the entire cDNA has been obtained, RNA protection experiments are performed as follows. 
Hybridization of a full-length cDNA to an mRNA will protect the RNA from RNase degradation. If 
the cDNA is not full length, then the portions of the mRNA that are not hybridized will be subject to 
RNase degradation. This is assayed, as is known in the art, by changes in electrophoretic mobility on 
polyacrylamide gels, or by detection of released monoribonucleotides. Sambrook et al., Molecular 
Cloning: A Laboratory Manual, 2nd E4, (1989) Cold Spring Harbor Press, Cold Spring Harbor, NY. 
In order to obtain additional sequences 5' to the end of a partial cDNA, 5' RACE (PCR Protocols: A 
Guide to Methods and Applications, (1990) Academic Press, Inc.) can be performed. 

Genomic DNA is isolated using the provided polynucleotides in a manner similar to the 
isolation of full-length cDNAs. Briefly, the provided polynucleotides, or portions thereof, are used as 
probes to libraries of genomic DNA. Preferably, the library is obtained from the cell type that was 
used to generate the polynucleotides of the invention, but this is not essential. Most preferably, the 
genomic DNA is obtained from the biological material described herein in the Examples. Such 
libraries can be in vectors suitable for carrying large segments of a genome, such as PI or YAC, as 
described in detail in Sambrook et al., supra, 9.4-9.30. In addition, genomic sequences can be isolated 
from human BAC libraries, which are commercially available from Research Genetics, Inc., Huntsville, 
Alabama, USA, for example. In order to obtain additional 5' or 3 f sequences, chromosome walking is 
performed, as described in Sambrook et al., such that adjacent and overlapping fragments of genomic 
DNA are isolated. These are mapped and pieced together, as is known in the art, using restriction 
digestion enzymes and DNA ligase. 

Using the polynucleotide sequences of the invention, corresponding full-length genes can be 
isolated using both classical and PCR methods to construct and probe cDNA libraries. Using either 
method, Northern blots, preferably, are perfonned on a number of cell types to determine which cell 
lines express the gene of interest at the highest level. Classical methods of constructing cDNA libraries 
are taught in Sambrook et al., supra. With these methods, cDNA can be produced from mRNA and 
inserted into viral or expression vectors. Typically, libraries of mRNA comprising poly(A) tails can be 
produced with poly(T) primers. Similarly, cDNA libraries can be produced using the instant sequences 
as primers. 
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PCR methods are used to amplify the members of a cDNA library that comprise the desired 
insert. In this case, the desired insert will contain sequence from the full length cDNA that corresponds 
to the instant polynucleotides. Such PCR methods include gene trapping and RACE methods. Gene 
trapping entails inserting a member of a cDNA library into a vector. The vector then is denatured to 
5 produce single stranded molecules. Next, a substrate-bound probe, such as a biotinylated oligo, is used 
to trap cDNA inserts of interest. Biotinylated probes can be linked to an avidin-bound solid substrate. 
PCR methods can be used to amplify the trapped cDNA. To trap sequences corresponding to the full 
length genes, the labeled probe sequence is based on the polynucleotide sequences of the invention. 
Random primers or primers specific to the library vector can be used to amplify the trapped cDNA. 
10 Such gene trapping techniques are described in Gruber et al., WO 95/04745 and Gruber et al., USPN 
5,500,356. Kits are commercially available to perform gene trapping experiments from, for example, 
Life Technologies, Gaithersburg, Maryland, USA. 

"Rapid amplification of cDNA ends," or RACE, is a PCR method of amplifying cDNAs from 
a number of different RNAs. The cDNAs are ligated to an oligonucleotide linker, and amplified by 
15 PCR using two primers. One primer is based on sequence from the instant polynucleotides, for which 
full length sequence is desired, and a second primer comprises sequence that hybridizes to the 
oligonucleotide linker to amplify the cDNA. A description of this method is reported in WO 
97/191 10. In preferred embodiments of RACE, a common primer is designed to anneal to an arbitrary 
adaptor sequence ligated to cDNA ends (Apte and Siebert, Biotechniques (1993) 15:890-893; Edwards 
20 et al., Nuc. Acids Res. (1991) 19:5227-5232). When a single gene-specific RACE primer is paired 
with the common primer, preferential amplification of sequences between the single gene specific 
primer and the common prima: occurs. Commercial cDNA pools modified for use in RACE are 
available. 

Another PCR-based method generates full-length cDNA library with anchored ends without 
25 needing specific knowledge of the cDNA sequence. The method uses lock-docking primers (I-VI), 

where one primer, poly TV (HE) locks over the polyA tail of eukaryotic mRNA producing first strand 
synthesis and a second primer, polyGH (TV-VT) locks onto the polyC tail added by terminal 
deoxynucleotidyl transferase (TdT)(see, e.g., WO 96/40998). 

The promoter region of a gene generally is located 5 5 to the initiation site for RNA polymerase 
30 H. Hundreds of promoter regions contain the "TATA" box, a sequence such as TATTA or TATAA, 
which is sensitive to mutations. The promoter region can be obtained by performing 5' RACE using a 
primer from the coding region of the gene. Alternatively, the cDNA can be used as a probe for the 
genomic sequence, and the region 5' to the coding region is identified by "walking up." If the gene is 
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highly expressed or differentially expressed, the promoter from the gene can be of use in a regulatory 
construct for a heterologous gene. 

Once the full-length cDNA or gene is obtained, DNA encoding variants can be prepared by 
site-directed mutagenesis, described in detail in Sambrook et al., 15.3-15.63. The choice of codon or 

5 nucleotide to be replaced can be based on disclosure herein on optional changes in amino acids to 
achieve altered protein structure and/or function. 

As an alternative method to obtaining DNA or RNA from a biological material, nucleic acid 
comprising nucleotides having the sequence of one or more polynucleotides of the invention can be 
synthesized. Thus, the invention encompasses nucleic acid molecules ranging in length from 15 nt 

10 (corresponding to at least 15 contiguous nt of one of SEQ ID NOS: 1-6010) up to a maximum length 
suitable for one or more biological manipulations, including replication and expression, of the nucleic 
acid molecule. The invention includes but is not limited to (a) nucleic acid having the size of a full 
gene, and comprising at least one of SEQ ID NOS:1-6010; (b) the nucleic acid of (a) also comprising 
at least one additional gene, operably linked to permit expression of a fusion protein; (c) an expression 

15 vector comprising (a) or (b); (d) a plasmid comprising (a) or (b); and (e) a recombinant viral particle 
comprising (a) or (b). Once provided with the polynucleotides disclosed herein, construction or 
preparation of (a) » (e) are well within the skill in the art. 

The sequence of a nucleic acid comprising at least 15 contiguous nt of at least any one of SEQ 
IDNOS:1-6010, preferably the entire sequence of at least any one of SEQ ID NOS:1-6010, is not 

20 limited and can be any sequence of A, T, G, and/or C (for DNA) and A, U, G, and/or C (for RNA) or 
modified bases thereof, including inosine and pseudouridine. The choice of sequence will depend on 
the desired function and can be dictated by coding regions desired, the intron-like regions desired, and 
the regulatory regions desired. Where the entire sequence of any one of SEQIDNOS:1-6010 is within 
the nucleic acid, the nucleic acid obtained is referred to herein as a polynucleotide comprising the 

25 sequence of any one of SEQ ID NOS: 1-6010. 

Ex pression of Polypeptide Encoded bv Full-Length cDNA or Full-Length Gene 

The provided polynucleotides (e.g., a polynucleotide having a sequence of one of SEQ ID 
NOS: 1-60 10), the corresponding cDNA, or the full-length gene is used to express a partial or complete 
gene product. Constructs of polynucleotides having sequences of SEQ ID NOS : 1-6010 can also be 

30 generated synthetically. Alternatively, single-step assembly of a gene and entire plasmid from large 
numbers of oligodeoxyribonucleotides is described by, e.g., Stemmer et aL, Gene (Amsterdam) (1995) 
164(l):49-53. In this method, assembly PCR (the synthesis of long DNA sequences from large 
numbers of oligodeoxyribonucleotides (oligos)) is described. The method is derived from DNA 
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shuffling (Stemmer, Nature (1994) 370:389-391), and does not rely on DNA ligase, but instead relies 
on DNA polymerase to build increasingly longer DNA fragments during the assembly process. 

Appropriate polynucleotide constructs are purified using standard recombinant DNA 
techniques as described in, for example, Sambrook et al, Molecular Cloning: A Laboratory Manual, 
2nd Ed, 7 (1989) Cold Spring Harbor Press, Cold Spring Harbor, NY, and under current regulations 
described in United States Dept. of HHS, National Institute of Health (NIH) Guidelines for 
Recombinant DNA Research. The gene product encoded by a polynucleotide of the invention is 
expressed in any expression system, including, for example, bacterial, yeast, insect, amphibian and 
mammalian systems. Vectors, host cells and methods for obtaining expression in same are well known 
in the art. Suitable vectors and host cells are described in USPN 5,654,173. 

Polynucleotide molecules comprising a polynucleotide sequence provided herein are generally 
propagated by placing the molecule in a vector. Viral and non-viral vectors are used, including 
plasmids. The choice of plasmid will depend on the type of cell in which propagation is desired and the 
purpose of propagation. Certain vectors are useful for amplifying and making large amounts of the 
desired DNA sequence. Other vectors are suitable for expression in cells in culture. Still other vectors 
are suitable for transfer and expression in cells in a whole animal or person. The choice of appropriate 
vector is well within the skill of the art. Many such vectors are available commercially. Methods for 
preparation of vectors comprising a desired sequence are well known in the art. 

The polynucleotides set forth in SEQ ID NOS:1-6010 or their corresponding full-length 
polynucleotides are linked to regulatory sequences as appropriate to obtain the desired expression 
properties. These can include promoters (attached either at the 5' end of the sense strand or at the 3' 
end of the antisense strand), enhancers, terminators, operators, repressors, and inducers. The 
promoters can be regulated or constitutive. In some situations it may be desirable to use conditionally 
active promoters, such as tissue-specific or developmental stage-specific promoters. These are linked 
to the desired nucleotide sequence using the techniques described above for linkage to vectors. Any 
techniques known in the art can be used. 

When any of the above host cells, or other appropriate host cells or organisms, are used to 
replicate and/or express the polynucleotides or nucleic acids of the invention, the resulting replicated 
nucleic acid, RNA, expressed protein or polypeptide, is within the scope of the invention as a product 
of the host cell or organism. The product is recovered by any appropriate means known in the art. 

Once the gene corresponding to a selected polynucleotide is identified, its expression can be 
regulated in the cell to which the gene is native. For example, an endogenous gene of a cell can be 
regulated by an exogenous regulatory sequence as disclosed in USPN 5,641,670. 
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Identification of Functional and Structural Motifs 

Translations of the nucleotide sequence of the provided polynucleotides, cDNAs or full genes 
can be aligned with individual known sequences. Similarity with individual sequences can be used to 
determine the activity of the polypeptides encoded by the polynucleotides of the invention. Also, 
5 sequences exhibiting similarity with more than one individual sequence can exhibit activities that are 
characteristic of either or both individual sequences. 

The full length sequences and fragments of the polynucleotide sequences of the nearest 
neighbors can be used as probes and primers to identify and isolate the full length sequence 
corresponding to provided polynucleotides. The nearest neighbors can indicate a tissue or cell type to 
10 be used to construct a library for the full-length sequences corresponding to the provided 
polynucleotides. 

Typically, a selected polynucleotide is translated in all six frames to determine the best 
alignment with the individual sequences. The sequences disclosed herein in the Sequence Listing are in 
a 5' to 3 5 orientation and translation in three frames can be sufficient (with a few specific exceptions as 

15 described in the Examples). These amino acid sequences are referred to, generally, as query sequences, 
which will be aligned with the individual sequences. Databases with individual sequences are 
described in "Computer Methods for Macromolecular Sequence Analysis" Methods in Enzymology 
(1996) 266, Doolittle, Academic Press, Inc., a division of Harcourt Brace & Co., San Diego, 
California, USA. Databases include GenBank, EMBL, and DNA Database of Japan (DDBJ). 

20 Query and individual sequences can be aligned using the methods and computer programs 

described above, and include BLAST 2.0, available over the world wide web at a site supported by the 
National Center for Biotechnology Information, which is supported by the National Library of 
Medicine and the National Institutes of Health. See also Altschul, et al. Nucleic Acids Res. (1997) 
25:3389-3402. Another alignment algorithm is Fasta, available in the Genetics Computing Group 

25 (GCG) package, Madison, Wisconsin, USA, a wholly owned subsidiary of Oxford Molecular Group, 
Inc. Other techniques for alignment are described in Doolittle, supra. Preferably, an alignment 
program that permits gaps in the sequence is utilized to align the sequences. The Smith-Waterman is 
one type of algorithm that permits gaps in sequence alignments. See Meth. MoL Biol (1997) 70: 173- 
187. Also, the GAP program using the Needleman and Wunsch alignment method can be utilized to 

30 align sequences. An alternative search strategy uses MPSRCH software, which runs on a MASPAR 
computer. MPSRCH uses a Smith-Waterman algorithm to score sequences on a massively parallel 
computer. This approach improves ability to identify sequences that are distantly related matches, and 
is especially tolerant of small gaps and nucleotide sequence errors. Amino acid sequences encoded by 
the provided polynucleotides can be used to search both protein and DNA databases. Incorporated 
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herein by reference are all sequences that have been made public as of the filing date of this application 
by any of the DNA or protein sequence databases, including the patent databases (e.g., GeneSeq). 
Also incorporated by reference are those sequences that have been submitted to these databases as of 
the filing date of the present application but not made public until after the filing date of the present 
5 application 

Results of individual and query sequence alignments can be divided into three categories: high 
similarity, weak similarity, and no similarity. Individual alignment results ranging from high similarity 
to weak similarity provide a basis for determining polypeptide activity and/or structure. Parameters for 
categorizing individual results include: percentage of the alignment region length where the strongest 
1 0 alignment is found, percent sequence identity, and p value. The percentage of the alignment region 
length is calculated by counting the number of residues of the individual sequence found in the region 
of strongest ali gnme nt, e.g., contiguous region of the individual sequence that contains the greatest 
number of residues that are identical to the residues of the corresponding region of the aligned query 
sequence. This number is divided by the total residue length of the query sequence to calculate a 
15 percentage. For example, a query sequence of 20 amino acid residues might be aligned with a 20 

amino acid region of an individual sequence. The individual sequence might be identical to amino acid 
residues 5, 9-15, and 17-19 of the query sequence. The region of strongest alignment is thus the region 
stretching from residue 9-19, an 1 1 amino acid stretch. The percentage of the alignment region length 
is: 1 1 (length of the region of strongest alignment) divided by (query sequence length) 20 or 55%. 
20 Percent sequence identity is calculated by counting the number of amino acid matches between 

the query and individual sequence and dividing total number of matches by the number of residues of 
the individual sequences found in the region of strongest alignment. Thus, the percent identity in the 
example above would be 10 matches divided by 1 1 amino acids, or approximately, 90.9% 

P value is the probability that the alignment was produced by chance. For a single alignment, 
25 the p value can be calculated according to Karlin et al., Proc. Natl. Acad. Sci. (1990) 87:2264 and 
Karlin et al., Proc. Natl. Acad. Sci. (1993) 90. The p value of multiple alignments using the same 
query sequence can be calculated using an heuristic approach described in Altschul et al., Nat. Genet. 
(1994) 6: 1 19. Alignment programs such as BLAST program can calculate the p value. See also 
Altschul et al., Nucleic Acids Res. (1997) 25:3389-3402. 
30 Another factor to consider for determining identity or similarity is the location of the similarity 

or identity. Strong local alignment can indicate similarity even if the length of alignment is short. 
Sequence identity scattered throughout the length of the query sequence also can indicate a similarity 
between the query and profile sequences. The boundaries of the region where the sequences align can 
be determined according to Doohtde, supra; BLAST 2.0 (see, e.g., Altschul, et al. Nucleic Acids Res. 
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(1997) 25:3389-3402) or FAST programs; or by determining the area where sequence identity is 
highest. 

High Similarity. In general, in alignment results considered to be of high similarity, the 
percent of the alignment region length is typically at least about 55% of total length query sequence; 

5 more typically, at least about 58%; even more typically; at least about 60% of the total residue length 
of the query sequence. Usually, percent length of the alignment region can be as much as about 62%; 
more usually, as much as about 64%; even more usually, as much as about 66%. Further, for high 
similarity, the region of alignment, typically, exhibits at least about 75% of sequence identity; more 
typically, at least about 78%; even more typically; at least about 80% sequence identity. Usually, 

10 percent sequence identity can be as much as about 82%; more usually, as much as about 84%; even 
more usually, as much as about 86%. 

The p value is used in conjunction with these methods. If high similarity is found, the query 
sequence is considered to have high similarity with a profile sequence when the p value is less than or 
equal to about 10e-2; more usually; less than or equal to about 10e-3; even more usually; less than or 

15 equal to about 10e-4. More typically, the p value is no more than about 10e-5; more typically; no more 
than or equal to about 10e-10; even more typically, no more than or equal to about 10e-15 for the 
query sequence to be considered high similarity. 

Weak Similarity. In general, where alignment results considered to be of weak similarity, there 
is no minimum percent length of the alignment region nor minimum length of alignment A better 

20 showing of weak similarity is considered when the region of alignment is, typically, at least about 15 
amino acid residues in length; more typically, at least about 20; even more typically, at least about 25 
amino acid residues in length. Usually, length of the alignment region can be as much as about 30 
amino acid residues; more usually, as much as about 40; even more usually, as much as about 60 
amino acid residues. Further, for weak similarity, the region of alignment, typically, exhibits at least 

25 about 35% of sequence identity; more typically, atleast about 40%; even more typically, at least about 
45% sequence identity. Usually, percent sequence identity can be as much as about 50%; more 
usually, as much as about 55%; even more usually, as much as about 60%. 

If low similarity is found, the query sequence is considered to have weak similarity with a 
profile sequence when the p value is usually less than or equal to about 10e-2; more usually, less than 

30 or equal to about 10e-3; even more usually; less than or equal to about 10e-4. More typically, the p 
value is no more than about 10e-5; more usually; no more than or equal to about 10e-10; even more 
usually, no more than or equal to about 10e-15 for the query sequence to be considered weak similarity. 

Similarity Determined bv Sequence Identity Alone . Sequence identity alone can be used to 
determine similarity of a query sequence to an individual sequence and can indicate the activity of the 
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sequence. Such an alignment, preferably, permits gaps to align sequences. Typically, the query 
sequence is related to the profile sequence if the sequence identity over the entire query sequence is at 
least about 15%; more typically, at least about 20%; even more typically, at least about 25%; even 
more typically, at least about 50%. Sequence identity alone as a measure of similarity is most useful 
5 when the query sequence is usually, at least 80 residues in length; more usually, at least 90 residues in 
length; even more usually, at least 95 amino acid residues in length. More typically, similarity can be 
concluded based on sequence identity alone when the query sequence is preferably 100 residues in 
length; more preferably, 120 residues in length; even more preferably, 150 amino acid residues in 
length. 

10 Alipnments with Profile and Mnltinle Aliened Sequences. Translations of the provided 

polynucleotides can be aligned with amino acid profiles that define either protein families or common 
motifs. Also, translations of the provided polynucleotides can be aligned to multiple sequence 
alignments (MSA) comprising the polypeptide sequences of members of protein families or motifs. 
Similarity or identity with profile sequences or MSAs can be used to determine the activity of the gene 
15 products (e.g., polypeptides) encoded by the provided polynucleotides or corresponding cDNA or 
genes. For example, sequences that show an identity or sirnilarity with a chemokinc profile or MSA 
can exhibit chemokine activities. 

Profiles can be designed manually by (1) creating an MSA, which is an alignment of the amino 
acid sequence of members that belong to the family and (2) constructing a statistical representation of 
the alignment Such methods are described, for example, in Bimey et al., Nucl. Acid Res. (1996) 
24(14): 2730-2739. MSAs of some protein families and motifs are publicly available. For example, 
the Genome Sequencing Center at thw Washington University School of Medicine provides a web set 
(Pfam) which provides MSAs of 547 different families and motifs. These MSAs are described also in 
Sonnhammer et al., Proteins (1997) 28: 405-420. Other sources over the world wide web include the 
25 site supported by the European Molecular Biology Laboratories in Heidelberg, Germany. A brief 

description of these MSAs is reported in Pascarella et al., Prot. Eng. (1996) 9(3):249-251. Techniques 
for building profiles from MSAs are described in Sonnhammer et al, supra; Bimey et al., supra; and 
"Computer Methods for Macro-molecular Sequence Analysis," Methods in Enzymology (1996) 266, 
Doolittle, Academic Press, Inc., San Diego, California, USA. 
30 Similarity between a query sequence and a protein family or motif can be determined by (a) 

comparing the query sequence against the profile and/or (b) ahgning the query sequence with the 
members of the family or motif. Typically, a program such as Searchwise is used to compare the query 
sequence to the statistical representation of the multiple alignment, also known as a profile (see Bimey 
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et al., supra). Other techniques to compare the sequence and profile are described in Sonnhammer et 
al., supra and Doolittle, supra. 

Next, methods described by Feng et al., J. Mol. Evol (1987) 25:351 and Higgins et al., 
CABIOS (1989) 5:15 1 can be used align the query sequence with the members of a family or motif, 

5 also known as a MSA. Sequence alignments can be generated using any of a variety of software tools. 
Examples include PileUp, which creates a multiple sequence alignment, and is described in Feng et al., 
J. Mol. Evol. (1987) 25:351. Another method, GAP, uses the alignment method of Needleman et al., J. 
Mol. Biol. (1970) 48:443. GAP is best suited for global alignment of sequences. A third method, 
BestFit, functions by inserting gaps to maximize the number of matches using the local homology 

10 algorithm of Smith et al., Adv. Appl. Math. (198 1) 2:482. In general, the following factors are used to 
determine if a similarity between a query sequence and a profile or MSA exists: (1) number of 
conserved residues found in the query sequence, (2) percentage of conserved residues found in the 
query sequence, (3) number of frameshifts, and (4) spacing between conserved residues. 

Some alignment programs that both translate and align sequences can make any number of 

15 frameshifts when translating the nucleotide sequence to produce the best alignment. The fewer 

frameshifts needed to produce an alignment, the stronger the similarity or identity between the query 
and profile or MSAs. For example, a weak similarity resulting from no frameshifts can be a better 
indication of activity or structure of a query sequence, than a strong similarity resulting from two 
frameshifts. Preferably, three or fewer frameshifts are found in an alignment; more preferably two or 

20 fewer frameshifts; even more preferably, one or fewer frameshifts; even more preferably, no 
frameshifts are found in an alignment of query and profile or MSAs. 

Conserved residues are those amino acids found at a particular position in all or some of the 
family or motif members. Alternatively, a position is considered conserved if only a certain class of 
amino acids is found in a particular position in all or some of the family members. For example, the N- 

25 terminal position can contain a positively charged amino acid, such as lysine, arginine, or histidine. 

Typically, a residue of a polypeptide is conserved when a class of amino acids or a single 
amino acid is found at a particular position in at least about 40% of all class members; more typically, 
at least about 50%; even more typically, at least about 60% of the members. Usually, a residue is 
conserved when a class or single amino acid is found in at least about 70% of the members of a family 

30 or motif; more usually, at least about 80%; even more usually, at least about 90%; even more usually, 
at least about 95%. 

A residue is considered conserved when three unrelated amino acids are found at a particular 
position in some or all of the members; more usually, two unrelated amino acids. These residues are 
conserved when the unrelated amino acids are found at particular positions in at least about 40% of all 
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class member; more typically, at least about 50%; even more typically, at least about 60% of the 
members. Usually, a residue is conserved when a class or single amino acid is found in at least about 
70% of the members of a family or motif; more usually, at least about 80%; even more usually, at least 
about 90%; even more usually, at least about 95%. 

A query sequence has similarity to a profile or MSA when the query sequence comprises at 
least about 25% of the conserved residues of the profile or MSA; more usually, at least about 30%; 
even more usually; at least about 40%. Typically, the query sequence has a stronger similarity to a 
profile sequence or MSA when the query sequence comprises at least about 45% of the conserved 
residues of the profile or MSA; more typically, at least about 50%; even more typically, at least about 
55%. 

Identification of Secreted & Membrane-Bound Polypeptides. Both secreted and membrane- 
bound polypeptides of the present invention are of particular interest. For example, levels of secreted 
polypeptides can be assayed in body fluids that are convenient, such as blood, plasma, serum, and other 
body fluids such as urine, prostatic fluid and semen. Membrane-bound polypeptides are useful for 
constructing vaccine antigens or inducing an immune response. Such antigens would comprise all or 
part of the extracellular region of the membrane-bound polypeptides. Because both secreted and 
membrane-bound polypeptides comprise a fragment of contiguous hydrophobic amino acids, 
hydrophobicity predicting algorithms can be used to identify such polypeptides. 

A signal sequence is usually encoded by both secreted and membrane-bound polypeptide genes 
to direct a polypeptide to the surface of the cell. The signal sequence usually comprises a stretch of 
hydrophobic residues. Such signal sequences can fold into helical structures. Membrane-bound 
polypeptides typically comprise at least one transmembrane region that possesses a stretch of 
hydrophobic amino acids that can transverse the membrane. Some transmembrane regions also exhibit 
a helical structure. Hydrophobic fragments within a polypeptide can be identified by using computer 
algorithms. Such algorithms include Hopp & Woods, Proc. Natl. Acad. Sci. USA (1981) 78:3824- 
3828; Kyte & Doolittle, J. Mol. Biol. (1982) 157: 105-132; and RAOAR algorithm, Degli Esposti et 
al., Eur. J. Biochem. (1990) 190: 207-219. 

Another method of identifying secreted and membrane-bound polypeptides is to translate the 
polynucleotides of the invention in all six frames and determine if at least 8 contiguous hydrophobic 
amino acids are present. Those translated polypeptides with at least 8; more typically, 10; even more 
typically, 12 contiguous hydrophobic amino acids are considered to be either a putative secreted or 
membrane bound polypeptide. Hydrophobic amino acids include alanine, glycine, histidine, isoleucine, 
leucine, lysine, methionine, phenylalanine, proline, threonine, tryptophan, tyrosine, and valine 
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Identification of the Function of an Expression Product of a Full-Le ngth Gene 

Ribozymes, antisense constructs, and dominant negative mutants can be used to determine 
function of the expression product of a gene corresponding to a polynucleotide provided herein. These 
methods and compositions are particularly useful where the provided novel polynucleotide exhibits no 
5 significant or substantial homology to a sequence encoding a gene of known function. Antisense 
molecules and ribozymes can be constructed from synthetic polynucleotides. Typically, the 
phosphoramidite method of oligonucleotide synthesis is used. See Beaucage et al., Tet. Lett. (1981) 
22:1859 andLISPN 4,668,777. Automated devices for synthesis are available to create 
oligonucleotides using this chemistry. Examples of such devices include Biosearch 8600, Models 392 
10 and 394 by Applied Biosystems, a division of Perkm-Elmer Corp., Foster City, California, USA; and 
Expedite by Perceptive Biosystems, Framingham, Massachusetts, USA. Synthetic RNA, phosphate 
analog oligonucleotides, and chemically derivatized oligonucleotides can also be produced, and can be 
covalently attached to other molecules. RNA oligonucleotides can be synthesized, for example, using 
RNA phosphoramidites. This method can be performed on an automated synthesizer, such as Applied 
1 5 Biosystems, Models 392 and 394, Foster City, California, USA. 

Phosphorothioate oligonucleotides can also be synthesized for antisense construction. A 
sulfurizing reagent, such as tetraethylthiruam disulfide (TETD) in acetonitrile can be used to convert 
the internucleotide cyanoethyl phosphite to the phosphorothioate triester within 15 minutes at room 
temperature. TETD replaces the iodine reagent, while all other reagents used for standard 
20 phosphoramidite chemistry remain the same. Such a synthesis method can be automated using Models 
392 and 394 by Applied Biosystems, for example. 

Oligonucleotides of up to 200 nt can be synthesized, more typically, 100 nt; more typically 50 
nt; even more typically, 30 to 40 nt. These synthetic fragments can be annealed and ligated together to 
construct larger fragments. See, for example, Sambrook et al., supra. Trans-cleaving catalytic RNAs 
25 (ribozymes) are RNA molecules possessing endoribonuclease activity. Ribozymes are specifically 
designed for a particular target, and the target message must contain a specific nucleotide sequence. 
They are engineered to cleave any RNA species site-specifically in the background of cellular RNA. 
The cleavage event renders the mRNA unstable and prevents protein expression. Importantly, 
ribozymes can be used to inhibit expression of a gene of unknown function for the purpose of 
30 determining its function in an in vitro or in vivo context, by detecting the phenotypic effect. One 

commonly used ribozyme motif is the hammerhead, for which the substrate sequence requirements are 
minimal. Design of the hammerhead ribozyme, as well as therapeutic uses of ribozymes, are disclosed 
in Usman et al., Current Opin. Struct. Biol. (1996) 6:527. Methods for production of ribozymes, 
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including hairpin structure ribozyme fragments, methods of increasing ribozyme specificity, and the 
like are known in the art. 

The hybridizing region of the ribozyme can be modified or can be prepared as a branched 
structure as described in Horn and Urdea, Nucleic Acids Res. (1989) 17:6959. The basic structure of 
the ribozymes can also be chemically altered in ways familiar to those skilled in the art, and chemically 
synthesized ribozymes can be administered as synthetic oligonucleotide derivatives modified by 
monomeric units. In a therapeutic context, liposome mediated delivery of ribozymes improves cellular 
uptake, as described in Birikh et al., Eur. J. Biochem. (1997) 245:1. 

Antisense nucleic acids are designed to specifically bind to RNA, resulting in the formation of 
RNA-DNA or RNA-RNA hybrids, with an arrest of DNA replication, reverse transcription or 
messenger RNA translation Antisense polynucleotides based on a selected polynucleotide sequence 
can interfere with expression of the corresponding gene. Antisense polynucleotides are typically 
generated within the cell by expression from antisense constructs that contain the antisense strand as 
the transcribed strand. Antisense polynucleotides based on the disclosed polynucleotides will bind 
and/or interfere with the translation of mRNA comprising a sequence complementary to the antisense 
polynucleotide. The expression products of control cells and cells treated with the antisense construct 
are compared to detect the protein product of the gene corresponding to the polynucleotide upon which 
the antisense construct is based. The protein is isolated and identified using routine biochemical 
methods. 

Given the extensive background literature and clinical experience in antisense therapy, one 
skilled in the art can use selected polynucleotides of the invention as additional potential therapeutics. 
The choice of polynucleotide can be narrowed by first testing them for binding to "hot spot" regions of 
the genome of cancerous cells. If a polynucleotide is identified as binding to a "hot spot," testing the 
polynucleotide as an antisense compound in the corresponding cancer cells is warranted. 

As an alternative method for identifying function of the gene corresponding to a polynucleotide 
disclosed herein, dominant negative mutations are readily generated for corresponding proteins that are 
active as homomultimers. A mutant polypeptide will interact with wild-type polypeptides (made from 
the other allele) and form a non-functional multimer. Thus, a mutation is in a substrate-binding 
domain, a catalytic domain, or a cellular localization domain Preferably, the mutant polypeptide will 
be overproduced. Point mutations are made that have such an effect In addition, fusion of different 
polypeptides of various lengths to the terminus of a protein can yield dominant negative mutants. 
General strategies are available for making dominant negative mutants (see, e.g., Herskowitz, Nature 
(1987) 329:219). Such techniques can be used to create loss of function mutations, which are useful 
for determining protein function. 
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Polypeptides and Variants Thereof 

The polypeptides of the invention include those encoded by the disclosed polynucleotides, as 
well as nucleic acids that, by virtue of the degeneracy of the genetic code, are not identical in sequence 
to the disclosed polynucleotides. Thus, the invention includes within its scope a polypeptide encoded 

5 by a polynucleotide having the sequence of any one of SEQ ID NOS: 1-6010 or a variant thereof 
In general, the term "polypeptide" as used herein refers to both the full length polypeptide 
encoded by the recited polynucleotide, the polypeptide encoded by the gene represented by the recited 
polynucleotide, as well as portions or fragments thereof "Polypeptides" also includes variants of the 
naturally occurring proteins, where such variants are homologous or substantially similar to the 

1 0 naturally occurring protein, and can be of an origin of the same or different species as the naturally 
occurring protein (e.g., human, murine, or some other species that naturally expresses the recited 
polypeptide, usually a mammalian species). In general, variant polypeptides have a sequence that has 
at least about 80%, usually at least about 90%, and more usually at least about 98% sequence identity 
with a differentially expressed polypeptide of the invention, as measured by BLAST 2.0 using the 

1 5 parameters described above. The variant polypeptides can be naturally or non-naturally glycosylated, 
i.e., the polypeptide has a glycosylation pattern that differs from the glycosylation pattern found in the 
corresponding naturally occurring protein. 

The invention also.encompasses homologs of the disclosed polypeptides (or fragments thereof) 
where the homologs are isolated from other species, i.e. other animal or plant species, where such 

20 homologs, usually mammalian species, e.g. rodents, such as mice, rats; domestic animals, e.g., horse, 
cow, dog, cat; and humans. By "homolog" is meant a polypeptide having at least about 35%, usually 
at least about 40% and more usually at least about 60% amino acid sequence identity to a particular 
differentially expressed protein as identified above, where sequence identity is determined using the 
BLAST 2.0 algorithm, with the parameters described supra. 

25 In general, the polypeptides of the subject invention are provided in a non-naturally occurring 

environment, e.g. are separated from their naturally occurring environment. In certain embodiments, 
the subject protein is present in a composition that is enriched for the protein as compared to a control. 
As such, purified polypeptide is provided, where by purified is meant that the protein is present in a 
composition that is substantially free of non-differentially expressed polypeptides, where by 

30 substantially free is meant that less than 90%, usually less than 60% and more usually less than 50% of 
the composition is made up of non-differentially expressed polypeptides. 

Also within the scope of the invention are variants; variants of polypeptides include mutants, 
fragments, and fusions. Mutants can include amino acid substitutions, additions or deletions. The 
amino acid substitutions can be conservative amino acid substitutions or substitutions to eliminate non- 
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essential amino acids, such as to alter a glycosylation site, a phosphorylation site or an acetylation site, 
or to minimize misfolding by substitution or deletion of one or more cysteine residues that are not 
necessary for function. Conservative amino acid substitutions are those that preserve the general 
charge, hydrophobicity/ hydrophilicity, and/or steric bulk of the amino acid substituted. Variants can 

5 be designed so as to retain or have enhanced biological activity of a particular region of the protein 
(e.g., a functional domain and/or, where the polypeptide is a member of a protein family, a region 
associated with a consensus sequence). Selection of amino acid alterations for production of variants 
can be based upon the accessibility (interior vs. exterior) of the amino acid (see, e.g., Go et al, Int. J. 
Peptide Protein Res. (1980) 15:21 1), the thermostability of the variant polypeptide (see, e.g., Querol et 

10 al., Prot Eng. (1996) 9:265), desired glycosylation sites (see, e.g., Olsen and Thomsen, J. Gen. 

Microbiol. (1991) 137:579), desired disulfide bridges (see, e.g., Clarke et al., Biochemistry (1993) 
32:4322; and Wakarchuk et al., Protein Eng. (1994) 7: 1379), desired metal binding sites (see, e.g., 
Toma et al., Biochemistry (1991) 30:97, and Haezerbrouck et al., Protein Eng. (1993) 6:643), and 
desired substitutions within proline loops (see, e.g., Masul et al., Appl. Env. Microbiol. (1994) 

15 60:3579). Cysteine-depleted muteins can be produced as disclosed in USPN 4,959,3 14. 

Variants also include fragments of the polypeptides disclosed herein, particularly haptens, 
biologically active fragments, and/or fragments corresponding to functional domains. Fragments of 
interest will typically be at least about 10 aa to at least about 15 aa in length, usually at least about 50 
aa in length, and can be as long as 300 aa in length or longer, but will usually not exceed about 1000 aa 

20 in length, where the fragment will have a stretch of amino acids that is identical to a polypeptide 

encoded by a polynucleotide having a sequence of any SEQ ID NOS:1-6010, or a homolog thereof. 
The protein variants described herein are encoded by polynucleotides that are within the scope of the 
invention. The genetic code can be used to select the appropriate codons to construct the 
corresponding variants. 

25 Computer-Related Embodiments 

In general, a library of polynucleotides is a collection of sequence information, which 
information is provided in either biochemical form (e.g., as a collection of polynucleotide molecules), 
or in electronic form (e.g., as a collection of polynucleotide sequences stored in a computer-readable 
form, as in a computer system and/or as part of a computer program). The sequence information of the 

30 polynucleotides can be used in a variety of ways, e.g., as a resource for gene discovery, as a 

representation of sequences expressed in a selected cell type (e.g., cell type markers), and/or as markers 
of a given disease or disease state. In general, a disease marker is a representation of a gene product 
that is present in all cells affected by disease either at an increased or decreased level relative to a 
normal cell (e.g., a cell of the same or similar type that is not substantially affected by disease). For 
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example, a polynucleotide sequence in a library can be a polynucleotide that represents an mKN A, 
polypeptide, or other gene product encoded by the polynucleotide, that is either overexpressed or 
underexpressed in a breast ductal cell affected by cancer relative to a normal (i.e., substantially disease- 
free) breast cell. 

5 The nucleotide sequence information of the library can be embodied in any suitable form, e.g., 

electronic or biochemical forms. For example, a library of sequence information embodied in 
electronic form comprises an accessible computer data file (or, in biochemical form, a collection of 
nucleic acid molecules) that contains the representative nucleotide sequences of genes that are 
differentially expressed (e.g., overexpressed or underexpressed) as between, for example, i) a 

10 cancerous cell and a normal cell; ii) a cancerous cell and a dysplastic cell; iii) a cancerous cell and a cell 
affected by a disease or condition other than cancer; iv) a metastatic cancerous cell and a normal cell 
and/or non-metastatic cancerous cell; v) a malignant cancerous cell and a non-malignant cancerous cell 
(or a normal cell) and/or vi) a dysplastic cell relative to a normal cell. Other combinations and 
comparisons of cells affected by various diseases or stages of disease will be readily apparent to the 

15 ordinarily skilled artisan Biochemical embodiments of the library include a collection of nucleic acids 
that have the sequences of the genes in the library, where the nucleic acids can correspond to the entire 
gene in the library or to a fragment thereof, as described in greater detail below. 

The polynucleotide libraries of the subject invention generally comprise sequence information 
of a plurality of polynucleotide sequences, where at least one of the polynucleotides has a sequence of 

20 any of SEQ ID NOS: 1-60 10. By plurality is meant at least 2, usually at least 3 and can include up to 
all of SEQ ID NOS: 1-6010. The length and number of polynucleotides in the library will vary with 
the nature of the library, e.g., if the library is an oligonucleotide array, a cDNA array, a computer 
database of the sequence information, etc. 

Where the library is an electronic library, the nucleic acid sequence information can be present 

25 in a variety of media. "Media" refers to a manufacture, other than an isolated nucleic acid molecule, 
that contains the sequence information of the present invention. Such a manufacture provides the 
genome sequence or a subset thereof in a form that can be examined by means not directly applicable 
to the sequence as it exists in a nucleic acid. For example, the nucleotide sequence of the present 
invention, e.g. the nucleic acid sequences of any of the polynucleotides of SEQ ID NOS : 1 -60 1 0, can 

30 be recorded on computer readable media, e.g. any medium that can be read and accessed directly by a 
computer. Such media include, but are not limited to: magnetic storage media, such as a floppy disc, a 
hard disc storage medium, and a magnetic tape; optical storage media such as CD-ROM; electrical 
storage media such as RAM and ROM; and hybrids of these categories such as magnetic/optical 
storage media. One of skill in the art can readily appreciate how any of the presently known computer 
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readable mediums can be used to create a manufacture comprising a recording of the present sequence 
information. "Recorded" refers to a process for storing information on computer readable medium, 
using any such methods as known in the art. Any convenient data storage structure can be chosen, 
based on the means used to access the stored information. A variety of data processor programs and 

5 formats can be used for storage, e.g. word processing text file, database format, etc. In addition to the 
sequence information, electronic versions of the libraries of the invention can be provided in 
conjunction or connection with other computer-readable information and/or other types of computer- 
readable files (e.g., searchable files, executable files, etc, including, but not limited to, for example, 
search program software, etc.). 

10 By providing the nucleotide sequence in computer readable form, the information can be 

accessed for a variety of purposes. Computer software to access sequence information is publicly 
available. For example, the gapped BLAST (Altschul et aL Nucleic Acids Res. (1997) 25:3389-3402) 
and BLAZE (Brutlag et al. Comp. Chem. (1993) 17:203) search algorithms on a Sybase system can be 
used to identify open reading frames (ORFs) within the genome that contain homology to ORFs from 

15 other organisms. 

As used herein, "a computer-based system" refers to the hardware means, software means, and 
data storage means used to analyze the nucleotide sequence information of the present invention. The 
minimu m hardware of the computer-based systems of the present invention comprises a central 
processing unit (CPU), input means, output means, and data storage means. A skilled artisan can 
20 readily appreciate that any one of the currently available computer-based system are suitable for use in 
the present invention. The data storage means can comprise any manufacture comprising a recording 
of the present sequence information as described above, or a memory access means that can access 
such a manufacture. 

"Search means" refers to one or more programs implemented on the computer-based system, 
25 to compare a target sequence or target structural motif, or expression levels of a polynucleotide in a 
sample, with the stored sequence information. Search means can be used to identify fragments or 
regions of the genome that match a particular target sequence or target motif. A variety of known 
algorithms are publicly known and commercially available, e.g. MacPattern (EMBL), BLASTN and 
BLASTX (NCBI). A "target sequence" can be any polynucleotide or amino acid sequence of six or 
30 more contiguous nucleotides or two or more amino acids, preferably from about 10 to 100 amino acids 
or from about 30 to 300 nt A variety of comparing means can be used to accomplish comparison of 
sequence information from a sample (e.g., to analyze target sequences, target motifs, or relative 
expression levels) with the data storage means. A skilled artisan can readily recognize that any one of 
the publicly available homology search programs can be used as the search means for the computer 
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based systems of the present invention to accomplish comparison of target sequences and motifs. 
Computer programs to analyze expression levels in a sample and in controls are also known in the art. 

A "target structural motif," or "target motif," refers to any rationally selected sequence or 
combination of sequences in which the sequence(s) are chosen based on a three-dimensional 
5 configuration that is formed upon the folding of the target motif, or on consensus sequences of 

regulatory or active sites. There are a variety of target motifs known in the art. Protein target motifs 
include, but arc not limited to, enzyme active sites and signal sequences. Nucleic acid target motifs 
include, but are not limited to, hairpin structures, promoter sequences and other expression elements 
such as binding sites for transcription factors. 

10 A variety of structural formats for the input and output means can be used to input and output 

the information in the computer-based systems of the present invention. One format for an output 
means ranks the relative expression levels of different polynucleotides. Such presentation provides a 
skilled artisan with a ranking of relative expression levels to determine a gene expression profile. 

As discussed above, the "library" of the invention also encompasses biochemical libraries of 

15 the polynucleotides of SEQ ID NOS: 1-6010 , e.g., collections of nucleic acids representing the 

provided polynucleotides. The biochemical libraries can take a variety of forms, e.g., a solution of 
cDNAs, a pattern of probe nucleic acids stably associated with a surface of a solid support (i.e., an 
array) and the like. Of particular interest are nucleic acid arrays in which one or more of SEQ ID 
NOS: 1-6010 is represented on the array. By array is meant a an article of manufacture that has at least 

20 a substrate with at least two distinct nucleic acid targets on one of its surfaces, where the number of 
distinct nucleic acids can be considerably higher, typically being at least 10, usually at least 20, and 
often at least 25 distinct nucleic acid molecules. A variety of different array formats have been 
developed and are known to those of skill in the art. The arrays of the subject invention find use in a 
variety of applications, including gene expression analysis, drug screening, mutation analysis and the 

25 like, as disclosed in the above-listed exemplary patent documents. 

In addition to the above nucleic acid libraries, analogous libraries of polypeptides are also 
provided, where the polypeptides of the library will represent at least a portion of the polypeptides 
encoded by a gene corresponding to one or more of SEQ ID NOS : 1-60 1 0. 
Utilities 

30 The polynucleotides of the invention are useful in a variety of applications. Exemplary utilies 

of the polynucleotides of the invention are described below. 

Construction of Larger Molecules: Recombinant DNAs and Nucleic Ac id Multimers. In one 
embodiment of particular interest, the polynucleotides described herein as useful as the building blocks 
for larger molecules. In one example, the polynucleotide is a component of a larger cDNA molecule 
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which in turn can be adapted for expression in a host cell (e.g., a bacterial or eukaryotic (e.g., yeast or 
mammalian) host cell). The cDNA can include, in addition to the polypeptide encoded by the starting 
material polynucleotide (i.e., a polynucleotide described herein), an amino acid sequence that is 
heterologous to the polypeptide encoded by the polynucleotide described herein (e.g., as in a sequence 

5 encoding a fusion protein). In some embodiments, the polynucleotides described herein is used as 
starting material polynucleotide for synthesizing all or a portion of the gene to which the described 
polynucleotide corresponds. For example, a DNA molecule encoding a full-length human polypeptide 
can be constructed using a polynucleotide described herein as starting material. 

In another embodiment, the polynucleotides of the invention are used in nucleic acid multimers 

10 . Nucleic acid multimers can be linear or branched polymers of the same repeating single-stranded 
oligonucleotide unit or different single-stranded oligonucleotide units. Where the molecules are 
branched, the multimers are generally described as either "fork" or "comb" structures. The 
oligonucleotide units of the multimer may be composed of RNA, DNA, modified nucleotides or 
combinations thereof. At least one of the units has a sequence, length, and composition that permits it 

15 to bind specifically to a first single-stranded nucleotide sequence of interest, typically analyte or an 
oligonucleotide bound to the analyte. In order to achieve such specificity and stability, this unit will 
normally be 15 to 50 nt, preferably 15 to 30 nt, in length and have a GC content in the range of 40% to 
60%. In addition to such unit(s), the multimer includes a multiplicity of units that are capable of 
hybridizing specifically and stably to a second single-stranded nucleotide of interest, typically a labeled 

20 oligonucleotide or another multimer. These units will also normally be 15 to 50 nt, preferably 15 to 30 
nt, in length and have a GC content in the range of 40% to 60%. When a multimer is designed to be 
hybridized to another multimer, the first and second oligonucleotide units are heterogeneous (different). 
One or more of the polynucleotides described herein, or a portion of a polynucleotide described herein, 
can be used as a repeating unit of such nucleic acid multimers. 

25 The total number of oligonucleotide units in the multimer will usually be in the range of 3 to 

50, more usually 10 to 20. In multimers in which the unit that hybridizes to the nucleotide sequence of 
interest is different from the unit that hybridizes to the labeled oligonucleotide, the number ratio of the 
latter to the former will usually be 2:1 to 30:1, more usually 5:1 to 20:1, and-preferably 10:1 to 15:1. 
The oligonucleotide units of the multimer may be covalently linked directly to each other 

30 through phosphodiester bonds or through interposed linking agents such as nucleic acid, amino acid, 
carbohydrate or polyol bridges, or through other cross-linking agents that are capable of cross-linking 
nucleic acid or modified nucleic acid strands. The site(s) of linkage may be at the ends of the unit (in 
either normal 3,-5* orientation or randomly oriented) and/or at one or more internal nucleotides in the 
strand. In linear multimers the individual units are linked end-to-end to form a linear polymer. In one 
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type of branched multimer three or more oligonucleotide units emanate from a point of origin to form a 
branched structure. The point of origin may be another oligonucleotide unit or a multifunctional 
molecule to which at least three units can be covalently bound. In another type, there is an 
oligonucleotide unit backbone with one or more pendant oligonucleotide units. These latter-type 

5 multimers are "fork-like", "comb-like" or combination "fork-" and "comb-like" in structure. The 
pendant units will normally depend from a modified nucleotide or other organic moiety having 
appropriate functional groups to which oligonucleotides may be conjugated or otherwise attached. The 
multimer may be totally linear, totally branched, or a combination of linear and branched portions. 
Preferably there will be at least two branch points in the multimer, more preferably at least 3, 

10 preferably 5 to 10. The multimer may include one or more segments of double-stranded sequences. 

Multimeric nucleic acid molecules are useful in amplifying the signal that results from 
hybridization of one the first sequence of the multimeric molecule to a target sequence. The 
amplification is theoretically proportional to the number of iterations of the second segment. 

Without being held to theory, forked structures of greater than about eight branches exhibited 

15 steric hindrance which inhibited binding of labeled probes to the multimer. On the other hand, comb 
structures exhibit Utile or no steric problems and are thus a preferred type of branched multimer. For a 
description of branched nucleic acid multimers of both the fork and comb types, as well as methods of 
use and synthesis, see, e.g., U.S. Pat. Nos. 5,124,246 (fork-type structures); 5,710,264 (synthesis of 
comb structures); and 5,849,481. 

20 Use of Polynucleotide Probes in Mapping, and in Tissue Profiling. Polynucleotide probes, 

generally comprising at least 12 contiguous nt of a polynucleotide as shown in the Sequence Listing, 
are used for a variety of purposes, such as chromosome mapping of the polynucleotide and detection of 
transcription levels. Additional disclosure about preferred regions of the disclosed polynucleotide 
sequences is found in the Examples. A probe that hybridizes specifically to a polynucleotide disclosed 

25 herein should provide a detection signal at least 5-, 10-, or 20-fold higher than the background 
hybridization provided with other unrelated sequences. 

Detection of Expression Levels. Nucleotide probes are used to detect expression of a gene 
coiresponding to the provided polynucleotide. In Northern blots, mRNA is separated 
electrophoretically and contacted with a probe. A probe is detected as hybridizing to an mRNA species 

30 of a particular size. The amount of hybridization is quantitated to determine relative amounts of 

expression, for example under a particular condition. Probes are used for in situ hybridization to cells 
to detect expression. Probes can also be used in vivo for diagnostic detection of hybridizing sequences. 
Probes are typically labeled with a radioactive isotope. Other types of detectable labels can be used 
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such as chromophores, fluors, and enzymes. Other examples of nucleotide hybridization assays are 
described in WO92/02526 and USPN 5,124,246. 

Alternatively, the Polymerase Chain Reaction (PCR) is another means for detecting small 
amounts of target nucleic acids (see, e.g., Mullis et al., Meth. Enzymol. (1987) 155:335; USPN 
5 4,683,195; and USPN 4,683,202). Two primer polynucleotides nucleotides that hybridize with the 

target nucleic acids are used to prime the reaction. The primers can be composed of sequence within or 
3 f and 5' to the polynucleotides of the Sequence Listing. Alternatively, if the primers are 3 ! and 5 ! to 
these polynucleotides, they need not hybridize to them or the complements. After amplification of the 
target with a thermostable polymerase, the amplified target nucleic acids can be detected by methods 

1 0 known in the art, e.g., Southern blot. mRNA or cDNA can also be detected by traditional blotting 

techniques (e.g., Southern blot, Northern blot, etc.) described in Sambrook et al., "Molecular Cloning: 
A Laboratory Manual" (New York, Cold Spring Harbor Laboratory, 1989) (e.g., without PCR 
amplification). In general, mRNA or cDNA generated from mRNA using a polymerase enzyme can be 
purified and separated using gel electrophoresis, and transferred to a solid support, such as 

15 nitrocellulose. The solid support is exposed to a labeled probe, washed to remove any unhybridized 
probe, and duplexes containing the labeled probe are detected 

Mapping. Polynucleotides of the present invention can be used to identify a chromosome on 
which the corresponding gene resides. Such mapping can be useful in identifying the fimction of the 
polynucleotide-related gene by its proximity to other genes with known function. Function can also be 

20 assigned to the polynucleotide-related gene when particular syndromes or diseases map to the same 

chromosome. For example, use of polynucleotide probes in identification and quantification of nucleic 
acid sequence aberrations is described in USPN 5,783,387. An exemplary mapping method is 
fluorescence in situ hybridization (FISH), which facilitates comparative genomic hybridization to allow 
total genome assessment of changes in relative copy number of DNA sequences (see, e.g., Valdes et 

25 al., Methods in Molecular Biology ( 1 997) 68:1). Polynucleotides can also be mapped to particular 

chromosomes using, for example, radiation hybrids or chromosome-specific hybrid panels. See Leach 
et al., Advances in Genetics, (1995) 33:63-99; Walter et al., Nature Genetics (1994) 7:22; Walter and 
Goodfellow, Trends in Genetics (1992) 9:352. Panels for radiation hybrid mapping are available from 
Research Genetics, Inc., Huntsville, Alabama, USA. Databases for markers using various panels are 

30 available via the world wide web at sites supported by the Stanford Human Genome Center (Stanford 
University) and the Whitehead Institute for Biomedical Research/MIT Center for Genome Research. 
The statistical program RHMAP can be used to construct a map based on the data from radiation 
hybridization with a measure of the relative likelihood of one order versus another. RHMAP is 
available via the world wide web at a site supported by the University of Michigan. In addition, 
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commercial programs are available for identifying regions of chromosomes commonly associated with 
disease, such as cancer. 

Tissue Typi n g or Profiling, Expression of specific mKNA corresponding to the provided 
polynucleotides can vary in different cell types and can be tissue-specific. This variation of mRNA 
5 levels in different cell types can be exploited with nucleic acid probe assays to determine tissue types. 
For example, PCR, branched DNA probe assays, or blotting techniques utilizing nucleic acid probes 
substantially identical or complementary to polynucleotides listed in the Sequence Listing can 
determine the presence or absence of the corresponding cDNA or mRNA. 

Tissue typing can be used to identify the developmental organ or tissue source of a metastatic 

1 0 lesion by identifying the expression of a particular marker of that organ or tissue. If a polynucleotide is 
expressed only in a specific tissue type, and a metastatic lesion is found to express that polynucleotide, 
then the developmental source of the lesion has been identified. Expression of a particular 
polynucleotide can be assayed by detection of either the corresponding mRNA or the protein product. 
As would be readily apparent to any forensic scientist, the sequences disclosed herein are useful in 

1 5 differentiating human tissue from non-human tissue, hi particular, these sequences are useful to 
differentiate human tissue from bird, reptile, and amphibian tissue, for example. 

Use of Polymorphisms . A polynucleotide of the invention can be used in forensics, genetic 
analysis, mapping, and diagnostic applications where the corresponding region of a gene is 
polymorphic in the human population. Any means for detecting a polymorphism in a gene can be used, 

20 including, but not limited to electrophoresis of protein polymorphic variants, differential sensitivity to 
restriction enzyme cleavage, and hybridization to allele-specific probes. 

Antibody Production. Expression products of a polynucleotide of the invention, as well as the 
corresponding mRNA, cDNA, or complete gene, can be prepared and used for raising antibodies for 
experimental, diagnostic, and therapeutic purposes. For polynucleotides to which a corresponding gene 

25 has not been assigned, this provides an additional method of identifying the corresponding gene. The 
polynucleotide or related cDNA is expressed as described above, and antibodies are prepared. These 
antibodies are specific to an epitope on the polypeptide encoded by the polynucleotide, and can 
precipitate or bind to the corresponding native protein in a cell or tissue preparation or in a cell-free 
extract of an in vitro expression system. 

30 Methods for production of antibodies that specifically bind a selected antigen are well known 

in the art. Immunogens for raising antibodies can be prepared by mixing a polypeptide encoded by a 
polynucleotide of the invention with an adjuvant, and/or by making fusion proteins with larger 
immunogenic proteins. Polypeptides can also be covalently linked to other larger immunogenic 
proteins, such as keyhole limpet hemocyanin. Immunogens are typically administered intradermally, 
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subcutaneously, or intramuscularly to experimental animals such as rabbits, sheep, and mice, to 
generate antibodies. Monoclonal antibodies can be generated by isolating spleen cells and fusing 
myeloma cells to form hybridomas. Alternatively, the selected polynucleotide is administered directly, 
such as by intramuscular injection, and expressed in vivo. The expressed protein generates a variety of 
5 protein-specific immune responses, including production of antibodies, comparable to administration 
of the protein. 

Preparations of polyclonal and monoclonal antibodies specific for polypeptides encoded by a 
selected polynucleotide are made using standard methods known in the art. The antibodies specifically 
bind to epitopes present in the polypeptides encoded by polynucleotides disclosed in the Sequence 

10 Listing. Typically, at least 6, 8, 10, or 12 contiguous amino acids are required to form an epitope. 
Epitopes that involve non-contiguous amino acids may require a longer polypeptide, e.g., at least 15, 
25, or 50 amino acids. Antibodies that specifically bind to human polypeptides encoded by the 
provided polypeptides should provide a detection signal at least 5-, 10-, or 20-fold higher than a 
detection signal provided with other proteins when used in Western blots or other immunochemical 

15 assays. Preferably, antibodies that specifically bind polypeptides contemplated by the invention do not 
bind to other proteins in immunochemical assays at detectable levels and can immunoprecipitate the 
specific polypeptide from solution. 

The invention also contemplates naturally occurring antibodies specific for a polypeptide of 
the invention. For example, serum antibodies to a polypeptide of the invention in a human population 

20 can be purified by methods well known in the art, e.g., by passing antiserum over a column to which 
the corresponding selected polypeptide or fusion protein is bound. The bound antibodies can then be 
eluted from the column, for example, using a buffer with a higji salt concentration. 

In addition to the antibodies discussed above, the invention also contemplates genetically 
engineered antibodies antibodies (e.g., chimeric antibodies, humanized antibodies, human antibodies 

25 produced by a transgenic animal (e.g., a transgenic mouse such as the XenomousTM), antibody 

derivatives (e.g., single chain antibodies, antibody fragments (e.g., Fab, etc.)), according to methods 

well known in the art. 

Polynucleotides or Arrays for Diagnostics. 

Polynucleotide arrays provide a high throughput technique that can assay a large number of 

30 polynucleotides in a sample. This technology can be used as a diagnostic and as tool to test for 
differential expression expression, e.g., to determine function of an encoded protein. A variety of 
methods of producing arrays, as well as variations of these methods, are known in the art and 
contemplated for use in the invention. For example, arrays can be created by spotting polynucleotide 
probes onto a substrate (e.g., glass, nitrocellulose, etc.) in a two-dimensional matrix or array having 
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bound probes. The probes can be bound to the substrate by either covalent bonds or by non-specific 
interactions, such as hydrophobic interactions. Samples of polynucleotides can be detectably labeled 
(e.g., using radioactive or fluorescent labels) and then hybridized to the probes. Double stranded 
polynucleotides, comprising the labeled sample polynucleotides bound to probe polynucleotides, can be 
5 detected once the unbound portion of the sample is washed away. Alternatively, the polynucleotides of 
the test sample can be immobilized on the array, and the probes detectably labeled. Techniques for 
constructing arrays and methods of using these arrays are described in, for example, Schena et al. 
(1996) Proc Natl Acad Sci USA. 93(20): 10614-9; Schena et al. (1995) Science 270(5235):467~70; 
Shalon et al. (1996) Genome Res. 6(7):639-45, USPN 5,807,522, EP 799 897; WO 97/29212; WO 
10 97/273 17; EP 785 280; WO 97/02357; USPN 5,593,839; USPN 5,578,832; EP 728 520; 
USPN 5,599,695; EP 721 016; USPN 5,556,752; WO 95/22058; and USPN 5,631,734. 

Arrays can be used to, for example, examine differential expression of genes and can be used 
to determine gene function. For example, arrays can be used to detect differential expression of a gene 
corresponding to a polynucleotide of the invention, where expression is compared between a test cell 
15 and control cell (e.g., cancer cells and normal cells). For example, high expression of a particular 
message in a cancer cell, which is not observed in a corresponding normal cell, can indicate a cancer 
specific gene product. Exemplary uses of arrays are further described in, for example, Pappalarado et 
al., Sem. Radiation Oncol. (1998) 8:217; and Ramsay Nature Biotechnol. (1998) 16:40. Furthermore, 
many variations on methods of detection using arrays are well within the skill in the art and within the 
20 scope of the present invention. For example, rather than immobilizing the probe to a solid support, the 
test sample can be immobilized on a solid support which is then contacted with the probe. 
Differential Expression in Diagnosis 

The polynucleotides of the invention can also be used to detect differences in expression levels 
between two cells, e.g., as a method to identify abnormal or diseased tissue in a human. For 
25 polynucleotides corresponding to profiles of protein families, the choice of tissue can be selected 

according to the putative biological function. In general, the expression of a gene corresponding to a 
specific polynucleotide is compared between a first tissue that is suspected of being diseased and a 
second, normal tissue of the human. The tissue suspected of being abnormal or diseased can be 
derived from a different tissue type of the human, but preferably it is derived from the same tissue type; 
30 for example, an intestinal polyp or other abnormal growth should be compared with normal intestinal 
tissue. The normal tissue can be the same tissue as that of the test sample, or any normal tissue of the 
patient, especially those that express the polynucleotide-related gene of interest (e.g., brain, thymus, 
testis, heart, prostate, placenta, spleen, small intestine, skeletal muscle, pancreas, and the mucosal 
lining of the colon). A difference between the polynucleotide-related gene, mRNA, or protein in the 
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two tissues which are compared, for example, in molecular weight, amino acid or nucleotide sequence, 
or relative abundance, indicates a change in the gene, or a gene which regulates it, in the tissue of the 
human that was suspected of being diseased. Examples of detection of differential expression and its 
use in diagnosis of cancer are described in USPNs 5,688,641 and 5,677,125. 

A genetic predisposition to disease in a human can also be detected by comparing expression 
levels of an mRNA or protein corresponding to a polynucleotide of the invention in a fetal tissue with 
levels associated in normal fetal tissue. Fetal tissues that are used for this purpose include, but are not 
limited to, amniotic fluid, chorionic villi, blood, and the blastomere of an in vitro-fertilized embryo. 
The comparable normal polynucleotide-related gene is obtained from any tissue. The mRNA or 
protein is obtained from a normal tissue of a human in which the polynucleotide-related gene is 
expressed. Differences such as alterations in the nucleotide sequence or size of the same product of the 
fetal polynucleotide-related gene or mRNA, or alterations in the molecular weight, amino acid 
sequence, or relative abundance of fetal protein, can indicate a germline mutation in the polynucleotide- 
related gene of the fetus, which indicates a genetic predisposition to disease. In general, diagnostic, 
prognostic, and other methods of the invention based on differential expression involve detection of a 
level or amount of a gene product, particularly a differentially expressed gene product, in a test sample 
obtained from a patient suspected of having or being susceptible to a disease (e.g., breast cancer, lung 
cancer, colon cancer and/or metastatic forms thereof), and comparing the detected levels to those levels 
found in normal cells (e.g., cells substantially unaffected by cancer) and/or other control cells (e.g., to 
differentiate a cancerous cell from a cell affected by dysplasia). Furthermore, the severity of the 
disease can be assessed by comparing the detected levels of a differentially expressed gene product 
with those levels detected in samples representing the levels of differentially expressed gene product 
associated with varying degrees of severity of disease. It should be noted that use of the term 
"diagnostic" herein is not necessarily meant to exclude "progm^ 0 " or "prognosis," but rather is used 
as a matter of convenience. 

The term "differentially expressed gene" is generally intended to encompass a polynucleotide 
that can, for example, include an open reading frame encoding a gene product (e.g., a polypeptide), 
and/or introns of such genes and adjacent 5 1 and 3* non-coding nucleotide sequences involved in the 
regulation of expression, up to about 20 kb beyond the coding region, but possibly further in either 
direction. The gene can be introduced into an appropriate vector for extrachromosomal maintenance or 
for integration into a host genome. In general, a difference in expression level associated with a 
decrease in expression level of at least about 25%, usually at least about 50% to 75%, more usually at 
least about 90% or more is indicative of a differentially expressed gene of interest, i.e., a gene that is 
underexpressed or down-regulated in the test sample relative to a control sample. Furthermore, a 
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difference in expression level associated with an increase in expression of at least about 25%, usually 
at least about 50% to 75%, more usually at least about 90% and can be at least about P/i-fold, usually 
at least about 2-fold to about 10-fold, and can be about 100-fold to about 1,000-fold increase relative 
to a control sample is indicative of a differentially expressed gene of interest, i.e., an overexpressed or 

5 up-regulated gene. 

"Differentially expressed polynucleotide" as used herein means a nucleic acid molecule (RNA 
or DNA) comprising a sequence that represents a differentially expressed gene, e.g., the differentially 
expressed polynucleotide comprises a sequence (e.g., an open reading frame encoding a gene product) 
that uniquely identifies a differentially expressed gene so that detection of the differentially expressed 

10 polynucleotide in a sample is correlated with the presence of a differentially expressed gene in a 
sample. "Differentially expressed polynucleotide" is also meant to encompass fragments of the 
disclosed polynucleotides, e.g., fragments retaining biological activity, as well as nucleic acids 
homologous, substantially similar, or substantially identical (e.g., having about 90% sequence identity) 
to the disclosed polynucleotides. 

15 Methods of the subject invention useful in diagnosis or prognosis typically involve comparison 

of the abundance of a selected differentially expressed gene product in a sample of interest with that of 
a control to determine any relative differences in the expression of the gene product, where the 
difference can be measured qualitatively and/or quantitatively. Quantitation can be accomplished, for 
example, by comparing the level of expression product detected in the sample with the amounts of 

20 product present in a standard curve. A comparison can be made visually; by using a technique such as 
densitometry, with or without computerized assistance; by preparing a representative library of cDNA 
clones of mRNA isolated from a test sample, sequencing the clones in the library to determine that 
number of cDNA clones corresponding to the same gene product, and analyzing the number of clones 
corresponding to that same gene product relative to the number of clones of the same gene product in a 

25 control sample; or by using an array to detect relative levels of hybridization to a selected sequence or 
set of sequences, and comparing the hybridization pattern to that of a control. The differences in 
expression are then correlated with the presence or absence of an abnormal expression pattern. A 
variety of different methods for determining the nucleic acid abundance in a sample are known to those 
of skill in the art (see, e.g., WO 97/27317). 

30 In general, diagnostic assays of the invention involve detection of a gene product of a 

polynucleotide sequence (e.g., mRNA or polypeptide) that corresponds to a sequence of SEQ ID 
NOS: 1-6010. The patient from whom the sample is obtained can be apparently healthy, susceptible to 
disease (e.g., as determined by family history or exposure to certain environmental factors), or can 
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already be identified as having a condition in which altered expression of a gene product of the 
invention is implicated. 

Diagnosis can be determined based on detected gene product expression levels of a gene 
product encoded by at least one, preferably at least two or more, at least 3 or more, or at least 4 or more 
of the polynucleotides having a sequence set forth in SEQ ID NOS: 1-6010, and can involve detection 
of expression of genes corresponding to all of SEQ ID NOS: 1-60 10 and/or additional sequences that 
can serve as additional diagnostic markers and/or reference sequences. Where the diagnostic method is 
designed to detect the presence or susceptibility of a patient to cancer, the assay preferably involves 
detection of a gene product encoded by a gene corresponding to a polynucleotide that is differentially 
expressed in cancer. Examples of such differentially expressed polynucleotides are described in the 
Examples below. Given the provided polynucleotides and information regarding their relative 
expression levels provided herein, assays using such polynucleotides and detection of their expression 
levels in diagnosis and prognosis will be readily apparent to the ordinarily skilled artisan. 

Any of a variety of detectable labels can be used in connection with the various embodiments 
of the diagnostic methods of the invention. Suitable detectable labels include fluorochromes,(e.g. 
fluorescein isothiocyanate (FITC), rhodamine, Texas Red, phycoerythrin, allophycocyanin, 6- 
carboxyfluorescein (6-FAM), 2%7'-dimetho^-4 :> ,5 5 -dichloro-6-carboxyfluorescem, 6-carboxy-X- 
rhodamine (ROX), 6-carboxy-2',4',7',4,7-hexachlorofluorescein (HEX), 5-carboxyfluorescein 
(5-FAM) or N,N,N',N'-tetramethyl-6-carboxyrhodamine (TAMRA)), radioactive labels, (e.g. 32P, 
35S, 3H, etc.), and the like. The detectable label can involve a two stage systems (e.g., biotin-avidin, 
hapten-anti-hapten antibody, etc.). 

Reagents specific for the polynucleotides and polypeptides of the invention, such as antibodies 
and nucleotide probes, can be supplied in a kit for detecting the presence of an expression product in a 
biological sample. The kit can also contain buffers or labeling components, as well as instructions for 
using the reagents to detect and quantify expression products in the biological sample. Exemplary 
embodiments of the diagnostic methods of the invention are described below in more detail. 

Polypeptide detection in diagnosis. In one embodiment, the test sample is assayed for the level 
of a differentially expressed polypeptide. Diagnosis can be accomplished using any of a number of 
methods to determine the absence or presence or altered amounts of the differentially expressed 
polypeptide in the test sample. For example, detection can utilize staining of cells or histological 
sections with labeled antibodies, performed in accordance with conventional methods. Cells can be 
permeabilized to stain cytoplasmic molecules. In general, antibodies that specifically bind a 
differentially expressed polypeptide of the invention are added to a sample, and incubated for a period 
of time sufficient to allow binding to the epitope, usually at least about 10 minutes. The antibody can 
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be detectably labeled for direct detection (e.g., using radioisotopes, enzymes, fluorescers, 
chemiluminescers, and the like), or can be used in conjunction with a second stage antibody or reagent 
to detect binding (e.g., biotin with horseradish peroxidase-conjugated avidin, a secondary antibody 
conjugated to a fluorescent compound, e.g. fluorescein, rhodamine, Texas red, etc.). The absence or 
5 presence of antibody binding can be determined by various methods, including flow cytometry of 
dissociated cells, microscopy, radiography, scintillation counting, etc. Any suitable alternative 
methods of qualitative or quantitative detection of levels or amounts of differentially expressed 
polypeptide can be used, for example, ELIS A, western blot, immunoprecipitation, radioimmunoassay, 
etc. 

10 mRNA detection. The diagnostic methods of the invention can also or alternatively involve 

detection of mRNA encoded by a gene corresponding to a differentially expressed polynucleotide of the 
invention. Any suitable qualitative or quantitative methods known in the art for detecting specific 
mRNAs can be used. mRNA can be detected by, for example, in situ hybridization in tissue sections, 
by reverse transcriptase-PCR, or in Northern blots containing poly A+ mRNA. One of skill in the art 

15 can readily use these methods to determine differences in the size or amount of mRNA transcripts 

between two samples. mRNA expression levels in a sample can also be determined by generation of a 
library of expressed sequence tags (ESTs) from the sample, where the EST library is representative of 
sequences present in the sample (Adams, et al., (1991) Science 252: 1651). Enumeration of the relative 
representation of ESTs within the library can be used to approximate the relative representation of the 

20 gene transcript within the starting sample. The results of EST analysis of a test sample can then be 
compared to EST analysis of a reference sample to determine the relative expression levels of a 
selected polynucleotide, particularly a polynucleotide corresponding to one or more of the differentially 
expressed genes described herein. Alternatively, gene expression in a test sample can be performed 
using serial analysis of gene expression (SAGE) methodology (e.g., Velculescu et al., Science (1995) 

25 270:484) or differential display (DD) methodology (see, e.g., USPN 5,776,683 and USPN 5,807,680). 

Alternatively, gene expression can be analyzed using hybridization analysis. Oligonucleotides 
or cDNA can be used to selectively identify or capture DNA or RNA of specific sequence composition, 
and the amount of RNA or cDNA hybridized to a known capture sequence determined qualitatively or 
quantitatively, to provide information about the relative representation of a particular message within 

30 the pool of cellular messages in a sample. Hybridization analysis can be designed to allow for 
concurrent screening of the relative expression of hundreds to thousands of genes by using, for 
example, array-based technologies having high density formats, including filters, microscope slides, or 
microchips, or solution-based technologies that use spectroscopic analysis (e.g., mass spectrometry). 
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One exemplary use of arrays in the diagnostic methods of the invention is described below in more 
detail. 

Use of a si ngle gene in diagnostic applications. The diagnostic methods of the invention can 
focus on the expression of a single differentially expressed gene. For example, the diagnostic method 
5 can involve detecting a differentially expressed gene, or a polymorphism of such a gene (e.g., a 

polymorphism in a coding region or control region), that is associated with disease. Disease-associated 
polymorphisms can include deletion or truncation of the gene, mutations that alter expression level 
and/or affect activity of the encoded protein, etc. 

A number of methods are available for analyzing nucleic acids for the presence of a specific 
10 sequence, e.g. a disease associated polymorphism. Where large amounts of DNA are available, 

genomic DNA is used directly. Alternatively, the region of interest is cloned into a suitable vector and 
grown in sufficient quantity for analysis. Cells that express a differentially expressed gene can be used 
as a source of mRNA, which can be assayed directly or reverse transcribed into cDNA for analysis. 
The nucleic acid can be amplified by conventional techniques, such as the polymerase chain reaction 
15 (PCR), to provide sufficient amounts for analysis, and a detectable label can be included in the 

amplification reaction (e.g., using a detectably labeled primer or detectably labeled oligonucleotides) to 
facilitate detection. Alternatively, various methods are also known in the art that utilize oligonucleotide 
ligation as a means of detecting polymorphisms, see, e.g., Riley et al., Nucl. Acids Res. (1990) 
18:2887; and Delahunty et al., Am. J. Hum. Genet. (1996) 58:1239. 
20 The amplified or cloned sample nucleic acid can be analyzed by one of a number of methods 

known in the art. The nucleic acid can be sequenced by dideoxy or other methods, and the sequence of 
bases compared to a selected sequence, e.g., to a wild-type sequence. Hybridization with the 
polymorphic or variant sequence can also be used to determine its presence in a sample (e.g., by 
Southern blot, dot blot, etc.). The hybridization pattern of a polymorphic or variant sequence and a 
25 control sequence to an array of oligonucleotide probes immobilized on a solid support, as described in 
US 5,445,934, or in WO 95/35505, can also be used as a means of identifying polymorphic or variant 
sequences associated with disease. Single strand conformational polymorphism (SSCP) analysis, 
denaturing gradient gel electrophoresis (DGGE), and heteroduplex analysis in gel matrices are used to 
detect conformational changes created by DNA sequence variation as alterations in electrophoretic 
30 mobility. Alternatively, where a polymorphism creates or destroys a recognition site for a restriction 
endonuclease, the sample is digested with that endonuclease, and the products size fractionated to 
determine whether the fragment was digested. Fractionation is performed by gel or capillary 
electrophoresis, particularly acrylamide or agarose gels. 
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Screening for mutations in a gene can be based on the functional or antigenic characteristics of 
the protein. Protein truncation assays are useful in detecting deletions that can affect the biological 
activity of the protein. Various immunoassays designed to detect polymorphisms in proteins can be 
used in screening. Where many diverse genetic mutations lead to a particular disease phenotype, 

5 functional protein assays have proven to be effective screening tools. The activity of the encoded 
protein can be determined by comparison with the wild-type protein. 
Diagnosis. Prognosis. Assessment of Therapy (TherametricsV and Manageme nt of Cancer 

The polynucleotides of the invention, as well as their gene products, are of particular interest 
as genetic or biochemical markers (e.g., in blood or tissues) that will detect the earliest changes along 

10 the carcinogenesis pathway and/or to monitor the efficacy of various therapies and preventive 

interventions. For example, the level of expression of certain polynucleotides can be indicative of a 
poorer prognosis, and therefore warrant more aggressive chemo- or radio-therapy for a patient or vice 
versa. The correlation of novel surrogate tumor specific features with response to treatment and 
outcome in patients can define prognostic indicators that allow the design of tailored therapy based on 

15 the molecular profile of the tumor. These therapies include antibody targeting, antagonists (e.g., small 
molecules), and gene therapy. Determining expression of certain polynucleotides and comparison of a 
patient's profile with known expression in normal tissue and variants of the disease allows a 
determination of the best possible treatment for a patient, both in terms of specificity of treatment and 
in terms of comfort level of the patient Surrogate tumor markers, such as polynucleotide expression, 

20 can also be used to better classify, and thus diagnose and treat, different forms and disease states of 
cancer. Two classifications widely used in oncology that can benefit from identification of the 
expression levels of the genes corresponding to the polynucleotides of the invention are staging of the 
cancerous disorder, and grading the nature of the cancerous tissue. 

The polynucleotides that correspond to differentially expressed genes, as well as their encoded 

25 gene products, can be useful to monitor patients having or susceptible to cancer to detect potentially 
malignant events at a molecular level before they are detectable at a gross morphological level. In 
addition, the polynucleotides of the invention, as well as the genes corresponding to such 
polynucleotides, can be useful as therametrics, e.g., to assess the effectiveness of therapy by using the 
polynucleotides or their encoded gene products, to assess, for example, tumor burden in the patient 

30 before, during, and after therapy. 

Furthermore, a polynucleotide identified as corresponding to a gene that is differentially 
expressed in, and thus is important for, one type of cancer can also have implications for development 
or risk of development of other types of cancer, e.g., where a polynucleotide represents a gene 
differentially expressed across various cancer types. Thus, for example, expression of a polynucleotide 
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corresponding to a gene that has clinical implications for metastatic colon cancer can also have clinical 
implications for stomach cancer or endometrial cancer. 

Staging. Staging is a process used by physicians to describe how advanced the cancerous state 
is in a patient. Staging assists the physician in determining a prognosis, planning treatment and 
5 evaluating the results of such treatment Staging systems vary with the types of cancer, but generally 
involve the following "TNM" system: the type of tumor, indicated by T; whether the cancer has 
metastasized to nearby lymph nodes, indicated by N; and whether the cancer has metastasized to more 
distant parts of the body, indicated by M. Generally, if a cancer is only detectable in the area of the 
primary lesion without having spread to any lymph nodes it is called Stage I. If it has spread only to 
10 the closest lymph nodes, it is called Stage II. In Stage EI, the cancer has generally spread to the lymph 
nodes in near proximity to the site of the primary lesion. Cancers that have spread to a distant part of 
the body, such as the liver, bone, brain or other site, are Stage IV, the most advanced stage. 

The polynucleotides of the invention can facilitate fine-tuning of the staging process by 
identifying markers for the aggresivity of a cancer, e.g., the metastatic potential, as well as the presence 
15 in different areas of the body. Thus, a Stage II cancer with a polynucleotide signifying a higji 

metastatic potential cancer can be used to change a borderline Stage II tumor to a Stage III tumor, 
justifying more aggressive therapy. Conversely, the presence of a polynucleotide signifying a lower 
metastatic potential allows more conservative staging of a tumor. 

Grading of cancers. Grade is a term used to describe how closely a tumor resembles normal 
20 tissue of its same type. The microscopic appearance of a tumor is used to identify tumor grade based 
on parameters such as cell morphology, cellular organization, and other markers of differentiation. As 
a general rule, the grade of a tumor corresponds to its rate of growth or aggressiveness, with 
undifferentiated or high-grade tumors being more aggressive than well-differentiated or low-grade 
tumors. The following guidelines are generally used for grading tumors: 1) GX Grade cannot be 
25 assessed; 2) Gl Well differentiated; 3) G2 Moderately well differentiated; 4) G3 Poorly differentiated; 
5) G4 Undifferentiated. The polynucleotides of the invention can be especially valuable in determining 
the grade of the tumor, as they not only can aid in determining the differentiation status of the cells of a 
tumor, they can also identify factors other than differentiation that are valuable in determining the 
aggressiveness of a tumor, such as metastatic potential. 
30 Detection of colon cancer. The polynucleotides corresponding to genes that exhibit the 

appropriate expression pattern can be used to detect colon cancer in a subject. Colorectal cancer is one 
of the most common neoplasms in humans and perhaps the most frequent form of hereditary neoplasia. 
Prevention and early detection are key factors in controlling and curing colorectal cancer. Colorectal 
cancer begins as polyps, which are small, benign growths of cells that form on the inner lining of the 
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colon. Over a period of several years, some of these polyps accumulate additional mutations and 
become cancerous. Multiple familial colorectal cancer disorders have been identified, which are 
summarized as follows: 1) Familial adenomatous polyposis (FAP); 2) Gardner's syndrome; 3) 
Hereditary nonpolyposis colon cancer (HNPCC); and 4) Familial colorectal cancer in Ashkenazi Jews. 

5 The expression of appropriate polynucleotides of the invention can be used in the diagnosis, prognosis 
and management of colorectal cancer. Detection of colon cancer can be determined using expression 
levels of any of these sequences alone or in combination with the levels of expression. Determination 
of the aggressive nature and/or the metastatic potential of a colon cancer can be determined by 
comparing levels of one or more polynucleotides of the invention and comparing total levels of another 

10 sequence known to vary in cancerous tissue, e.g., expression of p53, DCC ras, lor FAP (see, e.g., 

Fearon ER, et al., Cell (1990) 61(5):759; Hamilton SR et aL, Cancer (1993) 72:957; Bodmer W, et aL, 
Nat Genet (1994) 4(3):217; Fearon ER, Ann N Y Acad Sci. (1995) 768: 101). For example, 
development of colon cancer can be detected by examining the ratio of any of the polynucleotides of 
the invention to the levels of oncogenes (e.g., ras) or tumor suppressor genes (e.g., FAP or p53). Thus, 

15 expression of specific marker polynucleotides can be used to discriminate between normal and 
cancerous colon tissue, to discriminate between colon cancers with different cells of origin, to 
discriminate between colon cancers with different potential metastatic rates, etc. For a review of 
markers of cancer, see, e.g., Hanahan et al. (2000) Cell 100:57-70. 

Detection of prostate cancer. The polynucleotides and their corresponding genes and gene 

20 products exhibiting the appropriate differential expression pattern can be used to detect prostate cancer 
in a subject. Over 95% of primary prostate cancers are adenocarcinomas. Signs and symptoms may 
include: frequent urination, especially at night; inability to urinate; trouble starting or holding back 
urination; a weak or interrupted urine flow; and frequent pain or stiffness in the lower back, hips or 
upper thighs. 

25 Many of the signs and symptoms of prostate cancer can be caused by a variety of other non- 

cancerous conditions. For example, one common cause of many of these signs and symptoms is a 
condition called benign prostatic hypertrophy, or BPH. In BPH, the prostate gets bigger and may block 
the flow of urine or interfere with sexual function. The methods and compositions of the invention can 
be used to distinguish between prostate cancer and such non-cancerous conditions. The methods of the 

30 invention can be used in conjunction with conventional methods of diagnosis, e.g., digital rectal exam 
and/or detection of the level of prostate specific antigen (PSA), a substance produced and secreted by 
the prostate. 

Detection of breast cancer. The majority of breast cancers are adenocarcinoma subtypes, 
which can be summarized as follows: 1) ductal carcinoma in situ (DOS), including comedocarcinoma; 
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2) infiltrating (or invasive) ductal carcinoma (IDC); 3) lobular carcinoma in situ (LCIS); 4) infiltrating 
(or invasive) lobular carcinoma (ILC); 5) inflammatory breast cancer; 6) medullary carcinoma; 7) 
mucinous carcinoma; 8) Paget's disease of the nipple; 9) Phyllodes tumor; and 10) tubular carcinoma; 
The expression of polynucleotides of the invention can be used in the diagnosis and 
5 management of breast cancer, as well as to distinguish between types of breast cancer. Detection of 
breast cancer can be determined using expression levels of any of the appropriate polynucleotides of 
the invention, either alone or in combination. Determination of the aggressive nature and/or the 
metastatic potential of a breast cancer can also be determined by comparing levels of one or more 
polynucleotides of the invention and comparing levels of another sequence known to vary in cancerous 
10 tissue, e.g., ER expression. In addition, development of breast cancer can be detected by examining the 
ratio of expression of a differentially expressed polynucleotide to the levels of steroid hormones (e.g., 
testosterone or estrogen) or to other hormones (e.g., growth hormone, insulin). Thus, expression of 
specific marker polynucleotides can be used to discriminate between normal and cancerous breast 
tissue, to discriminate between breast cancers with different cells of origin, to discriminate between 
15 breast cancers with different potential metastatic rates, etc. 

Detection of lung cancer. The polynucleotides of the invention can be used to detect lung 
cancer in a subject. Although there are more than a dozen different kinds of lung cancer, the two main 
types of lung cancer are small cell and nonsmall cell, which encompass about 90% of all lung cancer 
cases. Small cell carcinoma (also called oat cell carcinoma) usually starts in one of the larger bronchial 
20 tubes, grows fairly rapidly, and is likely to be large by the time of diagnosis. Nonsmall cell lung cancer 
(NSCLC) is made up of three general subtypes of lung cancer. Epidermoid carcinoma (also called 
squamous cell carcinoma) usually starts in one of the larger bronchial tubes and grows relatively 
slowly. The size of these tumors can range from very small to quite large. Adenocarcinoma starts 
growing near the outside surface of the lung and can vary in both size and growth rate. Some slowly 
25 growing adenocarcinomas are described as alveolar cell cancer. Large cell carcinoma starts near the 
surface of the lung, grows rapidly, and the growth is usually fairly large when diagnosed. Other less 
common forms of lung cancer are carcinoid, cylindroma, mucoepidennoid, and malignant 
mesothelioma. 

The polynucleotides of the invention, e.g., polynucleotides differentially expressed in normal 
30 cells versus cancerous lung cells (e.g., tumor cells of high or low metastatic potential) or between types 
of cancerous lung cells (e.g., high metastatic versus low metastatic), can be used to distinguish types of 
lung cancer as well as identifying traits specific to a certain patient's cancer and selecting an 
appropriate therapy. For example, if the patient's biopsy expresses a polynucleotide that is associated 
with a low metastatic potential, it may justify leaving a larger portion of the patient's lung in surgery to 
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remove the lesion. Alternatively, a smaller lesion with expression of a polynucleotide that is associated 
with high metastatic potential may justify a more radical removal of lung tissue and/or the surrounding 
lymph nodes, even if no metastasis can be identified through pathological examination. 
Use of Polynucleotides to Screen for Peptide Analogs and Antagonists 

5 Polypeptides encoded by the instant polynucleotides and corresponding full-length genes can 

be used to screen peptide libraries to identify binding partners, such as receptors, from among the 
encoded polypeptides. Peptide libraries can be synthesized according to methods known in the art (see, 
e.g., USPN 5,010,175 , and WO 91/17823). Agonists or antagonists of the polypeptides of the 
invention can be screened using any available method known in the art, such as signal transduction, 

10 antibody binding, receptor binding, mitogenic assays, chemotaxis assays, etc. The assay conditions 

ideally should resemble the conditions under which the native activity is exhibited in vivo, that is, under 
physiologic pH, temperature, and ionic strength. Suitable agonists or antagonists will exhibit strong 
inhibition or enhancement of the native activity at concentrations that do not cause toxic side effects in 
the subject. Agonists or antagonists that compete for binding to the native polypeptide can require 

1 5 concentrations equal to or greater than the native concentration, while inhibitors capable of binding 
irreversibly to the polypeptide can be added in concentrations on the order of the native concentration. 

Such screening and experimentation can lead to identification of a novel polypeptide binding 
partner, such as a receptor, encoded by a gene or a cDNA corresponding to a polynucleotide of the 
invention, and at least one peptide agonist or antagonist of the novel binding partner. Such agonists 

20 and antagonists can be used to modulate, enhance, or inhibit receptor function in cells to which the 

receptor is native, or in cells that possess the receptor as a result of genetic engineering. Further, if the 
novel receptor shares biologically important characteristics with a known receptor, information about 
agonist/antagonist binding can facilitate development of improved agonists/antagonists of the known 
receptor. 

25 Pharmaceutical Compositions and Uses 

Pharmaceutical compositions can comprise polypeptides, receptors that specifically bind a 
polypeptide produced by a differentially expressed gene (e.g., antibodies, or polynucleotides (including 
antisense nucleotides and ribozymes) of the claimed invention in a therapeutically effective amount. 
The compositions can be used to treat primary tumors as well as metastases of primary tumors. In 

30 addition, the pharmaceutical compositions can be used in conjunction with conventional methods of 
cancer treatment, e.g., to sensitize tumors to radiation or conventional chemotherapy. 

Where the pharmaceutical composition comprises a receptor (such as an antibody) that 
specifically binds to a gene product encoded by a differentially expressed gene, the receptor can be 
coupled to a drug for delivery to a treatment site or coupled to a detectable label to facilitate imaging of 

42 



WSDOCID: <WO 0214500A2_I_> 



WO 02/14500 



PCT/US01/25840 



a site comprising colon cancer cells. Methods for coupling antibodies to drugs and detectable labels 
are well known in the art, as are methods for imaging using detectable labels. 

The term "therapeutically effective amount" as used herein refers to an amount of a therapeutic 
agent to treat, ameliorate, or prevent a desired disease or condition, or to exhibit a detectable 

5 therapeutic or preventative effect The effect can be detected by, for example, chemical markers or 
antigen levels. Therapeutic effects also include reduction in physical symptoms, such as decreased 
body temperature. The precise effective amount for a subject will depend upon the subject's size and 
health, the nature and extent of the condition, and the therapeutics or combination of therapeutics 
selected for administration. Thus, it is not useful to specify an exact effective amount in advance. 

10 However, the effective amount for a given situation is determined by routine experimentation and is 
within the judgment of the clinician. For purposes of the present invention, an effective dose will 
generally be from about 0.01 mg/kg to 50 mg/kg or 0.05 mg/kg to about 10 mg/kg of the DNA 
constructs in the individual to which it is administered. 

A pharmaceutical composition can also contain a pharmaceutically acceptable carrier. The 

15 term "pharmaceutically acceptable carrier" refers to a carrier for administration of a therapeutic agent, 
such as antibodies or a polypeptide, genes, and other therapeutic agents. The term refers to any 
pharmaceutical carrier that does not itself induce the production of antibodies harmful to the individual 
receiving the composition, and which can be administered without undue toxicity. Suitable carriers can 
be large, slowly metabolized macromolecules such as proteins, polysaccharides, polylactic acids, 

20 polyglycolic acids, polymeric amino acids, amino acid copolymers, and inactive virus particles. Such 
carriers are well known to those of ordinary skill in the art. Pharmaceutically acceptable carriers in 
therapeutic compositions can include liquids such as water, saline, glycerol and ethanol. Auxiliary 
substances, such as wetting or emulsifying agents, pH buffering substances, and the like, can also be 
present in such vehicles. Typically, the therapeutic compositions are prepared as injectables, either as 

25 liquid solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid vehicles 
prior to injection can also be prepared. Liposomes are included within the definition of a 
pharmaceutically acceptable carrier. Pharmaceutically acceptable salts can also be present in the 
pharmaceutical composition, e.g., mineral acid salts such as hydrochlorides, hydrobromides, 
phosphates, sulfates, and the like; and the salts of organic acids such as acetates, propionates, 

30 malonates, benzoates, and the like. A thorough discussion of pharmaceutically acceptable excipients is 
available in Remington's Pharmaceutical Sciences (Mack Pub. Co., N. J. 1991). 
Delivery Methods 

Once formulated, the compositions of the invention can be (1) administered directly to the 
subject (e.g., as polynucleotide or polypeptides); or (2) delivered ex vivo, to cells derived from the 

43 



<WO 021 4500A2J_> 



WO 02/14500 



PCT/US01/25840 



subject (e.g., as in ex vivo gene therapy). Direct delivery of the compositions will generally be 
accomplished by parenteral injection, e.g., subcutaneously, intraperitoneally, intravenously or 
intramuscularly, intratumoraUy or to the interstitial space of a tissue. Other modes of administration 
include oral and pulmonary administration, suppositories, and transdermal applications, needles, and 
5 gene guns or hyposprays. Dosage treatment can be a single dose schedule or a multiple dose schedule. 

Methods for the ex vivo delivery and reimplantation of transformed cells into a subject are 
known in the art and described in, e.g., WO 93/14778. Examples of cells useful in ex vivo applications 
include, for example, stem cells, particularly hematopoetic, lymph cells, macrophages, dendritic cells, 
or tumor cells. Generally, delivery of nucleic acids for both ex vivo and in vitro applications can be 
10 accomplished by, for example, dextran-mediated transfection, calcium phosphate precipitation, 
polybrene mediated transfection, protoplast fusion, electroporation, encapsulation of the 
polynucleotide(s) in liposomes, and direct microinjection of the DNA into nuclei, all well known in the 
art. 

Once differential expression of a gene corresponding to a polynucleotide of the invention has 

15 been found to correlate with a proliferative disorder, such as neoplasia, dysplasia, and hyperplasia, the 
disorder can be amenable to treatment by administration of a therapeutic agent based on the provided 
polynucleotide, corresponding polypeptide or other corresponding molecule (e.g., antisense, ribozyme, 
etc.). In other embodiments, the disorder can be amenable to treatment by administration of a small 
molecule drug that, for example, serves as an inhibitor (antagonist) of the function of the encoded gene 

20 product of a gene having increased expression in cancerous cells relative to normal cells or as an 
agonist for gene products that are decreased in expression in cancerous cells (e.g., to promote the 
activity of gene products that act as tumor suppressors). 

The dose and the means of administration of the inventive pharmaceutical compositions are 
determined based on the specific qualities of the therapeutic composition, the condition, age, and 

25 weight of the patient, the progression of the disease, and other relevant factors. For example, 

administration of polynucleotide therapeutic composition agents of the invention includes local or 
systemic administration, including injection, oral administration, particle gun or catheterized 
administration, and topical administration. Preferably, the therapeutic polynucleotide composition 
contains an expression construct comprising a promoter operably linked to a polynucleotide of at least 

30 12, 22, 25, 30, or 35 contiguous nt of the polynucleotide of the invention. Various methods can be used 
to administer the therapeutic composition directly to a specific site in the body. For example, a small 
metastatic lesion is located and the therapeutic composition injected several times in several different 
locations within the body of tumor. Alternatively, arteries that serve a tumor are identified, and the 
therapeutic composition injected into such an artery, in order to deliver the composition directly into 
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the tumor. A tumor that has a necrotic crater is aspirated and the composition injected directly into the 
now empty center of the tumor. The antisense composition is directly administered to the surface of 
the tumor, for example, by topical application of the composition. X-ray imaging is used to assist in 
certain of the above delivery methods. 
5 Targeted delivery of therapeutic compositions containing an antisense polynucleotide, 

subgenomic polynucleotides, or antibodies to specific tissues can also be used. Receptor-mediated 
DNA delivery techniques are described in, for example, Findeis et al., Trends Biotechnol. (1993) 
1 1:202; Chiou et al., Gene Therapeutics: Methods And Applications Of Direct Gene Transfer (J A. 
Wolff; ed.) (1994); Wu et al., J. Biol. Chem. (1988) 263:621; Wu et al., J. Biol. Chem. (1994) 
10 269:542; Zenke et aL, Proc. Natl. Acad. Sci. (USA) (1990) 87:3655; Wu et al., J. Biol. Chem. (1991) 
266:338. Therapeutic compositions containing a polynucleotide are administered in a range of about 
100 ng to about 200 mg of DNA for local administration in a gene therapy protocol. Concentration 
ranges of about 500 ng to about 50 mg, about 1 micrograms to about 2 mg, about 5 micrograms to 
about 500 micrograms, and about 20 micrograms to about 100 micrograms of DNA can also be used 
15 during a gene therapy protocol. Factors such as method of action (e.g., for enhancing or inhibiting 
levels of the encoded gene product) and efficacy of transformation and expression are considerations 
which will affect the dosage required for ultimate efficacy of the antisense subgenomic polynucleotides. 
Where greater expression is desired over a larger area of tissue, larger amounts of antisense 
subgenomic polynucleotides or the same amounts readministered in a successive protocol of 
20 administrations, or several administrations to different adjacent or close tissue portions of, for 

example, a tumor site, may be required to effect a positive therapeutic outcome. In all cases, routine 
experimentation in clinical trials will determine specific ranges for optimal therapeutic effect For 
polynucleotide related genes encoding polypeptides or proteins with anti- in fla mm atory activity, 
suitable use, doses, and administration are described in USPN 5,654,173. 
25 The therapeutic polynucleotides and polypeptides of the present invention can be delivered 

using gene delivery vehicles. The gene delivery vehicle can be of viral or non-viral origin (see 
generally, Jolly, Cancer Gene Therapy (1994) 1:51; Kimura, Human Gene Therapy (1994) 5:845; 
Connelly, Human Gene Therapy (1995) 1:185; and Kaplitt, Nature Genetics (1994) 6:148). 
Expression of such coding sequences can be induced using endogenous mammalian or heterologous 
30 promoters. Expression of the coding sequence can be either constitutive or regulated. 

Viral-based vectors for delivery of a desired polynucleotide and expression in a desired cell are 
well known in the art. Exemplary viral-based vehicles include, but are not limited to, recombinant 
retroviruses (see, e.g., WO 90/07936; WO 94/03622; WO 93/25698; WO 93/25234; USPN 5, 
219,740; WO 93/11230; WO 93/10218; USPN 4,777,127; GB Patent No. 2,200,651; EP 0 345 242; 

45 



3NSDOCID: <WO 021 4500A2_I_> 



WO 02/14500 



PCT/US01/25840 



and WO 91/02805), alphavirus-based vectors (e.g., Sindbis virus vectors, Semlflri forest virus (ATCC 
VR-67; ATCC VR-1247), Ross River virus (ATCC VR-373; ATCC VR-1246) and Venezuelan 
equine encephalitis virus (ATCC VR-923; ATCC VR-1250; ATCC VR 1249; ATCC VR-532), and 
adeno-associated virus (AAV) vectors (see, e.g., WO 94/12649, WO 93/03769; WO 93/19191; WO 

5 94/28938; WO 95/1 1984 and WO 95/00655). Administration of DNA linked to killed adenovirus, as 
described in Curiel, Hum. Gene Ther. (1992) 3: 147, can also be employed. 

Non-viral delivery vehicles and methods can also be employed, including, but not limited to, 
polycationic condensed DNA linked or unlinked to killed adenovirus alone (see, e.g., Curiel, Hum. 
Gene Ther. (1992) 3: 147); ligand-linked DNA (see, e.g., Wu, J. Biol. Chem. (1989) 264: 16985); 

10 eukaryotic cell delivery vehicles cells (see, e.g., USPN 5,814,482; WO 95/07994; WO 96/17072; 
WO 95/30763; and WO 97/42338) and nucleic charge neutralization or fusion with cell membranes. 
Naked DNA can also be employed. Exemplary naked DNA introduction methods are described in WO 
90/1 1092 and USPN 5,580,859. Liposomes that can act as gene delivery vehicles are described in 
USPN 5,422,120; WO 95/13796; WO 94/23697; WO 91/14445; and EP 0524968. Additional 

15 approaches are described in Philip, Mol. Cell Biol. (1994) 14:241 1, and in Woffendin, Proc. Natl. 
Acad. Sci. (1994) 91:1581 

Further non-viral delivery suitable for use includes mechanical delivery systems such as the 
approach described in Woffendin et al., Proc. Natl. Acad. Sci. USA (1994) 91(24):11581. Moreover, 
the coding sequence and the product of expression of such can be delivered through deposition of 

20 photopolymerized hydrogel materials or use of ionizing radiation (see, e.g., USPN 5,206,152 and WO 
92/1 1033). Other conventional methods for gene delivery that can be used for delivery of the coding 
sequence include, for example, use of hand-held gene transfer particle gun (see, e.g., USPN 
5,149,655); use of ionizing radiation for activating transferred gene (see, e.g., USPN 5,206,152 and 
WO 92/11033). 

25 The present invention will now be illustrated by reference to the following examples which set 

forth particularly advantageous embodiments. However, it should be noted that these embodiments are 
illustrative and are not to be construed as restricting the invention in any way. 

EXAMPLES 

30 The following examples are put forth so as to provide those of ordinary skill in the art with a 

complete disclosure and description of how to make and use the present invention, and are not intended 
to limit the scope of what the inventors regard as their invention nor are they intended to represent that 
the experiments below are all or the only experiments performed. It will be readily apparent to those 
skilled in the art that the formulations, dosages, methods of administration, and other parameters of 
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this invention may be further modified or substituted in various ways without departing from the spirit 
and scope of the invention. Efforts have been made to ensure accuracy with respect to numbers used 
(e.g. amounts, temperature, etc.) but some experimental errors and deviations should be accounted for. 
Unless indicated otherwise, parts are parts by weigfat, molecular weight is weight average molecular 
5 weight, temperature is in degrees Centigrade, and pressure is at or near atmospheric. 

Example 1 Source of Biological Materials and Overview of Novel Polynucleotides Expressed by the 
Biological Materials 

Candidate polynucleotides that may represent novel polynucleotides were obtained from 
10 cDNA libraries generated from selected cell lines and patient tissues. In order to obtain the candidate 
polynucleotides, mRNA was isolated from several selected cell lines and patient tissues, and used to 
construct cDNA libraries. The cells and tissues that served as sources for these cDNA libraries are 
summarized in Table 1 below. 

Human colon cancer cell line Kml2L4-A (Morikawa, et al., Cancer Research (1988) 48:6863) 
15 is derived from the KM12C cell line. The KM12C cell line (Morikawa et al. Cancer Res. (1988) 

48: 1943-1948), which is poorly metastatic (low metastatic) was established in culture from a Dukes' 
stage B2 surgical specimen (Morikawa et al. Cancer Res. (1988) 48:6863). The KM12L4-A is a 
highly metastatic subline derived from KM12C (Yeatman et al. Nucl. Acids. Res. (1995) 23:4007; 
Bao-Ling et al. Proc. Annu. Meet. Am. Assoc. Cancer. Res. (1995) 21:3269). The KM12C and 
20 KM12C-derived cell lines (e.g., KM12L4, KM12L4-A, etc.) are well-recognized in the art as a model 
cell line for the study of colon cancer (see, e.g., Moriakawa et al., supra; Radinsky et al. Clin Cancer 
Res. (1995) 1: 19; Yeatman et al., (1995) supra; Yeatman et al. Clin. Exp. Metastasis (1996) 14:246). 

The MDA-MB-23 1 cell line (Brinkley et al. Cancer Res. (1980) 40:3118-3129) was originally 
isolated from pleural effusions (Cailleau, J. Natl. Cancer. Inst. (1974) 53:661), is of high metastatic 
25 potential, and forms poorly differentiated adenocarcinoma grade II in nude mice consistent with breast 
carcinoma. The MCF7 cell line was derived from a pleural effusion of a breast adenocarcinoma and is 
non-metastatic. The MV-522 cell line is derived from a human lung carcinoma and is of high 
metastatic potential. The UCP-3 cell line is a low metastatic human lung carcinoma cell line; the MV- 
522 is a high metastatic variant of UCP-3. These cell lines are well-recognized in the art as models for 
30 the study of human breast and lung cancer (see, e.g., Chandrasekaran et al., Cancer Res. (1979) 39:870 
(MDA-MB-231 andMCF-7); Gastparet al., J Med Chem (1998) 41:4965 (MDA-MB-231 andMCF- 
7); Ranson et al., Br J Cancer (1998) 77: 1586 (MDA-MB-23 1 and MCF-7); Kuang et al., Nucleic 
Acids Res (1998) 26: 1116 (MDA-MB-23 1 and MCF-7); Varki et al., Int J Cancer (1987) 40:46 
(UCP-3); Varki et al., Tumour Biol. (1990) 1 1:327; (MV-522 and UCP-3); Varki et aL, Anticancer 
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Res. (1990) 10:637; (MV-522); Kelner et aL, Anticancer Res (1995) 15:867 (MV-522); and Zhang et 
aL, Anticancer Drugs (1997) 8:696 (MV522)). 

The samples of libraries 15-20 are derived from two different patients (UC#2, and UC#3). 
The bFGF-treated HMVEC were prepared by incubation with bFGF at lOng/ml for 2 hrs; the VEGF- 
treated HMVEC were prepared by incubation with 20ng/ml VEGF for 2 hrs. Following incubation 
with the respective growth factor, the cells were washed and lysis buffer added for RNA preparation. 
The GRRpz and WOca cell lines were provided by Dr. Donna M. Peehl, Department of Medicine, 
Stanford University School of Medicine. GRRpz was derived from normal prostate epithelium. The 
WOca cell line is a Gleason Grade 4 cell line. 



Table 1. Description of cDNA Libraries 



Library 
0ib#) 


Description 


Nfumber of 
Clones in 
Library 


0 

i 


Artificial library composed of deselected clones (clones with no 
associated variant or cluster) 


673 


1 


Human Colon Cell Line Kml2 L4: High Metastatic Potential (derived 
from Kml2C) 


308731 


2 


Human Colon Cell Line Kml2C: Low Metastatic Potential 


284771 


3 


Human Breast Cancer Cell Line MDA-MB-23 1 : High Metastatic 
Potential; micro-mets in lung 


326937 


4 


Human Breast Cancer Cell Line MCF7: Non Metastatic 


318979 


8 


Human Lung Cancer Cell Line MV-522: High Metastatic Potential 


223620 


9 


Human Lung Cancer Cell Line UCP-3: Low Metastatic Potential 


312503 


12 


Human microvascular endothelial cells (HMEC) - UNTREATED 
(PCR (OligodT) cDNA library) 


41938 


13 


Human microvascular endothelial cells (HMEC) - bFGF TREATED 
(PCR (OligodT) cDNA library) 


42100 


14 


Human microvascular endothelial cells (HMEC) - VEGF TREATED 
(PCR (OligodT) cDNA library) 


42825 


15 


Normal Colon - UC#2 Patient (MICRODISSECTED PCR (OligodT) 
cDNA library) 


282722 


16 


Colon Tumor - UC#2 Patient (MICRODISSECTED PCR (OligodT) 
cDN A library* 


298831 


17 


Liver Metastasis from Colon Tumor of UC#2 Patient 
(MICRODISSECTED PCR (OligodT) cDNA library) 


303467 


18 


Normal Colon - UC#3 Patient (MICRODISSECTED PCR (OligodT) 
cDNA library) 


36216 


19 


Colon Tumor - UC#3 Patient (MICRODISSECTED PCR (OligodT) 
cDN A library) 


41388 


20 


Liver Metastasis from Colon Tumor of UC#3 Patient 
(MICRODISSECTED PCR (OligodT) cDNA hbrary) 


30956 


21 


GRRpz Cells derived from normal prostate epithelium 


164801 


22 


WOca Cells derived from Gleason Grade 4 prostate cancer epithelium 


162088 


23 


Normal Lung Epithelium of Patient #1006 (MICRODISSECTED PCR 
(OligodT) cDNA hbrary) 


306198 
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Library 
(lib#) 


Description 


Number ol 
Clones in 
Library 


24 


Primary tumor, Large Cell Carcinoma of Patient #1006 
(MICRODISSECTED PCR (OligodT) cDNA library) 


309349 


25 


Normal Prostate Epithelium from Patient IF97-2681 1 


279444 


26 


Prostate Cancer Epithelium Gleason 3+3 Patient IF97-268 11 


269406 


27 


Normal Breast Epithelium from Patient 515 


239494 


28 


Primary Breast tumor from Patient 5 15 


259960 


29 


Lymph node metastasis from Patient 515 


326786 


30 


Normal Prostate Epithelium from Chiron Patient ID 884 


298431 


31 


Prostate Cancer Epithelium (Gleason 4+4) from Chiron Patient ID 884 


331941 



Characterization of sequences in the libraries 

After using the software program Phred (ver 0.000925 x, Green and Weing,, ©1993-2000) to 
select those polynucleotides having the best quality sequence, the polyncueltoides were compared 

5 against the public databases to identify any homolgous sequences. The sequences of the isolated 

polynucleotides were first masked to eliminate low complexity sequences using the BLASTX masking 
program (Claverie "Effective Large-Scale Sequence Similarity Searches/' In: Computer Methods for 
Macromolecular Sequence Analysis, Doolittle, ed., Meth. EnzymoL 266:212-227 Academic Press, 
NY, NY (1996); see particularly Claverie, in "Automated DNA Sequencing and Analysis Techniques" 

10 Adams et al., eds., Chap. 36, p. 267 Academic Press, San Diego, 1994 and Claverie et al. Comput 
Chem. (1993) 17:191 ). Generally, masking does not influence the final search results, except to 
eliminate sequences of relatively little interest due to their low complexity, and to eliminate multiple 
"hits" based on similarity to repetitive regions common to multiple sequences, e.g., Alu repeats. The 
remaining sequences were then used in a BLASTN vs. GenBank search; sequences that exhibited 

15 greater than 70% overlap, 99% identity, and a p value of less than 1 x 10e-40 were discarded 

Sequences from this search also were discarded if the inclusive parameters were met, but the sequence 
was ribosomal or vector-derived. 

The resulting sequences from the previous search were classified into three groups (1, 2 and 3 
below) and searched in a BLASTX vs. NRP (non-redundant proteins) database search: (1) unknown 

20 (no hits in the GenBank search), (2) weak similarity (greater than 45% identity and p value of less than 
1 x 10e-5), and (3) high similarity (greater than 60% overlap, greater than 80% identity, and p value 
less than 1 x 10e-5). Sequences having greater than 70% overlap, greater than 99% identity, and p 
value of less than 1 x 10e-40 were discarded. 

The r emaining sequences were classified as unknown (no hits), weak similarity, and high 

25 similarity (parameters as above). Two searches were performed on these sequences. First, a BLAST 
vs. EST database search was performed and sequences with greater than 99% overlap, greater than 
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99% similarity and a p value of less than 1 x 10e-40 were discarded. Sequences with a p value of less 
than 1 x 10e-65 when compared to a database sequence of human origin were also excluded Second, a 
BLASTN vs. Patent GeneSeq database was performed and sequences having greater than 99% 
identity, p value less than 1 x 10e-40, and greater than 99% overlap were discarded. 

5 The remaining sequences were subjected to screening using other rules and redundancies in the 

dataset. Sequences with a p value of less than 1 x 10e-l 1 1 in relation to a database sequence of human 
origin were specifically excluded. The final result provided the 8064 sequences listed as SEQ ID 
NOS: 1-60 10 in the accompanying Sequence Listing and summarized in Table 2 (inserted prior to 
claims). Each identified polynucleotide represents sequence from at least a partial mRNA transcript. 

10 Summary of polynucleotides of the invention 

Table 2 (inserted prior to claims) provides a summary of polynucleotides isolated as described. 
Specifically, Table 2 provides: 1) the SEQ ID NO ("SEQ ID") assigned to each sequence for use in 
the present specification; 2) theCluster Identification No. ("CLUSTER"); 3) the Sequence Name 
assigned to each sequence; 3) the sequence name ("SEQ NAME") used as an internal identifier of the 

15 sequence; 4) the orientation of the sequence ("ORIENT') (either forward (F) or reverse (R)); 5) the 
name assigned to the clone from which the sequence was isolated ("CLONE ID"); and the name of the 
library from which the sequence was isolated ("LIBRARY"), where the notation indicates that name 
of the cell line or patient sample {e.g. , UC2-NormColon indicates the sequence was isolated from 
nonnla colon tissue of the patient assigned the idnetification UC#2). Because at least some of the 

20 provided polynucleotides represent partial mRNA transcripts, two or more polynucleotides may 

represent different regions of the same mRNA transcript and the same gene and/or may be contained 
within the same clone. Thus, for example, if two or more SEQ ID NOS: are identified as belonging to 
the same clone, then either sequence can be used to obtain the full-length mRNA or gene 

25 Example 2:Results of Public Database Search to Identify Function of Gene Products 

SEQ ID NOS:1-6010 were translated in all three reading frames, and the nucleotide sequences 
and translated amino acid sequences used as query sequences to search for homologous sequences in 
either the GenBank (nucleotide sequences) or Non-Redundant Protein (amino acid sequences) 
databases. Query and individual sequences were aligned using the BLAST 2.0 programs, available 

30 over the world wide at a site sponsored by the National Center for Biotechnology Information, which is 
supported by the National Library of Medicine and the National Institutes of Health (see also Altschul, 
et al. Nucleic Acids Res. (1997) 25:3389-3402). The sequences were masked to various extents to 
prevent searching of repetitive sequences or poly-A sequences, using the BLASTX program for 
masking low complexity as described above in Example 1. 
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Tables 3A and 3B (inserted prior to claims) provides the alignment summaries having a p 
value of 1 x 10e-2 or less indicating substantial homology between the sequences of the present 
invention and those of the indicated public databases. Specifically, Table 3 A provides the SEQ ID NO 
of the query sequence, the accession number of the GenBank database entry of the homologous 
5 sequence, and the individual p value of each alignment Table 3 A provides the SEQ ID NO of the 

query sequence, the accession number of the Non-Redundant Protein database entry of the homologous 
sequence, and the individual p value of each alignment. The alignments provided in Tables 3 A and 3B 
are the best available alignment to a DNA or amino acid sequence at a time just prior to filing of the 
present specification. The activity of the polypeptide encoded by the SEQ ID NOS listed in these 

10 tables can be extrapolated to be substantially the same or substantially similar to the activity of the 

reported nearest neighbor or closely related sequence. The accession number of the nearest neighbor is 
reported, providing a publicly available reference to the activities and functions exhibited by the 
nearest neighbor. The public information regarding the activities and functions of each of the nearest 
neighbor sequences is incorporated by reference in this application. Also incorporated by reference is 

15 all publicly available information regarding the sequence, as well as the putative and actual activities 
and functions of the nearest neighbor sequences listed in Tables 3 A and 3B and their related sequences. 
The search program and database used for the alignment, as well as the calculation of the p value are 
also indicated 

Full length sequences or fragments of the polynucleotide sequences of the nearest neighbors 
20 can be used as probes and primers to identify and isolate the full length sequence of the corresponding 
polynucleotide. The nearest neighbors can indicate a tissue or cell type to be used to construct a library 
for the full-length sequences of the corresponding polynucleotides. 

Example 3 Members of Protein Families 

25 SEQ ID NOS : 1-60 10 were used to conduct a profile search as described in the specification 

above. Several of the polynucleotides of the invention were found to encode polypeptides having 
characteristics of a polypeptide belonging to a known protein family (and thus represent members of 
these protein families) and/or comprising a known functional domain. Table 4 (inserted before claims) 
provides the SEQ ID NO: of the query sequence, the Sequence Name, the Cluster to which the 

30 sequence is assigned, a brief description of the profile hit, the orientation (Direction, "Dir") of the 
query sequence with respect to the individual sequence )where forward (for) indicates that the 
alignment is in the same direction (left to right) as the sequence provided in the Sequence Listing and 
reverse (rev) indicates that the alignment is with a sequence complementary to the sequence provided in 
the Sequence Listing), and the score of the profile hit 
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Some polynucleotides exhibited multiple profile hits where the queiy sequence contains 
overlapping profile regions, and/or where the sequence contains two different functional domains. 
Each of the profile hits of Table 4 is described in more detail below. The acronyms for the profiles 
(provided in parentheses) are those used to identify the profile in the Pfam, Prosite, and InterPro 
5 databases. The Pfam database can be accessed through web sites supported by Genome Sequencing 
Center at the Washington University School of Medicine or by the European Molecular Biology 
Laboratories in Heidelberg, Germany. The Prosite database can be accessed at the ExPASy Molecular 
Biology Server on the internet. The InterPro database can be accessed at a web site supported by the 
EMBL European Bioinformatics Institute. The public information available on the Pfam, Prosite, and 
10 InterPro databases regarding the various profiles, including but not limited to the activities, function, 
and consensus sequences of various proteins families and protein domains, is incorporated herein by 
reference. 

Seven Transmembrane Inte|gral Membrane Proteins — Rhodopsin Family (7tm 1 : Pfam 
Accession No. PFOOOOl). SEQ ID NOS:2973, 5467, and 5508 correspond to sequences encoding a 

15 polypeptides that are members of the seven transmembrane (7tm) receptor rhodopsin family. G- 

protein coupled receptors of the (7tm) rhodopsin family (also called R7G) are an extensive group of 
hormones, neurotransmitters, and light receptors which transduce extracellular signals by interaction 
with guanine nucleotide-binding (G) proteins (Strosberg, Eur. J. Biochem. (1991) 196:1; Kerlavage, 
Curr. Opin. Struct. Biol. (1991) 1:394; Probst et al., DNA Cell Biol. (1992) 11:1; Savarese et al., 

20 Biochem. J. (1992) 283: 1. The consensus pattern that contains the conserved triplet and that also 
spans the major part of the third transmembrane helix is used to detect this widespread family of 
proteins: [GSTALIVMFWCHGSTANCT^ [LIVMFT]- 
[GSTANC]-[LIVMFW [LIVM]. 

Ank Repeats f ANK: Pfam Accession No. PF0023). SEQ ID NOS:445, 487 and 3013 

25 represent polynucleotides encoding Ank repeat-containing proteins. The ankyrin motif is a 33 amino 
acid sequence named after the protein ankyrin which has 24 tandem 33-amino-acid motifs. Ank 
repeats were originally identified in the cell-cycle-control protein cdclO (Breeden et al., Nature (1987) 
329:65 1). Proteins containing ankyrin repeats include ankyrin, myotropin, I-kappaB proteins, cell 
cycle protein cdclO, the Notch receptor (Matsuno et al., Development (1997) 124(21):4265); G9a (or 

30 BAT8) of the class in region of the major histocompatibility complex (Biochem J. (1993) 290:8 1 1- 
818); FABP, GABP, 53BP2, Linl2, glp-1, SW14, and SW16. The functions of the ankyrin repeats 
are compatible with a role in protein-protein interactions (Bork, Proteins (1993) 17(4):363; Lambert 
and Bennet, Eur. J. Biochem. (1993) 211:1; Kerr et al., Current Op. Cell Biol. (1992) 4:496; Bennet et 
al. 9 J. Biol Chem. (1980) 255:6424). 

52 



INSDOCID: <WO 0214500A2_L> 



WO 02/14500 



PCT/US01/25840 



Basic Region Plus Leucine Zipper Transcription Factors (BZIP: Pfam Accession 
No. PF00170Y SEQIDNOS:108, 1714, 3931, and4356 represent polynucleotides encoding novel 
members of the family of basic region plus leucine zipper transcription factors. The bZIP superfamily 
(Hurst, Protein Prof. (1995) 2:105; and EUenberger, Curr. Opin. Struct. Biol (1994) 4:12) of 
5 eukaryotic DNA-binding transcription factors encompasses proteins that contain a basic region 

mediating sequence-specific DNA-binding followed by a leucine zipper required for dimerization. The 
consensus pattern for this protein family is: [KR]-x(l,3)-[RKSAQ]-N-x(2)-[SAQ](2)-x- 
[RKTAENQ]-x-R-x-[RK]. 

DEAD and DEAH box families ATP-dependent helicases (Dead box helic: Pf am Accession 

10 No. PF00270. SEQ ID NOS:38, 415, and 5756 correspond to sequences encoding a polypeptides that 
are members of the DEAD box family. A number of eukaryotic and prokaryotic proteins have been 
characterized (Schmid et al, Mol. Microbiol. (1992) <5:283; Linder et al, Nature (1989) 337:121; 
Wassarman et al, Nature (1991) 349:463) on the basis of their structural similarity. All are involved 
in ATP-dependent, nucleic-acid unwinding. All DEAD box family members of the above proteins 

15 share a number of conserved sequence motifs, some of which are specific to the DEAD family while 
others are shared by other ATP-binding proteins or by proteins belonging to the helicases 
'superfamily 5 (Hodgman, Nature (1988) 333:22 and Nature (1988) 333:578 (Errata)). One of these 
motifs, called the 'D-E-A-D-box', represents a special version of the B motif of ATP-binding proteins. 
Some other proteins belong to a subfamily which have His instead of the second Asp and are thus said 

20 to be T>-E-A-H-box ! proteins (Wassarman et al, Nature (1991) 349:463; Harosh et al., Nucleic Acids 
Res. (1991) 19:633 1; Koonin et al., J. Gen. Virol. (1992) 73:989). The following signature patterns 
are used to identify member for both subfamilies: 1) I]LIVMF](2)-D-E-A-D-piKEN]-x- 
[LIVMFYGSTN]; and 2) [GSAH]-x-[LIVMF](3)-D-E-[ALrV]-H-[NECR]. 

Defensins (defensins: Pfam Accession No. PF00323. SEQ IDNO:486 corresponds to a 

25 sequence encoding a polypeptide that is a member of the mammalian defensin family. Defensins are a 
family of structurally related cysteine-rich peptides which are active against many Gram-positive 
bacteria, fungi, and enveloped viruses (Lehrer et al, ASMNews (1990) 56:3 15-318; Lehrer et al, Cell 
(1991) 64:229-230; Kaganef al, Toxicology (1994) 87:131-149; Lehrer et al f Annu. Rev. Immunol 
(1993) 77:105-128; White et al, Curr. Opin. Struc/Biol (1995) 5:521-527). Some defensins inhibit 

30 corticotropoid-stimulated corticosteroid production and also play a significant role in innate immunity 
to infection and neoplasia. The peptides known to belong to this family range in length from 29 to 35 
amino acids and have seven invariant residues, including six cysteines that are all involved in intrachain 
disulfide bonds. The following consensus pattern is used to identify members of this protein family: C- 
x-C-x(3,5)-C-x(7)-G-x-C-x(9)-C-C 
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EF Hand (Efhand: Pfam Accession No. PF00036V SEQ ID NO:4373 corresponds to a 
polynucleotide encoding a member of the EF-hand protein family, a calcium binding domain shared by 
many calcium-binding proteins belonging to the same evolutionary family (Kawasaki et al 9 Protein. 
Prof (1995) 2:305-490). The domain is a twelve residue loop flanked on both sides by a twelve 

5 residue alpha-helical domain, with a calcium ion coordinated in a pentagonal bipyramidal 

configuration. The six residues involved in the binding are in positions 1, 3, 5, 7, 9 and 12; these 
residues are denoted by X, Y, Z 9 -Y, -X and -Z. The invariant Glu or Asp at position 12 provides two 
oxygens for liganding Ca (bidentate ligand). The consensus pattern includes the complete EF-hand 
loop as well as the first residue which follows the loop and which seem to always be hydrophobic: D- 

10 x-pNS]-{ILWW}-[DENSTG]-^ 
[LIVMFYW]. 

Epidermal Growth Factor (EGF: Pfam Accession No. PF00008V SEQ ID NO:3689 represents 
a polynucleotide encoding a member of the EGF family of proteins. The distinguishing characteristic 
of this family is the presence of a sequence of about thirty to forty amino acid residues found in 

15 epidermal growth factor (EGF) which has been shown to be present, in a more or less conserved form, 
in a large number of other proteins (Davis, New Biol (1990) 2:410-419; Blomquist et al, Proc. Natl 
Acad. Set U.S.A. (1984) 57:7363-7367; Barkert etal t ProteinNucl AcidEnz. (1986) 2P:54-86; 
Doolittle et al, Nature. (1984) 307:558-560; Appella et al, FEBSLett. (1988) 231 : 1-4; Campbell 
and Bork, Curr. Opin. Struct. Biol. (1993) 3:385-392). A common feature of the domain is that the 

20 conserved pattern is generally found in the extracellular domain of membrane-bound proteins or in 

proteins known to be secreted. The EGF domain includes six cysteine residues which have been shown 
to be involved in disulfide bonds. The main structure is a two-stranded beta-sheet followed by a loop 
to a C-terminal short two-stranded sheet Subdomains between the conserved cysteines strongly vary 
in length. The following consensus patterns are used to identify members of this family: C-x-C-x(5)- 

25 G-x(2)-C and C-x-C-x(s)-[GP]-[FYW]-x(4,8)-C. 

Ets Domain (Ets Cterm; Pfam Accession No. PF00178V SEQ ED NO:6 represents a 
polynucleotide encoding a polypeptide with C-terminal homology in the ETS domain. Proteins of this 
family contain a conserved domain, the "ETS-domain," that is involved in DNA binding. The domain 
appears to recognize purine-rich sequences; it is about 85 to 90 amino acids in length, and is rich in 

30 aromatic and positively charged residues (Wasylyk et al, Eur. J. Biochem. (1993) 211 :718). The ets 
gene family encodes a novel class of DNA-binding proteins, each of which binds a specific DNA 
sequence and comprises an ets domain that specifically interacts with sequences containing the 
common core tri-nucleotide sequence GGA. In addition to an ets domain, native ets proteins comprise 
other sequences which can modulate the biological specificity of the protein. Ets genes and proteins 
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are involved in a variety of essential biological processes including cell growth, differentiation and 
development, and three members are implicated in oncogenic process. 

Pleckstrin Homology CPU: Pfam Accession No. PF00169). SEQ IDNOS:228 and 6001 
correspond to polynucleotides encoding members of the PH family. The pleckstrin homology domain 

5 is a domain of about 1 00 residues that occurs in a wide range of proteins involved in intracellular 
signaling or as constituents of the cytoskeleton (Mayer etaL, Cell (1993) 73:629-630; Haslam et al., 
Nature (1993) 363:309-310; Musacchio et al, Trends Biochem. Sci. (1993) i8:343-348; Gibson et 
al, Trends Biochem, Sci. (1994) 7P:349-353; Pawson, Nature (1995) 373:573-580; Ingley and 
Hemmings, J. Cell. Biochem. (1994) 56:436-443; Saraste andHyvonen, Curr. Opin. Struct. Biol. 

10 (1995) 5:403-408). All known structures of PH domains have two perpendicular anti-parallel beta 
sheets, followed by a C-terminal amphipathic helix. The loops connecting Ifche beta-strands differ 
greatly in length, making the PH domain relatively difficult to detect (Riddihough, Nat Struct. Biol 
( 1 994) J :755-757). There are no totally invariant residues within this domain. 

Protein Kinase (protltinase: Pfam Accession No. PF00069V SEQ ID NOS:9, 39, 69, 1 18, 229 

1 5 and 4151 represent polynucleotides encoding protein kinases, which catalyze phosphorylation of 

proteins in a variety of pathways, and are implicated in cancer. Eukaryotic protein kinases (Hanks et 
aL 9 FASEBJ. (1995) 9:576; Hunter,Metf*. Enzymol. (1991) 200:3; Hanks et al. 9 Meth. Enzymol. 
(1991) 200:38; Hanks, Curr. Opin. Struct. Biol. (1991) 7:369; Hanks etaL, Science (1988) 241:42) 
belong to a very extensive family of proteins that share a conserved catalytic core common to both 

20 serine/threonine and tyrosine protein kinases. There are a number of conserved regions in the catalytic 
domain of protein kinases. The first region, located in the N-terminal extremity of the catalytic 
domain, is a glycine-rich stretch of residues in the vicinity of a lysine residue, which has been shown to 
be involved in ATP binding. The second region, located in the central part of the catalytic domain, 
contains a conserved aspartic acid residue that is important for the catalytic activity of the enzyme 

25 (Knighton et al 9 Science (1991) 253:407). 

The protein kinase profile includes two signature patterns for this second region: one specific 
for serine/threonine kinases and the other for tyrosine kinases. A third profile is based on the 
alignment in Hanks et ah (FASEBJ. (1995) P:576) and covers the entire catalytic domain. The 
consensus patterns are as follows: 1) [LIV]-G-{P}-G-{P}-[FYW 

30 [LWCATHPD}-x-[GSTACLIVM^ 

where K binds ATP; 2) [LIVMFYC]-x-[HY]-x^ where D is 

an active site residue; and 3) [LIVMFYC]-x-[HY^ 
where D is an active site residue. 
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Retroviral aspartvl protease (RVP: Pfam Accession No. PF00077V SEQIDNO:2038 
corresponds to a polynucleotide encoding a member of the Aspartyl Protease family. Aspartyl 
proteases, also know as acid proteases, are a widely distributed family of proteolytic enzymes known to 
exist in vertebrates, fungi, plants, retroviruses and some plant viruses (Foltmann, Essays Biochem. 

5 (1981) 1 7:52-84; Davies,^«ra. Rev. Biophys. Chem. (1990) 19: 189-215; Rao etal, Biochemistry 
(1991) 30:4663-4671). Most retroviral aspartyl proteases (RVP) are homodimers of a chain of about 
95 to 125 amino acids. In most retroviruses, the protease is encoded as a segment of a polyprotein 
which is cleaved during the maturation process of the virus. RVP is generally part of the pol 
polyprotein and, more rarely, of the gag polyprotein The consensus pattern is as follows: 

10 [LIVMFGAC]-[LIVMT^ 

[LIVMFGTA] [D is the active site residue]. 

Reverse Transcriptase (rvt: Pfam Accession No. PF00078Y SEQ IDNOS:1511 and 2514 
represent polynucleotides encoding reverse transcriptases, which occur in a variety of mobile elements, 
including retrotransposons, retroviruses, group II introns, bacterial msDNAs, hepadnaviruses, and 

15 caulimoviruses (Xiong and Eickbush, EMBO J (1990) 9:3353-3362). Reverse transcriptases catalyze 
RNA-template-directed extension of the 3 5 -end of a DNA strand by one deoxynucleotide at a time and 
require an RNA or DNA primer. 

Transforming growt h factor beta like domain (TGF beta: Pfam Acces sion No. PF00019V 
SEQ ID NO:5522 represents a polynucleotide encoding a polypeptide of the TGF-beta family. 

20 Proteins from the transforming growth factor-beta family are active as homo- or hetero-dimers with the 
two chains being linked by a single disulfide bond (Roberts and Sporn, In Peptide growth factors and 
their receptors, Handbook of Experimental Pharmacology, Vol. 95, pp4 19-475, Springer Verlag, 
Heidelberg, (1990); Burt, Biochem. Biophys. Res. Commun. (1992) 7S4:590-595; Burt and Law, 
Prog. Growth Factor Res. (1994) 5:99-1 18). It is known from X-ray studies that all the other 

25 cysteines of the sequence are involved in interchain disulfide bonds (Daopin et al, Science (1992) 

257:369-373). Members of this family can be recognized by the following consensus pattern: [LIVM]- 
x(2)-P-x(2)-[FY]-x(4)-C-x-G-x-C (The two C's are involved in disulfide bonds). 

WD Domain (WTMO) G-Beta Repeats (WD domain: Pfam Accession No. PF00400) . SEQ 
ID NO:l 17 represents a member of the WD domain/G-beta repeat family. Beta-transducin (G-beta) is 

30 one of the three subunits (alpha, beta, and gamma) of the guanine nucleotide-binding proteins (G 
proteins) which act as intermediaries in the transduction of signals generated by transmembrane 
receptors (Gilman, Annu. Rev. Biochem. (1987) 55:615). The alpha subunit binds to and hydrolyzes 
GTP; the beta and gamma subunits are required for the replacement of GDP by GTP as well as for 
membrane anchoring and receptor recognition. In higher eukaryotes, G-beta exists as a small 
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multigene family of highly conserved proteins of about 340 amino acid residues. Structurally, G-beta 
has eight tandem repeats of about 40 residues, each containing a central Trp-Asp motif (this type of 
repeat is sometimes called a WD-40 repeat). The consensus pattern for the WD domain/G-Beta repeat 
family is: [LIVMSTAC]-[LIVM^ 
5 [LIVMWSTAC]-x-[LIVM 

Zinc Finger, C2H2 Type (Zincfing C2H2: Pfam Accession No. PF000961 SEQIDNOS:61, 

502, 700, 847, 2034, 2054, 3403, 3524, 3653, 3723, 4688, and 4979 correspond to polynucleotides 

encoding members of the C2H2 type zinc finger protein family, which contain zinc finger domains that 

facilitate nucleic acid binding (Klug et at, Trends Biochem. Set (1987) 72:464; Evans et at, Cell 

10 (1988) 52:1; Pzyre et al,FEBSLett. (1988) 234:245; Miller etal.,EMBO J. (1985) 4:1609; and 

Berg, Proc. Natl Acad. Set USA (1988) 55:99). In addition to the conserved zinc ligand residues, a 

number of other positions are also important for the structural integrity of the C2H2 zinc fingers 

(Rosenfeld et al, J. Biomol Struct Dyn. (1993) 1 1 :557). The best conserved position, which is 

generally an aromatic or aliphatic residue, is located four residues after the second cysteine. The 

15 consensus pattern for C2H2 zinc fingers is: C-x(2,4)-C-x(3)-[LIV]^^ The 

* 

two Cs and two H ! s are zinc ligands. 

Zinc Finger. C2HC Type (Zincfing C2HC: Pfam Accession No. PF01530Y SEQID 

NO:3814 represents a member of the C2HC zinc finger family. The 18 residue C2HC domain is 

mainly found in the retroviral nucleocapsid protein, and is required for viral genome packaging and for 
20 the early infection process (Katz and Jentoft, Bioessays (1989) 1 1 : 176-18 1 ; Urbaneja et at, JMol 

Biol (1999) 2#7:59-75). In addition, the CCHC domain is found in eukaiyotic proteins involved in 

RNA binding or single-strand DNA binding. 

Zinc finger, C3HC4 type (RING finger), signature (Zincfing C3HC4; Pfam Accession 

No. PF00097Y SEQ ID NO:3 140 represents a polynucleotide encoding a polypeptide having a 
25 C3HC4 type zinc finger signature. A number of eukaryotic and viral proteins contain this signature, 

which is primarily a conserved cysteine-rich domain of 40 to 60 residues (Borden et al f Cum Opin. 

Struct Biol. (1996) 6:395) that binds two atoms of zinc, and is probably involved in mediating 

protein-protein interactions. The 3D structure of the zinc ligation system is unique to the RING 

domain and is referred to as the "cross-brace" motif The spacing of the cysteines in such a domain is 
30 C-x(2)-C-x(9 to 39)-C-x(l to 3)-H-x(2 to 3)-C-x(2)-C-x(4 to 48)-C-x(2)-C. The signature pattern for 

the C3HC4 finger is based on the domain's central region: C-x-H-x-|LIVMFY]^ 

Zinc finger. CCCH type. (Zincfing CCCH: Pfam Accession No. PF00642) . SEQIDNO:877 

corresponds to a polyncleotide encoding a member of the CCCH zinc finger protein family. This 

domain is present in many eukaryotic proteins, including zinc finger proteins involved in cell cycle or 
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growth phase-related regulation and regulatory proteins involved in regulating the response to growth 
factors. It has been shown that proteins containing the CCCH zinc finger interact with the 3 9 
untranslated region of various mRNA (Carballo etal, Science (1998) 257:1001-1005; Lai et al, Mol 
Cell Biol (1999) 19:43 1 1-4323) and that this domain is often present in two copies. The consensus 
5 pattern for the CCCH zinc fingers is: C-x8-C-x5-C-x3-H. 

Exam ple 4: Description of Libraries and Detection of Differential Expression 

The relative expression levels of the polynucleotides of Hie invention were assessed in several 
libraries prepared from various sources, including cell lines and patient tissue samples. Table 1 above 

10 provides a summary of these libraries, including the shortened library name, the mRNA source used to 
prepared the cDNA library, the "nickname" of the library that is used in the tables below (in quotes), 
and the approximate number of clones in the library. 

Each of the libraries is composed of a collection of cDNA clones that in turn are representative 
of the mRNAs expressed in the indicated mRNA source. In order to facilitate the analysis of the 

15 millions of sequences in each library, the sequences were assigned to clusters. The concept of "cluster 
of clones" is derived from a sorting/grouping of cDNA clones based on their hybridization pattern to a 
panel of roughly 300 7bp oligonucleotide probes (see Drmanac et al, Genomics (1996) 37(1):29). 
Random cDNA clones from a tissue library are hybridized at moderate stringency to 300 7bp 
oligonucleotides. Each oligonucleotide has some measure of specific hybridization to that specific 

20 clone. The combination of 300 of these measures of hybridization for 300 probes equals the 
"hybridization signature" for a specific clone. Clones with similar sequence will have similar 
hybridization signatures. By developing a sorting/grouping algorithm to analyze these signatures, 
groups of clones in a library can be identified and brought together computationally. These groups of 
clones are termed "clusters". Depending on the stringency of the selection in the algorithm (similar to 

25 the stringency of hybridization in a classic library cDNA screening protocol), the "purity" of each 
cluster can be controlled. For example, artifacts of clustering may occur in computational clustering 
just as artifacts can occur in "wet-lab" screening of a cDNA library with 400 bp cDNA fragments, at 
even the highest stringency. The stringency used in the implementation of cluster herein provides 
groups of clones that are in general from the same cDNA or closely related cDNAs. Closely related 

30 clones can be a result of different length clones of the same cDNA, closely related clones from highly 
related gene families, or splice variants of the same cDNA. 

Differential expression for a selected cluster was assessed by first detennining the number of 
cDNA clones corresponding to the selected cluster in the first library (Clones in 1 st ), and the 
determining the number of cDNA clones corresponding to the selected cluster in the second library 
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(Clones in 2 ). Differential expression of the selected cluster in the first library relative to the second 
library is expressed as a "ratio" of percent expression between the two libraries, hi general, the "ratio" 
is calculated by: 1) calculating the percent expression of the selected cluster in the first library by 
dividing the number of clones corresponding to a selected cluster in the first library by the total number 
of clones analyzed from the first library; 2) calculating the percent expression of the selected cluster in 
the second library by dividing the number of clones corresponding to a selected cluster in a second 
library by the total number of clones analyzed from the second library; 3) dividing the calculated 
percent expression from the first library by the calculated percent expression from the second library. 
If the "number of clones" corresponding to a selected cluster in a library is zero, the value is set at 1 to 
aid in calculation. The formula used in calculating the ratio takes into account the "depth" of each of 
the libraries being compared, i.e., the total number of clones analyzed in each library. 

In general, a polynucleotide is significantly differentially expressed between two samples when 
the ratio value is greater than at least about 2, preferably greater than at least about 3, more preferably 
greater than at least about 5 , where the ratio value is calculated using the method described above. 
The significance of differential expression is determined using a z score test (Zar, Biostatistical 
Analysis . Prentice Hall, Inc., USA, "Differences Between Proportions," pp 296-298 (1974). 

Using this approach, a numbed of polynucleotide sequences were identified as being 
differentially expressed between, for example, cells derived from high metastatic potential cancer tissue 
and low metastatic cancer cells, and between cells derived from metastatic cancer tissue and normal 
tissue. Evaluation of the levels of expression of the genes corresponding to these sequences can be 
valuable in diagnosis, prognosis, and/or treatment (e.g., to facilitate rationale design of therapy, 
monitoring during and after therapy, etc.). Moreover, the genes corresponding to differentially 
expressed sequences described herein can be therapeutic targets due to their involvement in regulation 
(e.g. , inhibition or promotion) of development of, for example, the metastatic phenotype. For example, 
sequences that correspond to genes that are increased in expression in high metastatic potential cells 
relative to normal or non-metastatic tumor cells may encode genes or regulatory sequences involved in 
processes such as angiogenesis, differentiation, cell replication, and metastasis. 

Detection of the relative expression levels of differentially expressed polynucleotides described 
herein can provide valuable information to guide the clinician in the choice of therapy. For example, a 
patient sample exhibiting an expression level of one or more of these polynucleotides that corresponds 
to a gene that is increased in expression in metastatic or high metastatic potential cells may warrant 
more aggressive treatment for the patient. In contrast, detection of expression levels of a 
polynucleotide sequence that corresponds to expression levels associated with that of low metastatic 
potential cells may warrant a more positive prognosis than the gross pathology would suggest. 
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A number of polynucleotide sequences of the present invention are differentially expressed 
between human microvascular endothelial cells (HMVEC) that have been treated with growth factors 
relative to untreated HMVEC, Sequences that are differentially expressed between growth factor- 
treated HMVEC and untreated HMVEC can represent sequences encoding gene products involved in 

5 angiogenesis, metastasis (cell migration), and other development and oncogenic processes. For 

example, sequences that are more highly expressed in HMVEC treated with growth factors (such as 
bFGF or VEGF) relative to untreated HMVEC can serve as drug targets for chemotherapeutics, e.g., 
decreasing expression of such up-regulated genes or inhibiting the activity of the encoded gene product 
would serve to inhibit tumor cell angiogenesis. Detection of expression of these sequences in colon 

10 cancer tissue can be valuable in determining diagnostic, prognostic and/or treatment information 

associated with the prevention of achieving the malignant state in these tissues, and can be important in 
risk assessment for a patient. A patient sample displaying an increased level of one or more of these 
polynucleotides may thus warrant closer attention or more frequent screening procedures to catch the 
malignant state as early as possible. 

15 The differential expression of the polynucleotides can thus be used as, for example, diagnostic 

and/or prognostic markers, for risk assessment, patient treatment and the like. These polynucleotides 
can also be used in combination with other molecular and/or biochemical markers. 

The differential expression data for polynucleotides of the invention that have been identified 
as being differentially expressed across various combinations of the libraries described above is 

20 summarized in Table 5 (inserted prior to the claims). Table 5 provides: 1) the Sequence Identification 
Number ("SEQ ID NO") assigned to the polynucleotide; 2) the cluster ("CLUSTER") to which the 
polynucleotide has been assigned as described above; 3) the library comparisons that resulted in 
identifcation of the polynucleotide as being differentially expressed ("PAIR AB") 9 where the cDNA 
libraries used are referenced by their library numbers; 4) the number of clones corresponding to the 

25 polynucleotide in the first library listed ("CLONES A"); 5) the number of clones corresponding to the 
polynucleotide in the second library listed ("CLONES B"); 6) the "RATIO PLUS" where the 
comparison resulted in a finding that the number of clones in library A is greater than the number of 
clones in library B; and 7) the "RATIO MINUS" where the comparison resulted in a finding that the 
number of clones in library B is greater than the number of clones in library A. 

30 Detection of expression of genes that correspond to the above polynucleotides may be of 

particular interest in diagnosis, prognosis, risk assessment, and monitoring of treatment. Furthermore, 
differential expression of a specific gene across multiple libraries can also be indicative of a gene 
whose expression is associated with, for example, suppression of the metastatic phenotype or with 
development of the cell toward a metastatic phenotype. For example, SEQ ID NO:3744 corresponds 
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to a gene that is expressed at relatively higher levels in metastatized colon tumor than in normal colon 
tissue. Thus a relatively increased level of expression of the gene corresponding to SEQ ID NO:3744 
may be used as marker of a metastatic or pre-metastatic colon eels either alone or in combination with 
other markers. 

5 Some polynucleotides exhibited similar differential expression trends in libraries of different 

tissue origin (see, e.g. SEQ ID NO:337). These data suggest that the differential expression patterns 
of some genes associated with development of tumors indicate a role for those genes that is non- 
specific to the tissue of origin. 

10 Example 5 : Detection of Differential Expression Using Arrays 

mRNA isolated from samples of cancerous and normal colon tissue obtained from patients 
were analyzed to identify genes differentially expressed in cancerous and normal cells. Normal and 
cancerous cells collected from ciyopreserved patient tissues were isolated using laser capture 
microdissection (LCM) techniques, which techniques are well known in the art (see, e.g., Ohyama et 

15 al. (2000) Biotechniques 29:530-6; Curran et al (2000) Mot Pathol. 53:64-8; Suarez-Quian et al. 
(1999) Biotechniques 26:328-35; Simoneef (1998) Trends Genet 14:272-6; Coniaef a/. (1997) J. 
Clin. Lab. Anal. 1 1:28-38; Emmert-Buck et al. (1996) Science 274:998-1001). 

Table 6 (inserted before the claims) provides information about each patient from which colon 
tissue samples were isolated, including: the Patient ID ("PT ff> H )and Path ReportED (}Path ID"), which 

20 are numbers assigned to the patient and the pathology reports for identification purposes; the group 
( ,f Grp ,f )to which the patients have been assigned; the anatomical location of the tumor ("Anatom 
Loc"); the primary tumor size ("Size"); the primary tumor grade ("Grade"); the identification of the 
histopathological grade ("Histo Grade"); a description of local sites to which the tumor had invaded 
("Local Invasion"); the presence of lymph node metastases ("LN Met"); the incidence of lymph node 

25 metastases (provided as a number of lymph nodes positive for metastasis over the number of lymph 
nodes examined) ("Incidence Lymphnode Met"); the "Regional Lymphnode Grade"; the identification 
or detection of metastases to sites distant to the tumor and their location ("Dist Met & Loc"); the grade 
of distant metastasis ("Dist Met Grade"); and general comments about the patient or the tumor 
("Comments"). Histophatology of all primary tumors incidated the tumor was adenocarcinmoa except 

30 for Patient ID Nos. 130 (for which no information was provided), 392 ( in which greater than 50% of 
the cells were mucinous carcinoma), and 784 (adenosquamous carcinoma). Extranodal extensions 
were described in three patients, Patient ID Nos. 784, 789, and 791. Lymphovascular invasion was 
described in Patient ID Nos. 128, 278, 517, 534, 784, 786, 789, 791, 890, and 892. Crohn's-like 



61 



3NSDOCID: <WO 021 450OA2_l_> 



WO 02/14500 



PCT/US01/25840 



infiltrates were described in seven patients, Patient ID Nos. 52, 264, 268, 392, 393, 784, and 791. 
Table 7 (below) provides information about the patients from whom the prostate tissue was isolated. 
Table 7. Prostate paitent data. 



Prostate Patient ID 


Tumor Gleason Grade 


Normal Prostate Description 


96 


3+3 Adenocarcinoma 


Normal prostate; Benign hyperplasia 


282 


44-3 Adenocarcinoma 


Normal prostate; Benign hyperplasia 


286 


3+3 Adenocarcinoma 


iNormal prostate; Benign hyperplasia 


294 


3+4 Adenocarcinoma 


formal prostate; Benign hyperplasia 


362 


3+3 Adenocarcinoma 


Normal prostate; Benign hyperplasia 


428 


4+3 Adenocarcinoma 


Normal prostate; Benign hyperplasia 


492 


3+3 Adenocarcinoma 


Normal prostate; Benign hyperplasia 


492 


3+3 Adenocarcinoma 


Normal prostate; Benign hyperplasia 


493 


3+4 Adenocarcinoma 


Normal prostate; Benign hyperplasia 


510 


3+3 Adenocarcinoma 


Normal Prostate; Benign hyperplasia 



5 Identification of differentially expressed genes 

cDNA probes were prepared from total RNA isolated from the patient cells described above. 

Since LCM provides for the isolation of specific cell types to provide a substantially homogenous cell 

sample, this provided for a similarly pure RNA sample. 

Total RNA was first reverse transcribed into cDN A using a primer containing a T7 RNA 
10 polymerase promoter, followed by second strand DNA synthesis. cDNA was then transcribed in vitro 

to produce antisense RNA using the T7 promoter-mediated expression (see, e.g., Luo et al (1999) 

Nature Med 5 : 1 1 7- 1 22), and the antisense RNA was then converted into cDNA. The second set of 

cDNAs were again transcribed in vitro, using the T7 promoter, to provide antisense RNA. Optionally, 

the RNA was again converted into cDNA, allowing for up to a third round of T7~mediated 
1 5 amplification to produce more antisense RNA. Thus the procedure provided for two or three rounds of 

in vitro transcription to produce the final RNA used for fluorescent labeling. 

Fluorescent probes were generated by first adding control RNA to the antisense RNA mix, and 

producing fluorescently labeled cDNA from the RNA starting material. Fluorescently labeled cDNAs 

prepared from the tumor RNA sample were compared to fluorescently labeled cDNAs prepared from 
20 normal cell RNA sample. For example, the cDNA probes from the normal cells were labeled with Cy3 

fluorescent dye (green) and the cDNA probes prepared from the tumor cells were labeled with Cy5 

fluorescent dye (red), and vice versa. 

Each array used had an identical spatial layout and control spot set. Each microarray was 

divided into two areas, each area having an array with, on each half, twelve groupings of 32 x 12 spots, 
25 for a total of about 9,2 1 6 spots on each array. The two areas are spotted identically which provide for 

at least two duplicates of each clone per array. 
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Polynucleotides for use on the arrays were obtained from both publicly available sources and 
from cDNA libraries generated from selected cell lines and patient tissues. PGR products of from 
about 0.5kb to 2.0 kb amplified from these sources were spotted onto the array using a Molecular 
Dynamics Gen III spotter according to the manufacturer's recommendations. For polynucleotides 

5 described herein, the microarray spot contained a clone having a cDNA from which the sequence was 
derived The first row of each of the 24 regions on the array had about 32 control spots, including 4 
negative control spots and 8 test polynucleotides. The test polynucleotides were spiked into each 
sample before the labeling reaction with a range of concentrations from 2-600 pg/slide and ratios of 
1:1. For each array design, two slides were hybridized with the test samples reverse-labeled in the 

10 labeling reaction. This provided for about four duplicate measurements for each clone, two of one color 
and two of the other, for each sample. 

Table 8 (inserted before the claims) describes sequences present on the arrays. Table 8 
includes: 1) athe SEQ ID NO of the sequence of the polynucleotide; and 2)the Spot ID, which is a 
unique identifier for each spot ontaining target sequence of interest on all arrays used 

15 The differential expression assay was performed by mixing equal amounts of probes from 

tumor cells and normal cells of the same patient. The arrays were prehybridized by incubation for 
about 2 hrs at 60°C in 5X SSC/0.2% SDS/1 mM EDTA, and then washed three times in water and 
twice in isopropanol. Following prehybridization of the array, the probe mixture was then hybridized 
to the array under conditions of high stringency (overnight at 42°C in 50% formamide, 5X SSC, and 

20 0.2% SDS. After hybridization, the array was washed at 55°C three times as follows: 1) first wash in 
IX SSC/0.2% SDS; 2) second wash in 0. IX SSC/0.2% SDS; and 3) third wash in 0.1X SSC. 

The arrays were then scanned for green and red fluorescence using a Molecular Dynamics 
Generation HI dual color laser-scanner/detector. The images were processed using BioDiscovery 
Autogene software, and the data from each scan set normalized to provide for a ratio of expression 

25 relative to normal. Data from the microarray experiments was analyzed according to the algorithms 
described in U.S. application serial no. 60/252,358, filed November 20, 2000, by E.J. Moler, M.A. 
Boyle, and F.M. Randazzo, and entitled "Precision and accuracy in cDNA microarray data," which 
application is specifically incorporated herein by reference. 

The experiment was repeated, this time labeling the two probes with the opposite color in 

30 order to perform the assay in both "color directions." Each experiment was sometimes repeated with 
two more slides (one in each color direction). The level fluorescence for each sequence on the array 
expressed as a ratio of the geometric mean of 8 replicate spots/genes from the four arrays or 4 replicate 
spots/gene from 2 arrays or some other permutation. The data were normalized using the spiked 
positive controls present in each duplicated area, and the precision of this normalization was included 
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in the final determination of the significance of each differential. The fluorescent intensity of each spot 
was also compared to the negative controls in each duplicated area to determine which spots have 
detected significant expression levels in each sample. 

A statistical analysis of the fluorescent intensities was applied to each set of duplicate spots to 
assess the precision and significance of each differential measurement, resulting in a p-value testing the 
null hypothesis that there is no differential in the expression level between the tumor and normal 
samples of each patient. During initial analysis of the microarrays, the hypothesis was accepted if p > 
10~ 3 , and the differential ratio was set to 1.000 for those spots. All other spots have a significant 
difference in expression between the tumor and normal sample. If the tumor sample has detectable 
expression and the normal does not, the ratio is truncated at 1000 since the value for expression in the 
normal sample would be zero, and the ratio would not be a mathematically useful value (e.g., infinity). 
If the normal sample has detectable expression and the tumor does not, the ratio is truncated to 0.00 1, 
since the value for expression in the tumor sample would be zero and the ratio would not be a 
mathematically useful value. These latter two situations are referred to herein as "on/off." Database 
tables were populated using a 95% confidence level (p>0.05). 

Table 8 (inserted before the claims) provides the results for gene products differentially 
expressed in the colon tumor samples relative to normal tissue samples. Table 8 includes: 1) the SEQ 
ID NO; 2) the spot identification number ("SpotBD"); 3) the percentage of patients tested in which 
expression levels of the gene (as detected using the correponding clone) was at least 2-fold greater in 
cancerous colon tissue (primary colon tumor) than in matched normal tissue ("Colon>2x T/N "); 4) the 
percentage of patients tested in which expression levels of the gene was less than or equal to one-half 
of the expression level in matched normal cells ("Colon <=halfit T/N"); and 5) the colon number ratios, 
indicating the number of patients upon which the provided ratios was based. 

Table 9 below provides the data for differential expression analysis on the arrays using 
samples from metastazed colon tissue. In this example, the samples used for hybridization to 
sequences on the microarray were derived from the matched metastasized (MT) colon tissue and 
normal (N) colon tissues of the patients. Table 9 includes: 1) the SEQ ID NO; 2) the percentage of 
patients tested in which expression levels of the gene (as detected using the correponding clone) was at 
least 2-fold greater in metastasized cancerous colon tissue (MT) than in matched normal tissue 
( ft Colon>2x MT/N "); 5) the percentage of patients tested in which expression levels of the gene was 
less than or equal to one-half of the expression level in matched normal cells ("Colon <?=halfx T/N"); 
and 8) the colon number ratios, indicating the number of patients upon which the provided ratios was 
based. The corresponding data with the same sequence of the colon tumor tissue versus matched 
normal colon tissue (T/N) are provided for convenience in comparison. 



64 



WO 02/14500 



PCT/US01/25840 



Table 9. Polynucleotides Corresponding to Differnetially Expressed Genes in Metastasized 



Colon Cancer Tissue 









Colon MT/N 














Num Ratios 


Colon T/N 


Colon T/N 


Colon T/N 


SEQID 
NO 


Colon 
MT/N>2x 


Colon MT/N 
<halfx 


KJy V> , XVJlIt» 


>2x 


<halfx 


Num Ratios 


217 


40.0 


0.0 


5.0 


0.0 


. 50.0 


8.0 


324 


0.0 


40.0 


5.0 


0.0 


37.5 


8.0 


1653 


0.0 


40.0 


5.0 


0.0 


37.5 


8.0 


1972 


40.0 


0.0 


5.0 


0.0 


42.9 


7.0 


2346 


20.0 


40.0 


5.0 


1 0.0 


85.7 


7.0 


2352 


i 20.0 


80.0 


5.0 


0.0 


71.4 


7.0 


2353 


20.0 


40.0 


5.0 


0.0 


85.7 


7.0 


2647 


0.0 


40.0 


5.0 


0.0 


62.5 


8.0 


5341 


0.0 


40.0 


5.0 


0.0 


37.5 


8.0 



Table 10 below provides the data for differential expression analysis on the arrays using 
samples from matched cancerous and normal prostate tissue (PT/N)- Table 10 includes: 1) the SEQ ID 
NO; 2) the percentage of patients tested in which expression levels of the gene (as detected using the 
correponding clone) was at least 2-fold greater in metastasized cancerous prostate tissue (PT) than in 
matched normal tissue ("Colon>2x PT/N "); 3) the percentage of patients tested in which expression 
levels of the gene was less than or equal to one-half of the expression level in matched normal cells 
("Colon <=halfc PT/N"); and 4) the prostate PT/N number ratios, indicating the number of patients 
upon which the provided ratios was based. The corresponding data with the same sequences for the 
colon tumor versus normal (T/N) and metastasized colon tissue versus normal (MT/N) are provided for 
convenience in comparison. 

Table 10. Polynucleotides Corresponding to Differnetially Expressed Genes in Prostate 



Cancer Tissue 



SEQ 
ID 
NO 


Prostate Prostate 
(PT/N) (PT/N) 
>2x <habx 


Prostate 
(PT/N) 
Num 
Ratios 


Colon 
T/N 
>2x 


Colon 
T/N 
<halfx 


Colon T/N 
Num 
Ratios 


Colon 
MT/N 
>2x 


Colon 
MT/N 
<halfe 


Colon 
MT/N 
Num 
Ratios 


139 


11.1 


33.3 


9.0 


0.0 


50.0 


8.0 








490 


37.5 


12.5 


8.0 


0.0 


71.4 


7.0 








629 


33.3 


11.1 


9.0 














644 


12.5 


37.5 


8.0 


0.0 


42.9 


7.0 








1674 


33.3 


0.0 


9.0 














2346 


37.5 


25.0 


8.0 


0.0 


85.7 


7.0 


20.0 


40.0 


5.0 


2352 


37.5 


12.5 


8.0 


0.0 


71.4 


I 7.0 


20.0 


80.0 


5.0 


2420 


22.2 


33.3 


9.0 
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Prostate 












Colon 


SEQ 


Prostate Prostate 


(PT/N) 


Colon 


Colon 


Colon T/N 


Colon 


Colon 


MT/N 


ID 


(PT7N) (PT/N) 


Num 


T/N 


T/N 


Num 


MT/N 


MT/N 


Num 


NO 


>2x <halfe 


Ratios 


>2x 


<halfx 


Ratios 


>2x 


<halfx 


Ratios 


3296 


33.3 0.0 


9.0 


0.0 


37.5 


8.0 









Example 6: Antisense Regulation of Gene Expression 

The expression of the differentially expressed genes represented by the polynucleotides in the 
cancerous cells can be further analyzed using antisense knockout technology to confirm the role and 

5 function of the gene product in tumorigenesis, e.g. , in promoting a metastatic phenotype. 

Methods for analysis using antisense technology are well known in the art. Eor example, a 
number of different oligonucleotides complementary to the mRNA generated by the differentially 
expressed genes identified herein can be designed as antisense oligonucleotides, and tested for their 
ability to suppress expression of the genes. Sets of antisense oligomers specific to each candidate 

10 target are designed using the sequences of the polynucleotides corresponding to a differentially 
expressed gene and the software program HYBsimulator Version 4 (available for Windows 
95/Windows NT or for Power Macintosh, RNAture, Inc. 1003 Health Sciences Road, West, Irvine, CA 
92612 USA). Factors considered when designing antisense oligonucleotides include: 1) the secondary 
structure of oligonucleotides; 2) the secondary structure of the target gene; 3) the specificity with no or 

15 minimum cross-hybridization to other expressed genes; 4) stability; 5) length and 6) terminal GC 
content. The antisense oligonucleotide is designed to so that it will hybridize to its target sequence 
under conditions of high stringency at physiological temperatures (e.g., an optimal temperature for the 
cells in culture to provide for hybridization in the cell, e.g., about 37°C), but with minimal formation of 
homodimers. 

20 Once synthesized and quantitated, the oligomers are screened for efficiency of a transcript 

knock-out in a panel of cancer cell lines. The efficiency of the knock-out is determined by analyzing 
mRNA levels using lightcycler quantification. The oligomers that resulted in the highest level of 
transcript knock-out, wherein the level was at least about 50%, preferably about 80-90%, up to 95% or 
more up to undetectable message, are selected for use in a cell-based proliferation assay, an anchorage 

25 independent growth assay, and an apoptosis assay. 

For example, where the polynucleotide is identified as having a role in colon cancer, the ability 
of the corresponding designed antisense oligonucleotide to inhibit gene expression is tested through 
transfection into SW620 colon colorectal carcinoma cells. For each transfection mixture, a carrier 
molecule, preferably a lipitoid or cholesteroid, is prepared to a working concentration of 0.5 mM in 

30 water, sonicated to yield a uniform solution, and filtered through a 0.45 |im PVDF membrane. The 
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antisense or control oligonucleotide is then prepared to a working concentration of 100 \xM in sterile 
Millipore water. The oligonucleotide is further diluted in OptiMEM™ (Gibco/BRL), in a microfuge 
tube, to 2 |aM, or approximately 20 ^ig oligo/ml of OptiMEM™. In a separate microfuge tube, lipitoid 
or cholesteroid, typically in the amount of about 1.5-2 nmol lipitoid/^ig antisense oligonucleotide, is 
diluted into the same volume of OptiMEM™ used to dilute the oligonucleotide. The diluted antisense 
oligonucleotide is immediately added to the diluted lipitoid and mixed by pipetting up and down. 
Oligonucleotide is added to the cells to a final concentration of 30 nM. 

The level of target mRNA that corresponds to a target gene of interest in the transfected cells 
is quantitated in the cancer cell lines using the Roche LightCycler™ real-time PCR machine. Values 
for the target mRNA are normalized versus an internal control (e.g., beta-actin). For each 20 fxl 
reaction, extracted RNA (generally 0.2-1 \xg total) is placed into a sterile 0.5 or 1.5 ml microcentrifuge 
tube, and water added to a total volume of 12.5 To each tube 7.5 |xl of a buffer/enzyme mixture is 
added, which is prepared by mixing (in the order listed) 2.5 jil H 2 0, 2.0 jil 10X reaction buffer, 10 \d 
oligo dT (20 pmol), 1.0 \il dNTP mix (10 mM each), 0.5 *il RNAsin® (20u) (Ambion, Inc., ffialeah, 
FL), and 0.5 pi MMLV reverse transcriptase (50u) (Ambion, Inc.). The contents are mixed by 
pipetting up and down, and the reaction mixture incubated at 42°C for 1 hour. The contents of each 
tube are centrifuged prior to amplification. 

An amplification mixture is prepared by mixing in the following order: IX PCR buffer II, 3 
mM MgCl 2 , 140 pM each dNTP, 0.175 pmol each oligo, 1:50,000 dil of SYBR® Green, 0.25 mg/ml 
BSA, 1 unit Taq polymerase, and H 2 0 to 20 (PCR buffer II is available in 10X concentration from 
Perkin-Elmer, Norwalk, CT). In IX concentration it contains 10 mM Tris pH 8.3 and 50 mM KC1. 
SYBR® Green (Molecular Probes, Eugene, OR) is a dye which fluoresces when bound to double 
stranded DNA. As double stranded PCR product is produced during amplification, the fluorescence 
from SYBR® Green increases. To each 20 jul aliquot of amplification mixture, 2 pi of template RT are 
added, and amplification carried out according to standard protocols. 

The results can be expressed as the percent decrease in expression of the corresponding gene 
product relative to non-transfected cells, vehicle-only transfected (mock-transfected) cells, or cells 
transfected with reverse control oligonucleotides. 

Example 7: Effect of Expression on Proliferation 

The effect of gene expression on the inhibition of cell proliferation can be assessed in, for 
example, metastatic breast cancer cell lines (MDA-MB-231 ("231")), SW620 colon colorectal 
carcinoma cells, or SKOV3 cells (a human ovarian carcinoma cell line). 
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Cells are plated to approximately 60-80% confluency in 96-well dishes. Antisense or reverse 
control oligonucleotide iss diluted to 2 pM in OptiMEM™ and added to OptiMEM™ into which the 
delivery vehicle, lipitoid 1 16-6 in the case of SW620 cells or 1:1 lipitoid lxholesteroid 1 in the case of 
MDA-MB-23 1 cells, had been diluted. The oligo/delivery vehicle mixture is then further diluted into 
5 medium with serum on the cells. The final concentration of oligonucleotide for all experiments was 
300 nM, and the final ratio of oligo to delivery vehicle for all experiments iss 1.5 nmol lipitoid/ng 
oligonucleotide. 

Antisense oligonucleotides are prepared as described above (see Example 6). Cells are 
transfected overnight at 37°C and the transfection mixture replaced with fresh medium the next 
10 morning. Transfection is carried out as described above in Example 3 . 

Those antisense oligonucleotides that inhibit proliferation represent genes that play a role in 
production or maintenance of the cancerous phenotype. 

Example 8 : Effect of Gene Expression on Colony Formation 

15 The effect of gene expression upon colony formation of, for example, SW620 cells, SKOV3 

cells, and MD-MBA-23 1 cells can be tested in a soft agar assay. Soft agar assays are conducted by 
first establishing a bottom layer of 2 ml of 0.6% agar in media plated fresh within a few hours of 
layering on the cells. The cell layer is formed on the bottom layer by removing cells transfected as 
described above from plates using 0.05% trypsin and washing twice in media. The cells are counted in 

20 a Coulter counter, and resuspended to 10 6 per ml in media. 10 pi aliquots are placed with media in 96- 
well plates (to check counting with WST1), or diluted further for the soft agar assay. 2000 cells are 
plated in 800 pi 0.4% agar in duplicate wells above 0.6% agar bottom layer. After the cell layer agar 
solidifies, 2 ml of media is dribbled on top and antisense or reverse control oligo (produced as 
described in Example 6) added without delivery vehicles. Fresh media and oligos are added every 3-4 

25 days. Colonies usually are expected to form in 10 days to 3 weeks. Fields of colonies are counted by 
eye. Wst-1 metabolism values can be used to compensate for small differences in starting cell number. 
Larger fields can be scanned for visual record of differences. 

Those antisense oligonucleotides that inhibited colony formation represent genes that play a 
role in production or maintenance of the cancerous phenotype. 

30 

Example 9: Induction of Cell Death upon Depletion of Polypeptides by Depletion of mRNA 
f 11 Antisense Knockout"^ 

In order to assess the effect of depletion of a target message upon cell death, SW620 cells, or 
other cells derived from a cancer of interest, are transfected for proliferation assays. For cytotoxic 
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effect in the presence of cisplatin (cis), the same protocol is followed but cells are left in the presence 
of 2 pM drug. Each day, cytotoxicity was monitored by measuring the amount of LDH enzyme 
released in the medium due to membrane damage. The activity of LDH is measured using the 
Cytotoxicity Detection Kit from Roche Molecular Biochemicals. The data is provided as a ratio of 
5 LDH released in the medium vs. the total LDH present in the well at the same time point and treatment 
(rLDH/tLDH). A positive control using antisense and reverse control oligonucleotides for BCL2 (a 
known anti-apoptotic gene) is included; loss of message for BCL2 leads to an increase in cell death 
compared with treatment with the control oligonucleotide (background cytotoxicity due to 
transfection). 

10 

Example 10: Functional Analysis of Gene Products Differentially Expressed in Cancer 

The gene products of sequences of a gene differentially expressed in cancerous cells can be 
further analyzed to confirm the role and function of the gene product in tumorigenesis, e.g., in 
promoting or inhibiting development of a metastatic phenotype. For example, the function of gene 
15 products corresponding to genes identified herein can be assessed by blocking function of the gene 
products in the cell. For example, where the gene product is secreted or associated with a cell surface 
membrane, blocking antibodies can be generated and added to cells to examine the effect upon the cell 
phenotype in the context of, for example, the transformation of the cell to a cancerous, particularly a 
metastatic, phenotype. 

20 Where the gene product of the differentially expressed genes identified herein exhibits 

sequence homology to a protein of known function (e.g., to a specific kinase or protease) and/or to a 
protein family of known function (e.g., contains a domain or other consensus sequence present in a 
protease family or in a kinase family), then the role of the gene product in tumorigenesis, as well as the 
activity of the gene product, can be examined using small molecules that inhibit or enhance function of 

25 the corresponding protein or protein family. 

Additional functional assays include, but are not necessarily limited to, those that analyze the 
effect of expression of the corresponding gene upon cell cycle and cell migration. Methods for 
performing such assays are well known in the art 

30 Example 1 1 : Contig Assembly and Additional Gene Characterization 

The sequences of the polynucleotides provided in the present invention can be used to extend 
the sequence information of the gene to which the polynucleotides correspond (e.g., a gene, or mRNA 
encoded by the gene, having a sequence of the polynucleotide described herein). This expanded 
sequence information can in turn be used to further characterize the corresponding gene, which in turn 
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provides additional information about the nature of the gene product {e.g., the normal function of the 
gene product). The additional information can serve to provide additional evidence of the gene 
product's use as a therapeutic target, and provide further guidance as to the types of agents that can 
modulate its activity. 

5 For example, a contig can be assembled using the sequence of a polynucleotide described 

herein. A "contig" is a contiguous sequence of nucleotides that is assembled from nucleic acid 
sequences having overlapping (e.g. , shared or substantially similar) sequence information. The 
sequences of publicly-available ESTs (Expressed Sequence Tags) and the sequences of various clones 
from several cDNA libraries synthesized at Chiron were used in the contig assembly. The contig is 

10 assembled using the software program Sequencher, version 4.05, according to the manufacturers 
instructions. The resulting contig can then be used to search both the public databases as well as 
databases internal to the applicatns to matchthe polynucleotide contiged with homology data and/or 
differential gene expressed data. 

The sequence information obtained in the contig assembly described above can be used to 

15 obtain a consensus sequence derived from the contig using the Sequencher program. The consensus 
sequence can then be used as a query sequence in a BLASTN search of the DGTI DoubleTwist Gene 
Index (DoubleTwist, Inc., Oakland, CA), which contains all the EST and non-redundant sequence in 
public databases. Alternatively, a sequence of a polynucleotide described herein can be used directly as 
a query sequence in a BLASTN search of the DGTI DoubleTwist Gene Index. 

20 Through contig assembly and the use of homology searching software programs, the sequence 

information provided herein can be readily extended to confirm, or confirm a predicted, gene having the 
sequence of the polynucleotides described in the present invention. Further the information obtained 
can be used to identify the function of the gene product of the gene corresponding to the 
polynucleotides described herein. While not necessary to the practice of the invention, identification of 

25 the function of the corresponding gene, can provide guidance in the design of therapeutics that target 
the gene to modulate its activity and modulate the cancerous phenotype (e.g., inhibit metastasis, 
proliferation, and the like). 

Although the foregoing invention has been described in some detail by way of illustration and 
30 example for purposes of clarity of understanding, it is readily apparent to those of ordinary skill in the 
art in light of the teachings of this invention that certain changes and modifications may be made 
thereto without departing from the spirit or scope of the appended claims. Those skilled in the art will 
recognize, or be able to ascertain, using not more than routine experimentation, many equivalents to the 

70 



INSDOCID: <WO 021 4500A2_I_> 



WO 02/14500 



PCT/US01/25840 



specific embodiments of the invention described herein. Such specific embodiments and equivalents 
are intended to be encompassed by the following claims. 
Deposit Info rmation . 

A deposit of the biological materials in the tables referenced below was made with the 
5 American Type Culture Collection, 10801 University Blvd., Manasas, VA 201 10-2209, under the 
provisions of the Budapest Treaty, on or before the filing date of the present application. The 
accession number indicated is assigned after successful viability testing, and the requisite fees were 
paid. Access to said cultures will be available during pendency of the patent application to one 
determined by the Commissioner to be entitled to such under 37 C.F.K §1.14 and 35 U.S.C. §122. 

10 All restriction on availability of said cultures to the public will be irrevocably removed upon the 

granting of a patent based upon the application. Moreover, the designated deposits will be maintained 
for a period of thirty (30) years from the date of deposit, or for five (5) years after the last request for 
the deposit; or for the enforceable life of the U.S. patent, whichever is longer. Should a culture 
become nonviable or be inadvertently destroyed, or, in the case of plasmid-containing strains, lose its 

15 plasmid, it will be replaced with a viable culture(s) of the same taxonomic description. 

These deposits are provided merely as a convenience to those of skill in the art, and are not an 
admission that a deposit is required. A license may be required to make, use, or sell the deposited 
materials, and no such license is hereby granted. The deposit below was received by the ATCC on or 
before the filing date of the present application. 

20 



Table 11. Cell Lines Deposited with ATCC 



Cell Line 


Deposit Date 


ATCC Accession No. 


CMCC Accession No. 


KM12L4-A 


March 19, 1998 


CRL-12496 


11606 | 


Kml2C 


May 15, 1998 


CRL-12533 


11611 


MDA-MB- 
231 


May 15, 1998 


CRL- 12532 


10583 


MCF-7 


October 9, 1998 


CRL- 12584 


10377 



In addition, pools of selected clones, as well as libraries containing specific clones, were 



assigned an "ES" number (internal reference) and deposited with the ATCC. Table 13 below provides 
the ATCC Accession Nos. of the ES deposits, all of which were deposited on or before June 13, 2000. 
25 The names of the clones contained within each of these deposits are provided in the tables numbered 22 
and greater (inserted before the claims). 



Table 12: Pools of Clones and Libraries Deposited with ATCC on or before June 13, 2000. 



Library No. 


CMCC No. 


ATCC Accession No. 


Library No. 


CMCC No. 


ATCC Accession No. 


ES168 


5276 


PTA-2027 


ES 189 


5304 


PTA-2052 


ES169 


5277 


PTA-2028 


ES 190 


5305 


PTA-2053 | 


ES170 


5284 


PTA-2029 


ES 191 


5306 


PTA-2054 


ES171 


5285 


PTA-2030 


ES 192 


5307 


PTA-2055 
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Library No. 


CMCC No. 


ATCC Accession No. 


Library No. 


CMCC No. 


A 1 cc Accession No. 


ES172 


5286 


PTA-2031 


ES 193 


5308 


PTA-2056 


ES173 


5287 


PTA-2032 


ES 194 1 


5309 


PTA-2057 


ES174 


5288 


PTA-2033 


ES 195 


5310 


PTA-2058 


ES175 


5289 


PTA-2034 


ES 196 


5311 


PTA-2059 


ES176 


5290 


PTA-2035 


ES 197 


5312 


PTA-2060 


ESI 77 


5291 


PTA-2036 


ES 198 


5313 


PTA-2061 


ES178 


5292 


PTA-2037 


ES 199 


5314 


PTA-2062 


ES179 


5293 


PTA-2038 


ES 200 


5315 


PTA-2048 


ES180 


5294 


PTA-2039 


ES 201 


5316 


PTA-2049 


ES181 


5295 


PTA-2040 


ES 202 


5317 


PTA-2063 


ESI 82 


5296 


PTA-2041 


ES 203 


5318 


PTA-2064 


ES183 


5297 


r 1 A-2U4Z 




Dj If 




ES184 


5298 


PTA-2043 


ES205 


5320 


PTA-2066 ! 


ES185 


5299 


PTA-2044 


ES206 


5321 


PTA-2067 


ES 186 


5301 


PTA-2045 


ES207 


5322 


PTA-2068 


ES 187 


5302 


PTA-2046 


ES208 


5253 


PTA-2050 


ES 188 


5303 


PTA-2047 


ES209 


5324 


PTA-2051 



Table 13 (inserted before the claims) provides the names of the clones in each of the above libraries. 

Retrieval of Individual Clones from Deposit of Pooled Clones, Where the ATCC deposit is 
composed of a pool of cDNA clones or a library of cDNA clones, the deposit was prepared by first 

5 transfecting each of the clones into separate bacterial cells. The clones in the pool or library were then 
deposited as a pool of equal mixtures in the composite deposit Particular clones can be obtained from 
the composite deposit using methods well known in the art. For example, a bacterial cell containing a 
particular clone can be identified by isolating single colonies, and identifying colonies containing the 
specific clone through standard colony hybridization techniques, using an oligonucleotide probe or 

10 probes designed to specifically hybridize to a sequence of the clone insert (e.g. , a probe based upon 
unmasked sequence of the encoded polynucleotide having the indicated SEQ ID NO). The probe 
should be designed to have a T m of approximately 80°C (assuming 2°C for each A or T and 4°C for 
each G or C). Positive colonies can then be picked, grown in culture, and the recombinant clone 
isolated. Alternatively, probes designed in this manner can be used to PCR to isolate a nucleic acid 

1 5 molecule from the pooled clones according to methods well known in the art, e.g. , by purifying the 

cDNA from the deposited culture pool, and using the probes in PCR reactions to produce an amplified 
product having the corresponding desired polynucleotide sequence. 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


OR1EN 
T 


CLONE ID 


LIBRARY 


1 


498509 


2102H02ez43 275066 


F 


M00063514B.E04 


GRRpz 


At 


644927 


2102 F08 ez43 275160 


F 


M00063493D G07 


GRRpz 


*\ 

J 


621081 


2102 G10 ez43 275193 


F 


M00063507B:B08 


GRRpz 


A 


558900 


2l02F14ez43 275256 


F 


M00063496BF07 


GRRdz 


< 


464701 

*TVJ*T i 71 


2102 A16 ez43 275283 


F 

X 


M00063 157BB09 


GRRdz 


O 


558147 

J JO X*T / 


2102 Bl 8 az43 275316 


F 

X 


M00063457 AB 12 


GRRdz 


7 


370 1 54 


2102T24&743 275419 


F 

X 


M00063601DC05 


GRRpz 


o 


643600 

UTjUV/7 


9103 F01 ff^43 275432 


F 


M00063 866BH02 


GRRpz 


Q 


377606 

Jit \JZs\J 


2103 M06 ffr43 275519 


F 


M00063943BG12 


GRRpz 


10 


554305 


2103 K10 ez43 275581 


F 


M00063927AB08 


GRRpz 


1 1 

i j. 


4093 S3 1 


2103 Bl 3 ffz43 275620 


F 


M00063803BF11 


GRRpz 


19 


380514 

J OUJ X"T 


2103 El 7 ez43 275687 


F 


M00063864A:H10 


GRRpz 


13 


147970 


2103 J21 2z43 275756 


F 


M00063923BA04 


GRRpz 


14 


884 


2128K06az43 277507 


F 


M00028627BF12 


MV-522 




1 88377 


91 98 HI 5 »43 277648 


F 


M00028359D:F09 


MV-522 


16 
ID 


1 5561 5 


9128P17az43 277688 


F 


M00032476DF07 


MV-522 


17 
1 / 


195604 


9198 PI 8 f^743 277704 


F 


M00032477A:B02 


MV-522 


1 o 


439 1 50 


2128A24ffz43 277785 


F 


M00028061D:D10 


MV-522 


10 


490735 


2130 N02 sz43 277847 


F 


M00032736A.B06 


MV-522 




79838 


2130 005 sz43 277896 


F 


M00032745C:F03 


MV-522 


9 1 


436888 


2130N09ffz43 277959 


F 


M00032738D:G11 


MV-522 


99 

ZZ 


497571 


2130H13ffz43 978017 


F 


M00032685B:C10 


MV-522 


93 


38404 


9130H16o743 978065 


F 


M00032686CD 10 


MV-522 


94 
Z*f 


7R6H7 


9 1 30 HI 8 <*743 278007 


F 

X 


M00032686DG09 


MV-522 


9^ 


0O109 


9130 118 a^43 978000 


X 


M00032703D *E 1 0 


MV-522 


9£ 
ZO 


4461 5 


9130 190 W743 978130 


X 


M00032695B-A0 1 


MV-522 


97 
Z / 


376753 


9 1 30 M9 1 <T743 978 1 50 


F 

X 


M00032732AA03 


MV-522 


9R 
Zo 


6349 


9 1 59 HI 0 CT743 978370 


F 

X 


M00039 1 09AH09 


UCP-3 


90 
zy 


3R7530 


9159 H19 av43 978308 

Z lJZ,.l/lZ.gi,tJ / 0J70 


F 

X 


M0003 9069D E 1 2 


UCP-3 


30 


36453 


9159B14<*743 978498 


F 

X 


M00039052BH03 


UCP-3 


3 1 


376044 


9159 A18 0743 978401 

ZlJZ.n.lO.g^tJ / Ot7l 


F 

X 


M00039047CA05 


UCP-3 


39 
«3Z 


375/11 5 


9159T18ot43 978500 


X 


M00039 12 1 CC06 


UCP-3 


Do 


^7^70A 
O / j fVO 


91 53 P03 otAI 97R756 


F 

X 


M00030222C-G06 


UCP-3 


34 


^7QOzl£ 


9153 T06 ar43 97R80R 
ZlJJjvQ.gZ'rJ Z / OOvO 


F 


M00039244CF10 

IVlV/vu J 7LTTV.1 XV 


UCP-3 


3^ 


oooni 7 


9153 TOB o-^43 97R830 


F 


M0003 9240AH08 


UCP-3 


36 


377596 


2153.O08.gz43_278845 


F 


M00039274C:D12 


UCP-3 


37 


376384 


2153.111.gz43_278887 


F 


M00039241A:H11 


UCP-3 


38 


372952 


2153.K14.gz43_278937 


F 


M00039251A.G12 


UCP-3 


39 


377696 


2154.M04.gz43_279163 


F 


: M00039404B-.G11 


UCP-3 


40 


60037 


2 154.G05 .gz43__279 173 


F 


1 M00039330B:H09 


UCP-3 


41 


390968 


2154.118.gz43_279383 


F 


M00039346C:G08 


UCP-3 


42 


376044 


2154.G21.gz43_279429 


F 


M00039336A:C07 


UCP-3 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONED) 


LIBRARY 


AO 
*r D 


60037 


2155 T03 ffz43 279528 


F 


M00039632A'C01 i 

XVJLxJ \J\J <mJ ^ \JmJ £*J* V - V-' v J. 


UCP-3 


AA f 


376830 


9155 1M05 (jy43 279564 


X 


M00039664BH10 


UCP-3 


AK 


O7OA80 


9155 PI 1 <jv43 979662 

LlJJ.l H.^ZitJ £.IZ7\J\>ZJ 


F 

X 


M00039674BG1 1 


UCP-3 


AA 
*fO 


D I Dz7 1 D 


9155 CI 9 ^43 279665 


F 

X 


M00039457AD07 


UCP-3 


A7 


07Q805 

D /70VJ 


9 1 55 Ml 5 c*743 279724 


F 

X 


M00039665CB01 


UCP-3 


Aft 


07AA0Q 


9155 F1 6 cr743 979739 


F 

X 


M0003 9479A' A 1 0 


UCP-3 




07AQQA 

D I 05*57** . 


9155T16cr743 979735 


X 


M00039619CB01 


UCP-3 




O7RA90 


91 55 TV? 1 a743 279808 


F 


M00039451BF01 


UCP-3 


D 1 


D / 1U 


9155 F91 <*743 279812 


F 

X 


M00039479DC06 


UCP-3 


3Z 


9A71 n 


9155P99a743 279838 


F 


M00039675CC05 


UCP-3 ! 


^O 

DD 


0071 A7 

J7 / J.O / 


9164 T03 ar43 979912 


F ' 

X 


M00039778DD05 


UCP-3 | 


<A 


o.ftnn9^ 


9 1 64 T1 0 ar43 980023 


F 


M00039774AE11 


UCP-3 


jj 


9 1 A 1 70 
zio x / 7 


9 1 64 PI 0 a-743 280030 


F 


M00039823DD09 


UCP-3 


«A 
DO 


070046 


9164 T315 a743 280096 


F 


M00039681BF05 


UCP-3 


^7 
3 / 


O7O97O 
D I 7Z / ¥ 


9164A94ay43 780239 


F 


M00039679C:A02 


UCP-3 


JO 


07AH7A 


9164 094ay43 980253 


F 


M00039820DF08 


UCP-3 


^O 


0.7000^ 


9165 1401 az43 780262 


F 

X 


M00039883DG06 


UCP-3 


Afi 


1 300 i 


9165 K01 a^43 280265 


F 


M00039905BF09 


UCP-3 


A1 


oooaos 


9 1 65 FT06 *743 280342 


F 


M00039885BA10 


UCP-3 


oz 


OOAQAO 


9165 P07 <r743 980366 


F 

X 


M00039978DC04 


UCP-3 


AO ' 


OORHA1 


9165 F09 cr743 280388 


F 

X 


M00039871CC01 


UCP-3 


AA 


D /OOUO 


9 1 65 T1 0 try&l 280407 


F 


M00039893AG12 

XTxwv/«y y uy *. . vj 


UCP-3 


A^ 
OD 


6349 


9166 A06 Q743 281279 


F 

X 


M00039986B* Al 1 


UCP-3 


AA 
DO 


90.0ft 1 A 
Zjjo x*t 


9166K07ot43 781305 


F 


M00040076BD0 1 


UCP-3 


A7 


9A9 1 n 


9 1 66 TSJ07 crz43 981308 


F 

X 


M00040096DC03 


UCP-3 


Aft 


9A91 n 


9166 All ot43 981359 


F 

X 


M00039988BC08 


UCP-3 


AO 


077AQA 


9166 Til ay43 981368 


F 

X 


M00040070CD11 


UCP-3 


/u 


J700J 1 


9166 A93 <T743 981551 


F 

X 


M0004000 1 A*H02 


UCP-3 


/I 


LodhDZ 


91AAP94 crrAO 981589 


F 

X 


M00040 1 1 3C-H09 


UCP-3 


T9 

/z 


DVDZ, / 3 


9 1 HA H0 1 cttAO. 907393 
ZlU*f .JJU1 Z7 1 Dz»D 


T? 

X 


M00064082CD1 1 


GRRdz 


ID 


1 1 070 


9 1 OA "MOT ovAIl 907333 


X 


M00064194BA02 


GRRpz 


HA 


A A AO 07 


9 1 HA >JH^ rr-rAO. 907365 


F 

X 


M00064195CB02 

J-V.LV/ V/\/v/~ X V^. MJ\J^ 


GRRpz 


ID 


A1 ft7A0 
*f 15 iod 


9 1 0A PfiA niA'X 907383 
ZxUT-.JrUH-.gZ*f3 Z7 / DOD 


F 

X 


M00064220BE0 1 


GRRoz 

vyj.vi».|y*rf 


7A 
/O 


AAOOA7 


9 1 04 TM05 ot43 997397 


F 


M00064196DC10 


GRRpz 


/ / 


A00^7ft 

OJ7J /O 


9 1 AA COft cytAO 907434 


F 

X 


M00064077BH02 


GRRpz 


78 


376559 


2104E08.gz43 297436 


F 


M00064101B:E12 


GRRpz 


79 


649035 


2104.Dll.gz43 297483 


F 


M00064085B:A12 


GRRpz 


80 


647586 


2104.Hll.gz43 297487 


F 


M00064131B:A09 


GRRpz 


81 


646187 


2104.F12.gz43 297501 


F 


M00064112A:G03 


GRRpz 


82 


727888 


2104.N13.gz43 297525 


F 


M00064201A.C08 


GRRpz 


83 


558382 


2104.E17.gz43 297580 


F 


M00064104B:A01 


GRRpz 


84 


52644 


2104.C19.gz43 297610 


F 


M00064081A:D04 


GRRpz 
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Table 2 



i r 

SEO 
ID NO 


PT TT^TFT? 


< 

SEO NAME 


3RIEN 
T 


CLONE ID 


l 

LIBRARY j 


85 


454622 




r 


Mnflflfi4 1 47R-G08 


GRRdz ! 


86 


554032 


OlA/f AOI rtrrdl. 707£40 

Z 1U4.AZ l.gZ4.5_X;7 /0*tU 






GRRdz 1 


1 87 | 


A /O^ 1 O 

463217 


2 1 U4 ,gz43_zy / O / O 


JT 


MOnOfid 1 01C *E03 


GRRdz \ 


1 88 1 


189073 


1521 .rl0o.gz43 3U^4yo 


r 


.lYJLU\J\JZ» / AvI jAI.1jvJ 


MCF-7 j 




1 OO^ f\f\ 

188309 


1 -CO 1 Tf\£L ~—A1 "SA^/IO© 

152 1 J Uo.gz4 J_3U^>4Vo 


r 




MCF-7 | 


1 c\r\ 1 

! 90 


187525 


J.D2l.JNlU.gZ4J 


JT 


M00097S 1 4r*F01 


MCF-7 S 


1 ^ 1 


215366 


ICOI XTIO nrrAI O^AOCQQ 

1 32 Jl . IN 1 2 .gZ43_3 Uo J V o 


JT 




MCF-7 1 


1 92 1 


186594 


1 52 1 .ri 1 3 .gZ4 J .3 OU 5 


17 

JT 




MCF-7 1 


1 93 1 


218904 




T7 
r 




MCF-7 j 


1 94 1 


189993 


lD21.^14.gZ4J_3UjOiy 


T7 

JT 


M00023406A-G03 


MCF-7 


| 95 1 


O O O 1 o 

222818 


KOI TT1 A r*~A1 

1521.rll4.gZ43 3UJ024 


17 


M00077 1 96CH05 


MCF-7 1 


S 96 1 


185056 


152 1 .U 1 o.gz43_JU dood 


17 




MCF-7 1 


97 


186404 


icoi "noA n ~Ai ""iAUOOQ 
1521.r2U.gz4J_3U3 /z© 


17 
r 


iVAv/UUZ, / Uy^TV/.V/ A A 


MCF-7 j 


98 


649744 


O 1 1 £L 1f\C 3 1 A/£0 1 O 

21 lo.I0o.gz43 3Uo21 / 


17 


M00n6^SR0A*A07 


WOca 1 


99 


535955 


21 lo.FUo.gZ43 3UO230 


r 




WOca j 


100 ! 


375328 


2116.J09.gz43 3Uo2oo 


17 


IVIUvUDj J"^D.lw\/~ 


WOca j 


101 


449206 


2116.A13.gz43 3U5321 


17 


Tv>innn^^ 1 ^^P-Fl o 

lVlUUUU^ A vJ J\w/.A 1 \J 


WOca 1 


102 | 


89082 


21 16.bl5.gz43_3Uo35 / 


17 

r 


'MClCMAI SS7RR06 


WOca 1 


103 


730600 


2116.D21.gz43 3Uo452 


r 


iviuuuoj j*for>.A^vf a 


WOca ! 


104 


i 372621 


2116.022.gz43 3Uo4/y 


■p 


1VAU \ J U v)-> 37 o OxV, v^v z» 


WOca j 


1 105 


416886 


2117.M03.gz43 3Uo35/ 


17 




WOca _J 


106 


1 372621 


2117 .DU5 .gz43_3 Uo5 oU 


17 




WOca 


| 107 


! 644919 


1 1 O XTA1 . A f\tCC\A O 

2118.N03.gz43 306^42 


17 

r 




WOca 


108 


| 446397 


olio A . A 1 *2 AT AO C 

2118.A09.gz43 3U/U23 


T7 


iVIU U tf OH ,< +'tO a-/ .vvO 


WOca 1 


109 


f 177443 


1 -i O "\ fAn .-.—vto O. AO A*2 O 

2118.M09.gz43 3U/U3/ 


17 




WOca | 


1 no 


730238 


OllO TJTO yl *2 ^ ATO TO 

2 1 1 8 .Jrl24 . gz4 3 3 U / 2 / 2 


17 

r 




j WOca J 


! Ill 


421901 


OIOI AA1 ~~A 1 OATQft^\ 

2131 .AO 1 .gz43_3 U / o«$D 


17 

r 


M000^97fifiA'A10 


MV-522 [ 


112 


f 48238 


OIOI li>TAO ~™A O OATOIQ 

2131.M02.gz43 3U/yl3 


17 


M000^7RR6A-D04 


MV-522 \ 


113 


1 226324 


2131.1304.gz43 3U/y34 


17 


M000^77R^A*H08 


MV-522 1 




441801 


2131.ri0o.gz43 3U/yoy 


r 


M000^7XOQB'E10 


MV-522 j 


115 


| 62016 


21 3 1 .Kl 1 .gz43_3U5U5D 


17 

r 


"MfOOO^ 7 877"R ■ A02 


MV-522 1 


I 116 


| 48238 


2131.E12.gz43 3UOU05 


T7 

r 


M000^781 1 AGIO 


MV-522 1 


| 117 


| 34071 


2131.113.gz43 3U5U85 


r 


1V^000^7RS7A*R02 


MV-522 1 


118 


j 221686 


2131.B14.gz43 305U^4 


17 


A^000^77S^ A *H04 


MV-522 1 


119 


| 440284 


2131.J16.gz43 3Uol34 


17 


Ayf000^7R6^ A-FIl 1 


MV-522 j 


120 


1 12481 


2131.P18.gz43 308172 


F 


M00032914B.D09 


MV-522 


121 


| 20453 


2131.A19.gz43 308173 


F 


M00032779A-.A04 


MV-522 1 


1 122 


26926 


2131.I19.gz43 308181 


F 


M00032858D:H11 


MV-522 


1 123 


37805. 


2131.P19.gz43 308188 


F 


M00032915B:D01 


MV-522 | 


124 


441874 


2131.I21.gz43 308213 


F 


M00032859C:E04 


MV-522 1 


125 


; 48238 


2131.A23.gz43 308237 


F 


! M00032780A:B09 


MV-522 j 


1 126 


j 169458 


1513.O03.gz43 300292 


F 


M00022648C:D08 


MDA-MB-231 | 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


127 


145815 


15 13.F09.gz43_300379 


F 


M00022135D:D06 


MDA-MB-231 


128 


158321 


1513.H09.gz43_300381 


F 


M00022255B:F12 


MDA-MB-231 


129 


168195 


1513.L10.gz43_300401 


F 


M00022537D:C05 


MDA-MB-231 


130 


101499 


1513.J13.gz43_300447 


F I 


M00022445D:E12 


MDA-MB-231 


! 131 


142842 


1513.A14.gz43_300454 


F 


M00007960D:E09 


MDA-MB-231 


132 


153316 


1513.E14.gz43_300458 


F 


M00022072A:E12 


MDA-MB-231 


133 


142614 


1 5 1 3 . Al 5 .gz43_300470 


F 


M00007963D:D03 


MDA-MB-231 


134 


99011 


1 5 1 3 .N 1 6.gz43_300499 


F 


M00022622A:G01 


MDA-MB-231 


135 


171073 


1513.P18.gz43_300533 


F 


M00022710C:H03 


MDA-MB-231 


136 


120049 


1513.D19.gz43_300537 


F 


M00021925A:H07 


MDA-MB-231 


137 


446572 


1562.B01.gz43_207804 


F 


M00042546A:D03 


UC2-NormColon 


138 


464091 


1562.F01.gz43_207808 


F 


M00042552DA11 


UC2-NormColon 


139 


408386 


1562.J01.gz43_207812 


F 


M00042560BA01 


UC2-NormColon 


140 


446829 


1562.L02.gz43_207830 


F 


M00042563AF10 


UC2-NormColon 


141 


43338 


1 562.P03 .gz43_207850 


F 


M00042569B.G07 


UC2-NonnColon 


142 


451780 


1562.F04.gz43_207856 


F 


M00042554A:C02 


UC2-NormColon 


143 


456462 


1562.J04.gz43_207860 


F 


M00042560B:G10 


UC2-NormColon 


144 


469511 


1562.G05.gz43_207873 


F 


M00042555D:G10 


UC2-NormColon 


145 


455075 


1562.H05.gz43_207874 


F 


M00042557A:D09 


UC2-NonnColon 


146 


447346 


1562.P05.gz43_207882 


F 


M00042569C-.B05 


UC2-NormColon 


147 


147196 


1562.M06.gz43_207895 


F 


M00042564D:F10 


UC2-NonnColon 


148 


467262 


1562.F08.gz43_207920 


F 


M00042554C:E02 


UC2-NormColon 


149 


459536 


1562.L09.gz43_207942 


F 


M00042563C:D08 


UC2-NormColon 


150 


460190 


1562.P09.gz43_207946 


F 


M00042569D:D02 


UC2-NonnColon 


151 


447597 


1 562.F 10.gz43_207952 


F 


M00042554C:F09 


UC2-NonnColon 


152 


451737 


1562.G10.gz43_207953 


F 


M00042556A.G12 


UC2-NormColon 


153 


446614 


1562.L10.gz43_207958 


F 


M00042563C:E02 


UC2-NormColon 


154 


432159 


1 562.M1 0.gz43_207959 


F 


M00042565AG05 


UC2-NormColon 


155 


459523 


1562.L1 1 .gz43_207974 


F 


M00042563D:D02 


UC2-NormColon 


156 


460516 


1562.Pll.gz43_207978 


F 


M00042570A:E08 


UC2-NonnColon 


157 


446674 


1562.D12.gz43_207982 


F 


M00042550A-.D12 


UC2-NormColon 


158 


465594 


1562.F12.gz43_207984 


F 


M00042554D:C08 


UC2-NormColon 


159 


466719 


1562.G12.gz43 ,207985 


F 


M00042556B:D12 


UC2-NormColon 


160 


463487 


1562.M12.gz43_207991 


F 


M00042565A:H03 


UC2-NormColon 


161 


446389 


1 562 .B 1 3 .gz43_207996 


F 


M00042546D:E06 


UC2-NormColon 


! 162 


462149 


1562.C14.gz43_208013 


F 


M00042548B:G01 


UC2-NonnColon 


163 


447174 


1562.G14.gz43_208017 


F 


M00042556B:E10 


UC2-NormColon 


164 


457405 


1562.K14.gz43_208021 


F 


M00042562CA07 


UC2-NormColon 


165 


460766 


1562.O15.gz43_208041 


F 


M00042568C:E08 


UC2-NormColon 


166 


446703 


1562.N16.gz43J208056 


F 


| M00042567C:E02 


UC2-NormColon 


167 


446981 


1562.D18.gz43 208078 


F 


M00042550C:H10 


UC2-NormColon 


168 


387077 


1562.H18.gz43 208082 


F 


M00042558A:D03 


UC2-NonnColon 
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SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


169 


460972 


1562.N18.gz43_208088 


F 


M00042567C:E07 


UC2-NormColon 


170 


417078 


1 5 62 .P 1 8 .gz43_208090 


F 


M00042570B:F11 


UC2-NormColon 


171 


461559 


1562.L19.gz43 208102 


F 


M00042564A:F01 


UC2-NormColon 


172 


464905 


1562.F20.gz43 208112 


F 


M00042555B:B07 


UC2-NonnColon 


173 


468330 


1 562 .H20 .gz43_208 1 1 4 


F 


M00042558A:F11 


UC2-NormColon 


174 


446620 


1562. J2 1 .gz43_208 132 


F 


M00042561B:E12 


UC2-NormColon 


175 


457842 


1562.P2 1 .gz43_208 138 


F 


M00042570C:B12 


UC2-NonnColon 


176 


446531 


1562.B22.gz43_208140 


F 


M00042547B:D11 j 


UC2-NormColon 


177 


446621 


1562. J24.gz43_208 180 


F 


M00042561C:E12 


UC2-NormColon 


178 


380127 


1562.N24.gz43 208184 


F 


M00042567D:E12 


UC2-NoimColon 


179 


464275 


1563 .D01 .gz43_208 190 


F 


M00042575D:A11 


UC2-NormColon 


180 


456920 


1563.C04.gz43 208237 


F 


M00042574C:A04 


UC2-NoimColon 


181 


466971 


1563.B05.gz43 208252 


F 


M00042573A:D05 


UC2-NormColon 


182 


142559 


1563J05.gz43 208260 


F 


M00042698A:A01 


UC2-NonnColon 


183 


457890 


1563.G08.gz43 208305 


F 


M00042691D:B03 


UC2-NormColon 


184 


453605 


1563.K08.gz43 208309 


F 


M00042699C:H06 


UC2-NormColon 


185 


465339 


1563 N08 ez43 208312 


F j 


M00042705A:B07 


UC2-NonnColon 


186 

X Ov 


461734 


1563.I09.gz43 208323 


F 


M00042695C:F05 


UC2-NormColon 


187 

X o / 


459961 


1563 D12 az43 208366 


F 


M00042576B:D11 


UC2-NormColon 


188 


46395 1 


1563 F13 cz43 208384 


F 


M00042580C:A03 


UC2-NonnColon 


189 

X \jZ7 


468783 


1563 A14 ez43 208395 


F 


M00042571C:F03 


UC2-NoimColon 


190 

X Z7\J 


447597 


1563 E14 ez43 208399 


F 


M00042578A:E08 


UC2-NonnColon 


191 


463368 


1563 114 ez43 208403 


F 


M00042695D:H01 


UC2-NormColon 


192 


451383 


1563 C15 sz43 208413 


F 


M00042575A:E02 


UC2-NonnColon 


193 


467293 


1563.G16.gz43 208433 


F 


M00042692A:E11 


UC2-NormColon 


194 


446922 


1563.016.gz43 208441 


F 


M00042707C:A09 


UC2-NormColon 


195 


446213 


1563D17ez43 208446 


F 


M00042576D:A08 


UC2-NormColon 


| 196 


460244 


1563.F19.gz43 208480 


F 


M00042691A:D08 


UC2-NormColon 


197 


460789 


1563 A20gz43 208491 


F 


M00042572B:E05 


UC2-NormColon 


198 


451382 


1563 C20 ez43 208493 


F 


M00042575B:F02 


UC2-NoraiColon 


199 


468109 


1563 K20 ez43 208501 


F 


M00042700C:F11 


UC2-NoimColon 


200 


447326 


1563 L20 ez43 208502 


F 


M00042702B:B07 


UC2-NonnColon 


201 


463896 


1563 B21 sz43 208508 


F 


M000425 73D : A 1 0 


UC2-NormColon 


202 


446839 


1563 D21 sz43 208510 


F 


M00042576D:F01 


UC2-NonnColon 


20^ 


TTV7JJ 


1563 E22 sz43 208527 


F 


M00042579AB05 


UC2-NormColon 


204 


447826 


1563.L22.gz43_208534 


F 


M00042702B:G07 


UC2-NormColon 


205 


446409 


1563 .B23 .gz43__208540 


F 


M00042574A:F05 


UC2-NormColon 


206 


453908 


1 573 .NO 1 .gz43_208584 


F 


M00042895C:C10 


UC2-NormColon 


207 


447421 


1573 .D02.gz43_208590 


F 


M00042747D:C08 


UC2-NonnColon 


208 


447141 


1573.P02.gz43_208602 


F 


M00042898A:H05 


UC2-NonnColon 


209 


447645 


1573.E04.gz43_208623 


F 


M00042750D:E07 


UC2-NormColon 


210 


401426 


1573.K06.gz43_208661 


F 


M00042890D:G05 


UC2-NormColon 
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SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


211 


466894 


1573.B07.gz43_208668 


F 


M00042744A:D11 


UC2-NoimColon 


212 


447161 


1573.J08.gz43_208692 


F 


M00042889D:A01 


UC2-NormColon 


213 


633946 


1573.M08.gz43_208695 


F 


M00042894B:E05 


UC2-NormColon 


214 


557974 


1573.L09.gz43_208710 


F 


M00042892C:E03 


UC2-NormColon 


215 


466920 


1573.D10.gz43_208718 


F 


M00042748D:D08 


UC2-NormColon 


216 


642146 


1573.F10.gz43_208720 


F 


M00042882C:F06 


UC2-NormColon 


217 


464205 


1573.J10.gz43_208724 


F 


M00042889D:A12 


UC2-NormColon 


218 


641890 


1573.L10.gz43_208726 


F 


M00042892D:C04 


UC2-NormColon 


219 


650195 


1573.Hll.gz43_208738 


F 


M00042886C:F01 


UC2-NonnColon 


220 


467293 


1573.F12.gz43_208752 


F 


M00042882C:G07 


UC2-NormColon 


221 


650756 


1573 .N12.gz43_208760 


F 


M00042896AJD04 


UC2-NormColon 


222 


452611 


1 573 .N 13 .gz43_208776 


F 


M00042896A:E03 


UC2-NonnColon 


223 


650944 


1573.N14.gz43_208792 


F 


M00042896A:F09 


UC2-NonnColon 


224 


639372 


1573.E15.gz43_208799 


F 


M00042881C:C11 


UC2-NormColon 


225 


645690 


1 573 .F 15 .gz43_208800 


F 


M00042882D:C04 


UC2-NormColon 


226 


645470 \ 


1573 .HI 6.gz43_2088 1 8 


F 


M00042886D:E10 


UC2-NormColon 


227 


651029 


1573.M17.gz43_208839 


F 


M00042894D:G05 


UC2-NormColon 


228 


639849 


1573.F18.gz43_208848 


F 


M00042883A:F06 


UC2-NormColon 


229 


486238 


1573.K19.gz43_208869 


F 


M00042891C:H01 


UC2-NormColon 


230 


463060 


1573 .G2 1 .gz43_208897 


F 


M00042885A:G09 


UC2-NormColon 


231 


469150 


1573 .C22.gz43_208909 


F 


M00042747A:G12 


UC2-NormColon 


232 


472101 


1574.O01.gz43_208969 


F 


M00054797D:F01 


UC2-NormColon 


233 


641875 


1574.P02.gz43_208986 


F 


M00054799C:G11 


UC2-NormColon 


234 


465104 


1574.A03.gz43_208987 


F 


M00042900B:B10 


UC2-NomiColon 


235 


; 470641 


1574.C03.gz43_208989 


F 


M00042904A-.H10 


UC2-NormColon 


236 


451624 


1574.D04.gz43_209006 


F 


M00042906C:A10 


UC2-NonnColon 


237 


639391 


1574.N04.gz43_209016 


F 


M00054796B:A01 


UC2-NormColon 


238 


470462 


1574.B05.gz43_209020 


F 


M00042902B:H01 


UC2-NonnColan 


239 


482043 


1 574. J05 .gz43_209028 


F 


M00042915A.E06 


UC2-NormColon 


240 


641525 


1574.N05.gz43_209032 


F 


M00054796B:C08 


UC2-NoimColon 


241 


642691 


1574.E06.gz43_209039 


F 


M00042908A:B01 


UC2-NormColon 


242 


466697 


1574.G06.gz43_209041 


F 


M00042910D:A02 


UC2-NormColon 


243 


649965 


1574.H06.gz43_209042 


F 


M00042912A:C01 


UC2-NormColon 


244 


651051 


1574J06.gz43_209044 


F 


M00042915A:G10 


UC2-NormColon 


! 245 


450506 


1574.M06.gz43_209047 


F 


M00054794C:G11 


UC2-NormColon 


246 


453572 


1574.O06.gz43_209049 


F 


M00054798B:A01 


UC2-NonnColon 


247 


447660 


1574.B07.gz43_209052 


F 


M00042902C:E11 


UC2-NormColon 


248 


447147 


1574.C07.gz43_209053 


F 


M00042905A:A07 


UC2-NormColon 


249 


647639 


1574.F07.gz43 _209056 


F 


M00042909B:C04 


UC2-NormColon 


250 


649965 


1574.G08.gz43_209073 


F 


M00042910D:E11 


UC2-NormColon 


251 


639371 


1574.109.gz43_209091 


F 


M00042914A:B05 


UC2-NormColon 


■ 252 


474298 


1574.M09.gz43_209095 


F 


M00054794D:D02 


UC2-NonnColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


253 


646568 


1574.M10.gz43 209111 


F 


M00054794D:D08 


UC2-NormColon 


254 


639787 


1574.P10.gz43 209114 


F 


M00054800B:C06 


UC2-NormColon 


255 


447750 


1574 All ez43 209115 


F 


M00042901AF12 


UC2-NormColon 


256 


649810 


1574 Gil ez43 209121 


F 


M00042911A:A02 


UC2-NormColon 


257 


645924 


1574Hllez43 209122 


F 


M00042912B:F11 


UC2-NormColon 


258 


413767 


1574 Lll gz43 209126 


F 


M00054793C:D11 


UC2-NonnColon 


259 


641069 


1574 E12 ez43 209135 


F 


M00042908B:A1 1 


UC2-NormColon 


260 


649900 


1574.G12.gz43 209137 


F 


M00042911A:B02 


UC2-NormColon 


261 


560868 

~_/ 


1574J12.gz43_209140 


F 


M00042915C:E05 


UC2-NormColon 


262 


513262 


1574.P12.gz43 209146 


F 


M00054800B:C11 


UC2-NormColon 


263 


640306 


1574.L13.gz43 209158 


F 


M00054793D:H11 


UC2-NonnColon 


264 


389591 


1574.A14.gz43 209163 


F 


M00042901A:H11 


UC2-NonnColon 


265 


447815 


1574.C14.gz43 209165 


F 


M00042905C:G08 


UC2-NormColon 


266 


95617 


1574.D14.gz43 209166 


F 


M00042907A:B11 


UC2-NonnColon 


267 


465984 


1574.B15.gz43 209180 


F 


M00042903B:C09 


UC2-NormColon 


268 


447692 


1574 D15 ez43 209182 


F 


M00042907A:F03 


UC2-NormColon 


269 


641029 


1574 G15 ez43 209185 


F 


M00042911A:D04 


UC2-NormColon 


270 


650527 


1574 J15 sz43 209188 


F 


M00042915D:A11 


UC2-NormColon 


771 

xV / J. 


641315 


1574 015ez43 209193 


F 


M00054798D:A12 


UC2-NormColon 


777 

J~> i x- 


474298 


1574 P15 ez43 209194 


F 


M00054800B:E08 


UC2-NormColon 


77^ 


645954 


1574 H16 ez43 209202 


F 


M00042912D:H08 


UC2-NormColon 


774 


640181 


1574 M16 ez43 209207 


F 


M00054795A:A08 


UC2-NormColon 


">75 

.... r «y 




1574F17ez43 209216 


F 


M00042909C:F10 


UC2-NormColon 


^76 




1574 H17 ez43 209218 


F 


M00042912D:H10 


UC2-NormColon 


^77 


64Q744 


1574E18ez43 209231 


F 


M00042908C:A03 


UC2-NormColon 


11 R 


650564 


1574F18oz43 209232 


F 

A 


M00042909DB 1 1 


UC2-NormColon 


779 


527355 


1574 G18 ez43 209233 


F 


M00042911AJH12 


UC2-NormColon 


780 


452989 


1574H18cz43 209234 


F 


M00042913A:D09 


UC2-NormColon 


781 


648159 


1574 K18 ez43 209237 


F 


M00054792B:A03 


UC2-NormColon 


287 


714629 


1574 E20 ez43 209263 


F 


M00042908C:D12 


UC2-NormColon 


283 


467364 


1574 120 ez43 209267 


F 


M00042914B:H03 


UC2-NormColon 


284 

X<0*T 


644789 


1574 N20 ez43 209272 


F 


M00054797C:F03 


UC2-NormColon 


285 


646404 


1574J21sz43 209284 


F 


M00054791A:G04 


UC2-NormColon 


286 


639740 


1574 H22 ez43 209298 


F 


M00042913B:E10 


UC2-NormColon 


287 


649852 


1574 122 ez43 209299 


F 


M00042914D:B10 


UC2-NonnColon 


288 


644376 


1574.G23.gz43_209313 


F 


M00042911BJH08 


UC2-NormColon 


289 


639901 


1574.P23.gz43_209322 


F 


M00054800D:D08 


UC2-NormColon 


290 


647086 


1574.E24.gz43_209327 


F 


M00042908D:G12 


UC2-NormColon 


291 


! 562247 


1574.G24.gz43_209329 


F 


M00042911C:D01 


UC2-NormColon 


292 


446974 


1574J24.gz43_209332 


F 


M00054791B:C09 


UC2-NormColon 


293 


513248 


1575.A01.gz43_209339 


F 


M00054800D:F08 


UC2-NormColon 


294 


! 467989 


1575.M01.gz43_209351 


F 


M00055429B:B12 


UC2-NormColon 
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PCT/US01/25840 



Table 2 



SEQ 
BONO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


295 


650184 


1 575 .NO 1 .gz43_209352 


F 


M00055430D:F04 


UC2-NormColon 


296 


649872 


1575 .E02.gz43_209359 


F 


M00054916A:A05 


UC2-NormColon 


297 


649506 


1575.G02.gz43 209361 


F 


M00054918A:D02 


UC2-NormColon 


298 


446254 


1575.M02.gz43_209367 


F 


M00055429B:E12 


UC2-NormColon 


299 


641542 


1575.O02.gz43 209369 


F 


M00055432B:H02 


UC2-NormColon 


300 


649933 


1575 103 ez43 209379 


F 


M00054920B:C04 


UC2-NormColon 


301 


639444 


1575.M03.gz43 209383 


F 


M00055429B:G04 


UC2-NormColon 


302 


553100 


1575.G04.gz43_209393 


F 


M00054918A:F09 


UC2-NormColon 


303 


549699 


1 575 . AOS .gz43_209403 


F 


M00054911A:C08 


UC2-NormColon 


304 


639256 


1575.E05 .gz43_209407 


F 


M00054916A:E05 


UC2-NormColon 


305 


644556 


1 575 .H05 .gz43_2094 1 0 


F 


M00054919CF06 


UC2-NormColon 


306 


500337 

\y \J 9 


1575.I05.gz43 209411 


F 


M00054920B:C12 


UC2-NormColon 


307 


643924 


1 575 .K05 .gz43_2094 1 3 


F 


M00055427A:F01 


UC2-NonnColon 


308 


639394 


1575.E06.gz43_209423 


F 


M00054916A:F10 


UC2-NormColon 


309 

~> \j j 


452986 


1575.J06.gz43 209428 


F 


M00055426B:B02 


UC2-NormColon 


310 


654723 


1575.K06.gz43 209429 


F 


M00055427A:F02 


UC2-NormColon 


311 


645344 ! 


1575.N07.gz43 209448 


F 


M00055431A:H05 


UC2-NormColon 


312 


645162 


1575.O07.gz43 209449 


F 


M00055432C:F01 


UC2-NormColon 


313 


589098 


1575.B08.gz43_209452 


F 


M00054912B:C05 


UC2-NormColon 


314 


643843 


1575.F08.gz43 209456 


F 


M00054917C:D03 


UC2-NormColon 


315 


647109 


1 575 .P08 .gz43_209466 


F 


M00055434A:A03 


UC2-NormColon 


316 


467381 


1575.D09.gz43 209470 


F 


M00054914D:G07 


UC2-NormColon 


317 


570812 


1 575 .N09.gz43 209480 


F 


M00055431B:A01 


UC2-NormColon 


318 


559776 


1575.A10.gz43 209483 


F 


M00054911A:G01 


UC2-NormColon 


319 


648532 


1575.F10.gz43 209488 


i F 


M00054917C:F03 


UC2-NormColon 


320 


449861 


1575 G10 ez43 209489 


F 


■M00054918B:H01 


UC2-NormColon 


321 


643843 


1575.I10.gz43 209491 


F 


M00054920C:A06 


UC2-NormColon 




41141 


1575K10ez43 209493 


F 


M00055427B:E01 


UC2-NormColon 




648664 


1575 Ell ez43 209503 


F 


M00054916B:E02 


UC2-NormColon 


324 


640814 


1575Kllez43 209509 


F 


M00055427B:F06 


UC2-NormColon 


325 


584071 


1575 Mil ez43 209511 


F 


M00055429D:G07 


UC2-NormColon 


326 


450225 


1575 C12 ez43 209517 


F 


M00054913C:G03 


UC2-NormColon 


377 


452707 


1575 112 ez43 209523 


F 


M00054920C:D05 


UC2-NormColon 




943722 


1575 L12 ez43 209526 


F 


M00055428C:G06 


UC2-NoimColon 


32Q 


641057 


1575 B13 ez43 209532 


F 


M00054912CC01 


UC2-NormColon 


330 


413767 


1575.D13.gz43 209534 


F 


M00054915A:G03 


UC2-NormColon 


331 


504568 


1575.K13.gz43_209541 


F 


M00055427C:A06 


UC2-NoimColon 


332 


639992 


1575.A15.gz43_209563 


F 


M00054911B:E10 


UC2-NormColon 


333 


456923 


1575.G15.gz43_209569 


F 


M00054918D:C03 


UC2-NormColon 


334 


649555 


1575.I15.gz43 209571 


F 


M00054920CF02 


UC2-NormColon 


335 


649746 


1575.J15.gz43 209572 


F 


M00055426C:C10 


UC2-NonnColon 


| 336 


644692 


1575.F16.gz43 209584 


F 


M00054917D:A12 


UC2-NormColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


337 


452204 

~ fc/ 4&> iw \J ~ 


1575 G16 sz43 209585 


F 


M00054918D:C11 


UC2-NormColon 


338 


61616 


1575H16az43 209586 


F 


M00054920A:A05 


UC2-NormColon 


339 


495143 


1575.M16.gz43 209591 


F 


M00055430B:E08 


UC2-NormColon 


340 


644371 


1575 016 sz43 209593 


F 


M00055433A:B07 


UC2-NormColon 


341 


630269 


1575K17sz43 209605 


F 


M00055427C:E12 


UC2-NormColon 


342 


446932 


1575 017 ez43 209609 


F 


M00055433A:B08 


UC2-NormColon 


343 


639662 


1575 F18 ez43 209616 


F 


M00054917D:D12 


UC2-NormColon 


344 

~S • I 


611927 


1575 L18 ez43 209622 


F 


M00055428D:G12 


UC2-NormColon 


345 


641925 


1575 018 gz43 209625 


F 


M00055433A:C02 


UC2-NortnColon 


346 


651020 


1575 P18 ez43 209626 


F 


M00055434C:B11 


UC2-NormColon 


347 


639255 


1575 F19 ez43 209632 


F 


M00054917D:E05 


UC2-NormColon 


348 


468222 


1575 G19 ez43 209633 


F 


M00054918D:H09 


UC2-NormColon 


349 


468783 


1575 K19 ez43 209637 


F 


M00055427C:H1 1 


UC2-NormColon 




417130 


1575 B20 ez43 209644 


F 


M00054912D:G01 


UC2-NormColon 




448606 


1575 B21 ffz43 209660 


F 


M00054912D:G04 


UC2-NoimColon 


359 


456490 


157SK21 &z43 209669 


F 


M00055427D:E05 


UC2-NormColon 


353 


640369 


1575 P21ffz43 209674 


F 


M00055434D:B06 


UC2-NonnColon 


354 


473854 


1575E22ez43 209679 


F 


M00054917A:F07 


UC2-NormColon 


355 


564440 


1575H22e:z43 209682 


F 


M00054920A:C11 


UC2-NormColon 


356 


639590 


1 575 L22 bz43 209686 


F 


M00055429A:H04 


UC2-NonnColon 


357 


649195 


1575 022 ez43 209689 


F 


M00055433A:E04 


UC2-NormColon 


358 


57183 


1575P22ez43 209690 


F 


M00055434D:E09 


UC2-NormColon 


35Q 


648609 


1575 E23 ez43 209695 


F 


M00054917B:A05 


UC2-NormColon 


360 


648S39 


1575F23ez43 209696 


F 


M00054917D:H02 


UC2-NormColon 


361 

J\J x 


649073 


1575 N23 ez43 209704 


F 


M00055432B:B04 


UC2-NormColon 


369 


377855 


1575B24ez43 209708 


F 


M00054913A:B12 


UC2-NormColon 


363 


459845 


1575G24ez43 209713 


F 


M00054919A:H04 


UC2-NorxnColon 


364 


638758 


1575 024 ez43 209721 


F 


M00055433CA11 


UC2-NormColon 


365 


63Q89Q 


1576A04ez43 209771 


F 


M00055435B:C09 


UC2-NoimColon 


366 


484145 


1 576 T05 ffz43 209795 


F 


M00055446B:A12 


UC2-NormColon 


! 367 


640055 


1 576 A08 ffz43 209835 


F 


M00055435C:E12 


UC2-NormColon 


368 


63 909 8 

OJ77ZO 


1 576 T08 ^43 209843 


F 


M00055446BD08 


UC2-NormColon 


360 


596606 

JZ.UUV/U 


1 576 T.08 az43 209846 


F 


M00055450B:G07 


UC2-NormColon 


\ 37fi 
j /SJ 


6^3A1 6 
OjjOIO 


1 ^76 F00 crr43 90Q855 


F 


M00055440DD02 


UC2-NormColon 


371 




1 576 M19 <*743 90991 1 


F 


M00055451C-B08 


UC2-NormColon 


372 


560791 


1576.O12.gz43_209913 


F 


M00055453D:E12 


UC2-NormColon 


373 


639287 


| 1576.A13.gz43_209915 


F 


M00055435D:G11 


UC2-NormColon 


374 


475184 


1576.M14.gz43_209943 


F 


M00055451C:E10 


UC2-NonnColon 


375 


626061 


1576.A15.gz43_209947 


F 


M00055436A:E04 


UC2-NormColon 


376 


640734 


1576.E16.gz43 209967 


F 


! M00055441A:G07 


UC2-NormColon 


377 


466092 


1576.B17.gz43_209980 


F 


M00055437D:B06 


UC2-NormColon 


378 


639711 


1576.A19.gz43 210011 


F 


M00055436B:B06 


UC2-NormColon 
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WO 02/14500 



PCT/US01/25840 



Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


379 


640522 

V*~ \J+S i*t.r 


1576.C21.gz43_210045 


F 


M00055439B:B07 


UC2-NormColon 


380 


651038 


1585.101.gz43_210211 


F 


M00055465A:C05 


UC2-NonnColon 


381 


466440 


1585 .B02.gz43_2 1 0220 


F 


M00055456C:A10 


UC2-NonnColon 


382 


648123 


1585 .C02.gz43__2 1022 1 


F 


M00055458B.B04 


UC2-NormColon 


383 


646318 1 


1585 .K02 ,gz43_2 1 0229 


F 


M00055468A:A05 


UC2-NonnColon 


384 


598589 


1585 .E03 .gz43_2 1 0239 


F 


M00055460B:G06 


UC2-NormColon 


385 


447863 


1585 .F03 .gz43_2 1 0240 


F | 


M00055461AH03 


UC2-NomiColon 


386 


562236 


1585 .P05 .gz43_2 1 0282 


F 


M00055494C:G10 


UC2-NormColon 


387 


559369 


1585.A06.gz43_2 10283 


F 


M00055456AB03 


UC2-NormColon 


388 


484126 


1585.B06.gz43_210284 


F 


M00055456C:G04 


UC2-NonnColon 


389 


558839 


1585 .P06 .gz43_2 1 0298 


F 


M00055494C:G11 


UC2-NormColon 


390 


555820 


1585J07.gz43_210308 


F 


M00055467AA07 


UC2-NormColon 


391 


544461 


1585 .H09.gz43_2 10338 


F 


M00055463D:G01 


UC2-NonnColon 


392 


642478 


1585.A10.gz43_210347 


F 


M00055456AF01 


UC2-NormColon 


^9^ 


643843 


1585.Cll.gz43 210365 


F 


M00055458C-.G02 


UC2-NormColon 


^94 


640400 


1585.G12.gz43_210385 


F 


M00055462CA11 


UC2-NormColon 


39S 


645201 


1585.L12.gz43 210390 


F 


M00055469C-.F09 


UC2-NormColon 


396 


640792 


1 5 85 . F 1 3 .gz43_2 1 0400 


F 


M00055461C:E05 


UC2-NonnColon 


397 


640913 


1585.H13.gz43 210402 


F 


M00055464B.E06 


UC2-NonnColon 


398 


639607 


1585 113 ez43 210403 


F 


M00055465D:F12 


UC2-NormColon 


399 

J77 


639932 

vJ 7> Ji» 


1585.B14.gz43_210412 


F 


M00055456D:G04 


UC2-NormColon 


400 


640662 


1585.D14.gz43 210414 


F 


M00055460AD01 


UC2-NonnColon 


401 

j •TV/ X 


598589 


1585.G14.gz43_210417 


F 


M00055462C:C03 


UC2-NonnColon 


407 


556654 


1585.H14.gz43 210418 


F 


M00055464B-.E11 


UC2-NonnColon 


! 403 


555193 


1585 014 gz43 210425 


F 


M00055493D:B07 


UC2-NormColon 


404 


641066 


1585.E15.gz43 210431 


F 


M00055460D.B06 


UC2-NormColon 


405 


664711 


1 5 85 .HI 5 .gz43_2 1 0434 


F 


M00055464B:G03 


UC2-NormColon 


406 


549611 


1585.115.gz43_210435 


F 


M00055466A-.C05 


UC2-NormColon 


407 


639726 


1585.K16.gz43_210453 


F 


M00055468C:B07 


UC2-NorraColon 


408 


607422 


1585 F17 gz43 210464 


F 


M00055461D:C09 


UC2-NormColon 


409 


584745 


1585 M17 az43 210471 


F 


M00055491AH01 


UC2-NoimColon 


410 


661194 


1585.H19.gz43_210498 


F 


M00055464D.A04 


UC2-NormColon 


41 1 

Til 


559549 


1585 J20 ez43 210516 


F 


M00055467DA01 


UC2-NormColon 


417 


641645 


1585 A22 ez43 210539 


F 


M00055456B:G08 


UC2-NormColon 


413 


641467 


1585 C22 sz43 210541 


F 


M00055459BA02 


UC2-NormColon 


414 


640368 


1585.F22.gz43 210544 


F 


M00055462AA09 


UC2-NormColon 


415 


412416 


1585.G22.gz43 210545 


F 


M00055462D.H12 


UC2-NonnColon 


416 


650914 


1585.H22.gz43 210546 


F 


M00055464D-.F08 


UC2-NormColon 


417 


3 


1585.J22.gz43 210548 


F 


M00055467D-.C10 


UC2-NormColon 


418 


646318 


1585.N22.gz43 210552 


F 


M00055492C:H07 


UC2-NormColon 


419 


237288 


1585.P22.gz43 210554 


F 


M00055495C-.F03 


UC2-NormColon 


420 


650605 


1585.B23.gz43 210556 


F 


M00055457D:F09 


UC2-NormColon 
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TD NO 


PT TTQTFT* 


( 

SEONAME 


DRDEN 
T 


1 

CLONE ID | 


1 

LIBRARY | 


421 1 


643991 | 


1585 .ls23 .gz4 3__2 1 U o d y 


r 


1^0^0*5546^^06 1 

IVlUUuJ Jtw i A. AVU 1 


UC2-NonnColon 


422 1 


398061 | 


1COC Z^OO ^/lO Ol 


r 


A/rn00SS463A*A1 1 1 

IVlwUJJT'wJA.Al X I 


UC2-NormColon 


423 1 


505933 


15o5.J24.gz43 211D5U 


r 


MOnnSS467D-G08 1 


UC2-NormColon | 


424 


640068 1 


1COC T/1 /f ^ /I O Ol 

1585.K24.gz43_zlUDol 


r 




UC2-NormColon 


425 1 


474821 1 


1 5 85 .N24.gz43 _2 1 UD o4 


r 


A/fnonSS4Q2DB08 1 


UC2-NormColon j 


426 


234606 | 


1CO^ - /tO O110./C1 

1587.C01.gz43 zll3ol 


r 


IVJLV/V/V/ J J J x^^V.x v/o 1 


UC2-NormColon | 


427 


597780 | 


1587.E01.gz43 211303 




IVlvlvV/J J 1 v/.i>vO | 


UC2-NormColon | 


428 


492483 | 


15o7.B02.gz43 2113 /o 


p 
r 


MOOOS S ^ 1 7C *H07 1 


UC2-NormColon | 


429 1 


644051 | 


i cot PAO o oil 'JG^ 
15o7.C03.gz43 2113V3 


p 


MOOO^S519AH01 1 

lVJL\/V/V/»/»/~' A 7A.11V J. 1 


UC2-NoraiColon | 


430 I 


639056 1 


15o7.xiU3.gz43 2113V3 




M0005S521CC08 


UC2-NormColon | 


431 1 


643723 1 


1 C O T T f\1 sv.~/1 O Oil AHO 

15o7.L03.gz43 2114U2 


p 


MOOOSS529DD05 1 


UC2-NormColon | 


432 


639459 I 


1 COT \TAO ^-/lO Oil /f A A 

1587.N03.gz43 2114U4 


p 


Mnoo^ss32r*no8 1 


UC2-NormColon | 


433 1 


639826 


15o7.rU3.gz43 2114UO 


p 


M00055535B*A11 i 


UC2-NormColon 


434 


552783 1 


15o7.A04.gz43 2114U/ 


p 
r 




UC2-NonnColon | 


435 ! 


590218 ( 


KOT T\H/I sw^/l^ 011A1H 

15o7.DU4.gz43 21141U 


p 


M00055520BD11 ! 


UC2-NormColon I 


436 


559324 1 


ICO'7 ~cr\A O. 0 1 1 A 1 1 

15o7.liU4.gz43 211411 


r 


MOOO^SS21CD02 1 

lVlUVVJ JJ^< 1\»^.X«'V/^* J 


UC2-NormColon | 


437 


646711 | 


icon T (\ A r^wmA 1 1 1 1 A 1 £ 

15o7.LU4.gz43 211415 


r 


M00055529DD1 1 ! 


UC2-NormColon 


438 


639132 


1 COT TV/fA/l s*.~/t"2 O 1 1 /I 1 O 

15o7.MU4.gz43 21141^ 


p 


M00055531BD10 


UC2-NormColon 


439 


207552 


1 COT "XT A A *~—A 1 11 1 /I1A 

1587.NU4.gz43 2I142U 


p 
r 


MOnOSS^32DA12 


UC2-NormColon | 


440 


116869 


"1 COT /"\ A /4 A O Ol 1 /111 

15o7.O04.gz43 211421 


p 
r 


M0nn^S^35DG02 

iYiWvvJJJJ'JA'.vlv/^ 


UC2-NormColon | 


441 


| 481220 


1 COT T\AC "2 Oil /IO/£ 

1587.DU5.gz43 21142o 


p 
r 


M00nSS^20BF04 


UC2-NonnColoii | 


442 


1 645544 


t O *T T7" r\ C - /IO Oil /HI 

1 1587.KU5.gz43 211433 


P 
r 


A/Tftftfl 5 5? 8D • B02 


1 UC2-NormColon | 


443 


1 650617 


1 i rnn ik M\£ . >• O O 1 1 /IOC 

| 1587.M05.gz43 21143!) 


p 
r 


iVlUvfUJ J J-5 xxj .xli\J~) 


UC2-NormColon 


444 


1 194095 


! i t~\r\JZ /to Ol 1 /lOT 

| 1587.005 .gz43 21143/ 


p 
r 


1V1U V v J J J . ±2t\J \J 


1 UC2-NormColon | 


445 


| 446984 


1 1 ^ot *n A/^ /f 0 Ol 1 A A C\ 

1 1587.B06.gz43 21144U 


p 


jyisjyjyjjjjx. i xj .xjsjz/ 


| UC2-NormColon 1 


446 


! 290226 


1 "1 ^ OT VkAT ^1 1 /I C /C 

1 1587.B07.gz43 211456 


p 
r 


1Y1UUU J J J 1 / xJ .xJ l X 


UC2-NormColon 


447 


1 634409 


1 1 er 0 T t»/\*7 . 4 0 O 1 1 /I OA 

1 1587.P07.gz43 2114/U 


p 


JLY1UUU J JJJJv/ 


I UC2-NormColon | 


448 


| 640419 


PI 1 roT /"LAO A O Oil /I TO 

| 1587.G08.gz43 2114// 


p 
r 


MOOOSSS24BB08 


1 UC2-NormColon | 


449 


1 649149 


1 1 e a *w T A A - A O O 1 1 /I AO 

1 1587.L09.gz43 211495 


p 
r 


IVluUUJJJJwA.Vv # 


1 UC2-NormColon 1 


450 


\ 640072 


1 if 0^ - >i 0 01 1 C 1 Q 

| 1587.P10.gz43_21lDlo 


P 
r 


1VX V/ \J\J -/ J JJ-J \^ . Xwv U 


1 UC2-NormColon | 


451 


| 641615 


1 1 f r\^t Ti "1 T1 1 COO 

| 1587.Jll.gz43 211528 


r 


JV1U U U -> J ^ o/\ . n u o 


TJC2-NormColon 1 


452 


1 639213 


I 1 f OT lf1<) . yl O Oil C/CQ 

| 1587.M13.gz43 211563 


p 

Jr 




1 UC2-NormColon 1 


453 


1 641216 


1 1 r 0/7 XT 1 C iflO Oil C O/C 

1 1587.N15.gz43 211DVO 


p 
r 




1 UC2-NonnColon | 


454 


650161 


I 4 ro^r A 1 /*" . /in OIIC AA 

1 1587.A16.gz43 211599 


P 
r 




| UC2-NormColon 1 


455 


558785 


1 1587.B23.gz43 211/lz 


p 
r 


J.VXV/ Ul/JJJ 17A.VV X 


1 TjC2-NormColon| 


456 


642631 


1 1588.B02.gz43 211780 


F 


M00055538C:E04 


UC2-NormColon 


457 


I 639543 


1588.F02.gz43 211784 


F 


M00055543D:H03 


UC2-NonnColon 


458 


! 473701 


1588.F03.gz43 211800 


F 


M00055544A.A07 


UC2-NormColon 


459 


562274 


1588.G03.gz43 211801 


F 


M00055545BA04 


UC2-NormColon 


460 


j 643277 


| 1588.D05.gz43 211830 


F 


M00055541C:D02 


UC2-NormColon 


461 


556867 


i 1588.G05.gz43 211833 


F 


1 M00055545B:C01 


UC2-NormColon 


462 


] 646350 


1 1588.P05.gz43_211842 


F 


M00055556A:A04 


| UC2-NormColon | 
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SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


463 


649335 


1588.B06.gz43_211844 


F 


M00055538D.D12 


UC2-NormColon 


464 


646159 


1588.J06.gz43_211852 


F ! 


M00055548B-.C03 


UC2-NormColon 


465 


649099 


1588.P06.gz43_211858 


F 


M00055556A:E07 


UC2-NormColon 


466 


32812 


1588.H09.gz43_211898 


F 


M00055546C-.F11 


UC2-NormColon 


467 


639480 


1588. J09.gz43_2 1 1900 


F | 


M00055548B:G06 


UC2-NormColon 


468 


644242 


1588.N09.gz43_21 1904 


F 


M00055553B:H04 


UC2-NormColon 


469 


644723 


1588.N1 1 .gz43_21 1936 


F 


M00055553C:D06 


UC2-NormColon 


470 


648748 


1588JF13.gz43_211960 


F 


M00055544C:A01 


UC2-NormColon 


471 


639099 


1588.G13.gz43_211961 


F 


M00055545C:D09 


UC2-NormColon 


472 


562414 


1 5 8 8 .F 15 .gz43_2 1 1 992 


F 


M00055544C:B07 


UC2-NormColon 


473 


641078 


1588.N15.gz43_212000 


F 


M00055553CH12 


UC2-NormColon 


474 


505042 


1588.G16.gz43_212009 


F 


M00055545D:B06 


UC2-NormColon 


475 


640747 


1588.N16.gz43_212016 


F 


M00055553D:C07 


UC2-NormColon 


476 


453606 


1588.D17.gz43_212022 


F 


M00055542B:B11 


UC2-NonnColon 


477 


419706 


1588.A18.gz43_212035 


F 


M00055538A:C05 


UC2-NormColon 


478 


557983 


1 5 88 .H2 1 .gz43_2 1 2090 


F 


M00055547B:C05 


UC2-NormColon 


479 


639114 


1588.D22.gz43_212102 


F 


M00055542C:D07 


UC2-NormColon 


480 


442531 


1588J22.gz43_212108 


F 


M00055549A:B04 


UC2-NormColon 


481 


649668 


1597:N02.gz43_212176 


F 


M00055574D.E02 


UC2-NormColon 


482 


639781 


1597.D05.gz43_212214 


F 


M00055561C-.C07 


UC2-NormColon 


483 


639056 


1597.E05.gz43_212215 


F 


M00055562D:B06 


UC2-NonnColon 


484 


639698 


1597.P05.gz43_212226 


F 


M00055577C:G02 


UC2-NormColon 


485 


46973 1 


1597.B06.gz43_212228 


F 


M00055559A:D06 


UC2-NormColon 


486 


639593 


1597.G06.gz43_212233 


F 


M00055566A:E02 


UC2-NormColon 


487 


557975 


1 597. J06.gz43_2 12236 


F 


M00055569D:C02 


UC2-NormColon 


488 


640356 


1597,E08.gz43_212263 


F 


M00055563AA02 


UC2-NonnColon 


489 


647431 


1597.E09.gz43_212279 


F 


M00055563A:C03 


UC2-NormColon 


490 


31112 


1597.C10.gz43_212293 


F 


M00055560D:C03 


UC2-NormColon 


491 


454825 


1597.F10.gz43_212296 


i F 


M00055565A:C08 


UC2-NormColon 


492 


555021 


1597.B12.gz43_212324 


F 


M00055559C:D07 


UC2-NormColon 


493 


650740 


1597.I12.gz43 212331 


F 


M00055569A:D01 


UC2-NormColon 


494 


659031 


1597J12.gz43_212332 


F 


M00055570B:F07 


UC2-NormColon 


495 


592122 


1597.D14.gz43 212358 


F 


M00055562B:D03 


UC2-NormColon 


496 


45921 


1597N14ez43 212368 


F 


M00055575C:B04 


UC2-NormColon 


497 


556511 


1597 H15 ez43 212378 


F 


M00055568A:B04 


UC2-NormColon 


! 498 


637966 


1597.A16.gz43_212387 


F 


M00055558B.G11 


UC2-NormColon 


499 


553318 


1597.F16.gz43 212392 


F 


M00055565A-.F11 


UC2-NormColon 


500 


23961 


1597.C17.gz43_212405 


F 


M00055561A:F07 


UC2-NormColon 


501 


650235 


1597.M17.gz43_212415 


F 


M00055574B:F06 


UC2-NormColon 


502 


596882 


1597.F18.gz43_212424 


F 


M00055565B:F04 


UC2-NonnColon 


503 


644928 


1597.118.gz43_212427 


F 


M00055569A:F06 


UC2-NormColon 


504 


589483 


1597.L18.gz43_212430 


F 


M00055573B:D11 


UC2-NormColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


505 


640956 


1597.N18.gz43_212432 


F 


M00055575D:G04 


UC2-NomiColon 


506 


553602 


1597.L19.gz43_2 12446 


F 


M00055573B-.E01 


UC2-NormColon 


507 


640851 


1597.O20.gz43_212465 


F 


M00055577A:G09 


UC2-NormColon 


508 


460666 


1597.A21 .gz43_212467 


F 


M00055558D-.C08 


UC2-NormColon 


509 


640195 


1597.D22.gz43_212486 


F 


M00055562C:F02 


UC2-NormColon 


510 


668852 


1597.P22.gz43_212498 


F 


M00055578C:F11 


UC2-NormColon 


511 


645913 


1597.A23.gz43_212499 


F 


M00055558D:D07 


UC2-NormColon 


512 


446230 


1597.G23.gz43_212505 


F 


M00055567A:A06 


UC2-NormColon 


513 


649349 


1597.N23.gz43_212S12 


F 


M00055576AF07 


UC2-NormColon 


514 


645538 


1597.C24.gz43_212517 


F 


M00055561B:G12 


UC2-NormColon 


515 


647069 


1597 .G24.gz43_2 1-2521 


F 


M00055567A.A11 


UC2-NormColon 


516 


638869 


1597.H24.gz43_212522 


F 


M00055568C:F07 


UC2-NormColon 


517 


556 


1597.124.gz43_212523 


F 


M00055569B-.G10 


UC2-NormColon 


518 


648483 


1 597 .024.gz43_2 12529 


F 


M00055577B:F05 


UC2-NormColon 


519 


447272 


1 5 98 .MO 1 .gz43_2 12543 


F 


M00055595C:F12 


UC2-NormColon 


520 


640025 


1598.B02.gz43_212548 


F 


M00055581AC02 


UC2-NormColon 


t 521 


415538 


1598.E02.gz43_2 1255 1 


F 


M00055585B:F01 


UC2-NormColon 


i 522 


478229 


1598.M02.gz43_212559 


F 


M00055595C:G03 


UC2-NormColon 


523 


644572 


1598 H03 .gz43J2 12570 


F 


M00055589B:E08 


UC2-NormColoii 


f 524 


648580 


1598.B04.gz43 212580 


F 


M00055581CA01 


UC2-NormColon 


525 


470769 


1598.L04.gz43 212590 


F 


M00055594C:B03 


UC2-NonnColon 


526 


554273 


1 598 .M04 .gz43J2 1259 1 


F 


M00055595D:C04 


UC2-NoimColon 


521 


641262 


1598.C05.gz43_212597 


F 


M00055583AA05 


UC2-NormColon 


528 


645707 


1598.F05.gz43_2 12600 


F 


M00055586D:G07 


UC2-NonnColon 


529 


463028 


1598.N05.gz43_212608 


F 


M00055597B:B04 


UC2-NormColon 


530 


640695 


1598.P05 .gz43_2 126 1 0 


F 


M00055601B:D12 


UC2-NonnColon 


531 


641925 


1598.D06.gz43 212614 


F 


M00055584AG11 


UC2-NormColon 


532 


449247 


1598.E06.gz43_2 126 15 


F 


M00055585C-.F05 


UC2-NormColon 


533 


646590 


1598.L06.gz43_212622 


F 


M00055594C-.F11 


UC2-NormColon 


534 


641191 


1598. A07.gz43_2 12627 


F 


M00055579C:D04 


UC2-NormColon 


535 


641191 


1598P07gz43 212642 


F 


M00055601B:H02 


UC2-NonnColon 


536 


641838 


1598K09gz43 212669 


F 


M00055593AF08 


UC2-NormColon 


537 


641440 


1598N09ez43 212672 


F 


M00055597D:B05 


UC2-NonnColoii 


JJO 


640826 


1598M10gz43 212687 


F 


M00055596A:E07 


UC2-NormColon 


JJ7 


459521 


1 1598Allez43 212691 


F 


M00055579D:C11 


UC2-NormColon 


540 


! 140909 


1598.Gll.gz43 212697 


F 


M00055588B:H11 


UC2-NormColon 


541 


641467 


1598.J12.gz43 212716 


F 


M00055592B-.C10 


UC2-NormColon 


542 


455864 


1598.L12.gz43 212718 


F 


M00055594D-.F07 


UC2-NonnColon 


543 


641957 


1598.A13.gz43 212723 


F 


M00055579D:G09 


UC2-NormColon 


i 544 


638941 


1598.K14.gz43 212749 


F 


M00055593C:D08 


UC2-NoimColon 


545 


648159 


1598.B15.gz43 212756 


F 


M00055582BA06 


UC2-NonnColon 


546 


509973 


1598.G15.gz43 212761 


F 


M00055588C:G09 


UC2-NormColon 
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SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


547 


641425 


1598.A17.gz43_212787 


F 


M00055580B:B08 


UC2-NormColon 


548 


647312 


1598.O17.gz43_212801 


F 


M00055600C.C02 


UC2-NonnColon 


549 


86311 


1598.C18.gz43_212805 


F 


M00055583C:A01 


UC2-NormColon 


550 


515931 


1598.019.gz43_212833 


F 


M00055600D:B02 


UC2-NontiColon 


551 


651000 


1598.D21.gz43_212854 


F 


M00055585A:E12 


UC2-NormColon 


552 


553087 


1598.E21.gz43_212855 


F 


M00055586C:A06 


UC2-NotmColon 


553 


646914 


1598 .G2 1 .gz43_2 12857 


F 


M00055589A:B06 


UC2-NormColon 


554 


446371 


1598.K21.gz43_212861 


F 


M00055594B:A01 


UC2-NormColon 


555 


640298 


1598.N23.gz43_212896 


F 


M00055599D:C08 


UC2-NormColon 


556 


201904 


1 599 .AO 1 .gz43_2 12923 


F 


M00055602B:B10 


UC2-NonnColon 


557 


641338 


1599.B01.gz43_2 12924 


F 


M00055603D:A09 


UC2-NonnColon 


558 


640221 


1 599 .LO 1 .gz43_2 12934 


F 


M00055619C:F07 


UC2-NormColon 


559 


454527 


1599.F02.gz43_212944 


F 


M00055610B:E04 


UC2-NormColon 


560 


640868 


1599.108.gz43_213043 


F 


M00055615C:E01 


UC2-NonnColon 


561 


641069 


1599.N09.gz43_213064 


F 


M00055623D:G05 


UC2-NormColon 


562 


559380 | 


1599.B10.gz43_213068 


F 


M00055604D:F05 


UC2-NormColon 


563 


641680 


1599.D10.gz43_213070 


F 


M00055608C:E03 


UC2-NormColon 


564 


639703 


1599.11 l.gz43_213091 


F 


M00055615D:C07 


UC2-NoimColon 


565 


641542 


1599.C12.gz43_213101 


F 


M00055606D:C05 


UC2-NonnColon 


566 


5201 


1599 .A15.gz43_213 147 


F 


M00055602D:G08 


UC2-NormColon 


567 


607715 


1599.D15.gz43_213150 


F 


M00055608C:G11 


UC2-NormColon 


568 


640416 


1599.L17.gz43_213190 


F 


M00055620D:D05 


UC2-NormColon 


569 


554833 


1599.B19.gz43_213212 


F 


M00055606A:B11 


UC2-NormColon 


570 


639480 


1599.K19.gz43„213221 


F 


M00055619BJI04 


UC2-NormColon 


571 


607138 


1599.P21.gz43_213258 


F 


M00055628B:B07 


UC2-NormColon 


572 


561626 


1599.L23.gz43_213286 


F 


M00055621B:G03 


UC2-NormColon 


573 


548959 


1600.101.gz43_213315 


F 


M00055639D:D03 


UC2-NormColon 


574 


639934 


1600.H02.gz43_213330 


F 


M00055638D:D07 


UC2-NormColon 


575 


| 640956 


1600.103 .gz43_2 13347 


F 


M00055639D:F08 


UC2-NormColon 


576 


I 554722 


1600.L03.gz43_213350 


F 


M00055644A:D12 


UC2-NormColon 


577 


649717 


1600.D04.gz43_213358 


F 


M00055633D:A02 


UC2-NorniColon 


578 


646695 


1600.H04.gz43 213362 


F 


1 M00055638D:E09 


UC2-NonnColon 


579 


639886 


1600.N04.gz43_213368 


F 


M00055647C:D02 


UC2-NormColon 


580 


451615 


1600.K05.gz43_213381 


F 


M00055643A:C01 


UC2-NormColon 


581 


635965 

vy +*s +s \j w' 


1600.107.gz43_21341 1 


F 


M00055640A:G03 


UC2-NormColon 


582 


379040 


i 1600.L07.gz43_213414 


F 


M00055644B:H12 


UC2-NormColon 


583 


640799 


1600.C10.gz43_213453 


1 F 


M00055632D:A06 


UC2-NoraiColon 


584 


642246 


1600.Kll.gz43_213477 


F 


M00055643B:E05 


UC2-NormColon 


585 


467563 


1600.Nll.gz43_213480 


F 


M00055647D:B11 


UC2-NormColon 


586 


640320 


1600.01 l.gz43_213481 


F 


M00055649A:H07 


UC2-NonnColon 


587 


557401 


1600.112.gz43_213491 


F 


M00055640C:E06 


UC2-NormColon 


588 


646352 


1600.M13.gz43_213511 


F 


M00055646C:B04 


UC2-NormColon 
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X it 

SEQ 

|jdno ( 


CLUSTER 


< 

SEQ NAME 


I 

DRIEN 

T ]_ 


CLONE ID 


1 

LIBRARY 1 


1 J07 


^d^no4. 1 


1600B15.gz43 213532 


F 1 


M00055630B-.G04 


UC2-NormColon 


f ^on 


1 


1600P17ez43 213578 


F 


MO0055651A:E06 


UC2-NormColon 


! ^gi 

| JV1 


^£9000 1 


1600 119 ez43 213603 


F 


M00055641A:C12 


UC2-NormColon 


1 ^oo 




1600N19ffz43 213608 


F 


M00055648B:C01 


UC2-NormColon 


1 ^cyj 1 
1 1 




1600 M20 ez43 213623 


F 


M00055647B:A05 


UC2-NormColon1 


594 


458736 


1600 P20 ez43 213626 


F 


M00055651B-.F08 


UC2-NormColon 1 


595 


553285 


1600O21ez43 213641 


F j 


M00055650AB05 


UC2-NormColon 


596 I 


455075 


1 600 A22 sz43 213643 


F 


M00055629B:G09 


UC2-NormColon 


597 


166041 


1600D23ez43 213662 


F 


M00055634C:F09 


UC2-NormColon 


598 


553516 


1600T23oz43 213668 


F 


M00055642D:A05 


UC2-NonnColon 


599 


559423 


1600B24ez43 213676 • 


F 


M00055632A:B11 


UC2-NormColon 1 


600 131348 


1600 C24 ez43 213677 


F 


M00055633B:G02 


UC2-NormColon 


601 650180 


1600E24gz43 213679 


F 


M00055636A-.H12 


UC2-NormColon 


j 602 


642361 


1600 F24 ez43 213680 


F | 


M00055637B:A01 


UC2-NormCoiaa| 


603 


561069 


1600.K24.gz43 213685 


F 


M00055643D:G11 


UC2-NormColon 


604 


550515 


1600.P24.gz43 213690 


F i 


M00055651C:E01 


UC2-NoimColon 


[ 605 


559854 


1693.D01.gz43 213694 


F ! 


M00055819A:B10 


UC2-NormColon 


606 


650579 


1693J01.gz43_213700 


F 


M00055827A:A12 


UC2-NonnColon 


607 


648567 


1693.L02.gz43 213718 


F 


M00055829C:A07 


UC2-NonnColon 


I 608 


645155 


1693.H03.gz43 213730 


F 


M00055825B:C11 


UC2-NonnColon 


609 


650204 1693.H04.gz43 213746 


F 


| M00055825B:E03 


UC2-NormColon 


610 


643054 


i 1693.H06.gz43 213778 


F 


M00055825B:F09 


UC2-NormColon 


611 


640464 


1693.I06.gz43 213779 


F 


M00055826A.G04 


UC2-NormColon 


612 


| 363172 


1693.A07.gz43 213787 


F 


M00055816D:A10 


UC2-NormColon 


613 


674526 


S 1693.G08.gz43 213809 


F 


M00055823B-.F02 


UC2-NormColon 1 


614 


1 481864 


1693.N08.gz43 213816 


F 


M00055832A:A08 


UC2-NormColon 


615 


| 492242 


1693.A09.gz43 213819 


F 


M00055816D:B11 


UC2-NormColon 


616 


| 638837 


f 1693.C10.gz43 213837 


F 


M00055818BH03 


UC2-NormColon 


617 


517280 


1693.K10.gz43 213845 


F 


M00055828B-.E10 


UC2-NonnColon 


I 618 


452863 


1693.A13.gz43 213883 


F 


M00055816D.E10 


UC2-NormColon 


I 619 


i 642962 


I 1693.H13.gz43 213890 


F 


M00055825DA03 


UC2-NormColon 


620 


645746 1693.I13.gz43 213891 


F 


M00055826B:G07 


UC2-NormColon 


621 


f 645146 1693.L15.gz43 213926 


F 


M00055829DJI10 


UC2-NonnColon 


622 


644781 


1693.M15.gz43 213927 


F 


M00055831A:C06 


UC2-NormColon 


1 623 


643397 


1693.A17.gz43 213947 


F 


M00O55817A:E05 


UC2-NormColon 


624 


645073 


1693.L17.gz43 213958 


F 


M00055830A:G10 


UC2-NormColon 


625 


i 648580 


1693.A19.gz43 213979 


F 


M00055817A:H07 


UC2-NonnColon 1 


i 626 


650487 


\ 1693.I19.gz43 213987 


F 


! M00055826C:G06 


UC2-NormColon 


627 


455716 


1693.J19.gz43 213988 


F 


M00055827DA01 


UC2-NormColon 


628 


551681 


1693.H20.gz43 214002 


F 


M00055825D:D11 


UC2-NormColon 


629 


642054 


1693.P21.gz43 214026 


F 


I M00055835C:F08 


UC2-NormColon 


1 630 


1 556286 


1 1693.J22.gz43 214036 


F 


j M00055827D:C02 


UC2-NormColon | 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


OPJEN 
T 


CLONE ED 


LIBRARY ! 


631 


449542 


1693.A23.gz43_214043 


F i 


M00055817B:C04 


UC2-NormColon 


632 


517237 


1693.M23.gz43_214055 


f I 


M00055831B:C04 


UC2-NormColon 


633 ! 


647191 


1694J01.gz43_214084 


F ! 


M00055846B:F11 


UC2-NormColon 


634 


644212 


1694.G04.gz43_214129 


F | 


M00055843B:D10 


UC2-NormColon 


635 


549624 


1694.P04.gz43_214138 


F ! 


M00055854C:C07 


UC2-NormColon 


636 


645149 


1694.A05.gz43_214139 


F 


M00055836A:B12 


UC2-NormColon 


637 


981 


1 694.F05 .gz43_2 14 144 


F 


M00055842B:A04 


UC2-NoimColon 


638 


647448 


1694.105 .gz43_214147 


F 


M00055845CA.il 


UC2-NormColon 


639 


570939 


1694.P05.gz43_214154 


F 


M00055854C:E03 


UC2-NormColon 


640 


547841 


1 694 .L06.gz43_2 14 166 


F 


M00055849CG07 


UC2-NormColon 


641 


466265 


1694.M06.gz43_214167 


F 


M00055851A:C03 


UC2-NonnColon 


642 


649842 


1 694.M07.gz43_2 14 1 83 


F 


M00055851A:C09 


UC2-NormColon 


643 


468222 


1694.108.gz43_214195 


F 


M00055845C:C12 


UC2-NormColon 


644 


450949 


1694.L08.gz43_214198 


F 


M00055849D:B04 


UC2-NormColon 


l 645 


603388 


1694.O08.gz43_214201 


F 


M00055853C:C12 


UC2-NormColon 


646 


485237 


1 694.E09.gz43_2 1 4207 


F 


M00055841B:F09 


UC2-NormColon 


647 


644063 


1694.F09.gz43_2 14208 


F 


M00055842B:D04 


UC2-NormColon 


648 


639341 


1694.N09.gz43_2 142 16 


F 


M00055852A:C12 


UC2-NormColon 


649 


471364 


1694 JP09.gz43 214218 


F 


M00055854C.H11 


UC2-NormColon 


650 


452735 


1694.A10.gz43_214219 


F 


M00055836C-.D01 


UC2-NormColon 


651 


503546 


1694 .E10.gz43_2 14223 


F 


M00055841B:H03 


UC2-NormColon 


I 652 


646420 


1694.110.gz43_2 14227 


F 


M00055845C.E02 


UC2-NormColon 


653 


647577 


1694.O10.gz43_214233 


i F 


M00055853C:H03 


UC2-NormColon 


654 


504944 


1694.Bll.gz43_214236 


1 F 


M00055837D:G10 


UC2-NormColon 


655 


522869 


1694.Dll.gz43_214238 


F 


M00055840B:B02 


UC2-NormColon 


656 


644548 


1694.F1 l.gz43_214240 


F 


M00055842C:A11 


UC2-NormColon 


657 


644314 


1694.H1 l.gz43_214242 


F 


M00055844D:E12 


UC2-NormColon 


658 


650492 


1694.01 l.gz43_214249 


F 


M00055853D:A07 


UC2-NoimColon 


659 


643800 


1694.B12.gz43_214252 


F 


M00055838A:A03 


UC2-NonnColon 


660 


643843 


1694.E12.gz43_214255 


F 


M00055841C.A03 


UC2-NormColon 


661 


448450 


1694.B13.gz43_214268 


F 


M00055838A:B02 


UC2-NormColon 


662 


643804 


1694.C13.gz43 214269 


F 


M00055839B:A10 


UC2-NormColon 


663 


469511 


1694.L13.gz43_214278 


F 


M00055849D:H09 


UC2-NormColon 


664 


643130 


1694.013.gz43 214281 


F 


M00055853D:B04 


UC2-NomiColon 


665 


640171 


1694.J14.gz43 214292 


F 


M00055846D:G11 


UC2-NormColon 


666 


647522 


1694.M14.gz43_214295 


F 


M00055851A:H10 


UC2-NormColon 


667 


642293 


1694.N14.gz43_2 14296 


F 


M00055852B-.G09 


UC2-NormColon 


668 


404816 


1694.P14.gz43_214298 


F 


M00055855A:B11 


UC2-NormColon 


669 


447597 


1694.A15.gz43_214299 


F 


M00055837A:B08 


UC2-NormColon 


670 


467901 


1694.B15.gz43. 214300 


F 


M00055838B:D06 


UC2-NormColon 


671 


648039 


1694.C15.gz43 214301 


F 


M00055839B:C07 


UC2-NonnColoii 


| 672 


607430 


1694.E15.gz43 214303 


F 


M00055841C:D05 


UC2-NormColon 
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SEQ 
DNO < 


XUSTER 


K 

SEQ NAME 


"YDTT7XT 

JKllilN 

T 


1 


T TRRARY 


673 


645505 


1694.F15.gz43_214304 


F 


M00055842D:C02 


Uv>2-JNorniv>oion | 


674 


472119 


1694.G15.gz43 214305 


F 


M00055843D:H01 


UC2-iMormL/Olon I 


675 


645197 


1694.K15.gz43 214309 


F 


M00055848C:A02 


UUz-Nontt^oion | 


676 


650217 


1694.015.gz43 214313 


F 


M00055853D:C07 


Ul^2-JN0TCn\^oioii | 


677 


644210 


1694.A16.gz43_214315 


F 


M00055837A:DO9 


UUz-lNonii^oiun | 


678 


466697 


1694.H16.gz43 214322 


F 


M00055844D:H09 


Uv^z-iSorniv^oion | 


679 


650276 


1694.I16.gz43 214323 


F 


M00055845D:G11 


UC2-JNorini^oion | 


680 


562229 


1694.M16.gz43 214327 


F 


M0005585 1B:B09 


Ui^2-lNonTii^oion | 


681 


462659 


1694.P16.gz43 214330 


F 


*» r r\ /"v m & C\ t* A A^% f\F 1 

M00055855A:G05 


UCZ-JNornii^oion i 


682 


644494 


1694.B17.gz43 214332 


F 


M00055838B:H04 1 


TTp'l "XT nrm nn 1 

uUZ-jNontiL/Oion. i 


683 


24730 


1694.D17.gz43 214334 


F 


M00055840C:D06 


Ui^z-iNonn^oion | 


684 


599759 


1694.E17.gz43 214335 


F 


M00055841C:D11 j 




685 


15001 


1694.F17.gz43 214336 


F 


M00055842D:D07 j 


UCZ-lNoniiv^oion i 


686 


558981 


1694.K17.gz43 214341 


F 


M00055848C:G07 f 


IJCZ-JNonricoion i 


687 


592122 


1694.0 17.gz43 214345 


F 


M00055854A:B07 1 


U^Z-rNOiiuv^oion | 


688 


489249 


1694.A18.sz43 214347 


F 


M00055837A:F02 | 


UCZ-iNonncoion | 


689 


447002 


1694.B18.gz43 214348 


F 


M00055838C:A08 1 


VJC2-JNorm\^oioii j 


690 


558890 


1694.C18.gz43 214349 


F 


M00055839B:E07 


UC2-N ormcoion | 


691 644548 


1694.D18.gz43 214350 


F 


M00055840C.H06 


UC2-Norm\Jolon | 


692 


643825 


1694.G18.gz43 214353 


F 


M00055844A:D03 


UC2-JNoniit^oioii l 


693 


645289 


1694.A19.gz43 214363 


F 


, M00055837A:H08 


UC2-Nonnv/Oloii I 


694 


643948 


1694.B19.gz43 214364 


F 


M00055838C.B08 


U C2-JN ormcoion 


1 695 


537586 


1694.C19.gz43 214365 


F 


M00055839B:H09 


| VJC2-JNormCoion | 


696 


462557 


1694.F19.gz43 214368 


F 


M00055842D:F07 


1 UC2-Nonn\^oloti ] 


| 697 


647577 


1694.G19.gz43 214369 


F 


1 M00055844A:D07 


| UC2-JNormi^oioii i 


698 


643999 


1694.H19.gz43 214370 


F 


| M00055845A:C03 


\ UC2-Normv>oioii | 


699 


651131 


1694.J19.gz43 214372 


F 


1 M00055847B:G12 


1 uc2-*iSonti^oioii i 


700 


425923 


1694.M19.gz43 214375 


F 


1 M00055851B:G10 


| UL/2-rsorm^oion j 


701 


639510 


1694.019.gz43 214377 


F 


| M00055854A:D01 


| uC2-rsormcoion i 


702 


643977 


1694.D20.gz43 214382 


F 


1 M00055840D:B03 


| 0^2-^011111^01011 i 


703 


140648 


1694.E20.gz43 214383 


F 


| M00055841C:H04 


| ucz-iNoniH^oioii 


704 


649068 


1694.G20.gz43 214385 


F 


M00055844A:F11 


| UC2-rsorm\^oion i 


705 


644708 


1694.N20.gz43 214392 


F 


M00055852D:B11 


1 UL/2-JNontiL/Oion 


i 706 


643909 


1694.P20.gz43 214394 


F 


M00055855B:B11 


1 uC2-WormL/Oion | 


707 


650528 


1694.E21.gz43 214399 


F 


M00055841D:C11 


| UC2-Norm^olon | 


708 


644569 


1694.H21.gz43 214402 


F 


1 MUUU55o43A.rillJ 


1 TTP9-MnrmPn1otl I 


709 


646459 


1694.J21.gz43 214404 


F 


M00055847C.A11 


UC2-NormColon 


710 


608873 


1694.L21 .gz43_2 14406 


F 


M00055850C:D01 


UC2-NonnColon 


711 


649611 


1694.N21.gz43 214408 


F 


M00055852D-.G12 


UC2-NonnColon 


I 712 


465576 


1694.P21.gz43 214410 


F 


M00055855B:D12 


UC2-NormColon 1 


1 713 


467521 


1694.C22.gz43 214413 


F 


[ M00055839C.B11 


UC2-NormColoa 


714 


646810 


1694.J22.gz43 214420 


F 


1 M00055847C:C01 


j UC2-NormColon | 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


715 


647318 


1694.L22.gz43_214422 


F 


M00055850C:G05 


UC2-NormColon 


716 


638854 


1694.022.gz43_2 14425 


F 1 


M00055854AE04 


UC2-NormColon 


717 


644987 


1694.A23.gz43j214427 


F 


M00055837B:E07 


UC2-NormColon 


718 


645948 


1694.C23.gz43_214429 


F 


M00055839C:D06 


UC2-NormColon 


719 


419751 


1694.123 .gz43_2 14435 


F 


M00055846B:B12 


UC2-NormColon 


720 


643594 


1694.M23.gz43__214439 


F 


M00055851C:F12 


UC2-NormColon 


721 


603388 


1694.P23.gz43 214442 


F 


M00055855C:F11 


UC2-NoimColon 


722 


644075 


1694.G24.gz43 214449 


F 


M00055844B:C12 


UC2-NormColon 


723 


542282 


1694.124.gz43_2 1445 1 


F | 


M00055846B:C12 


UC2-NormColon 


724 


452976 


1694. J24.gz43_2 14452 


F 


M00055847C:H09 


UC2-NormColon 


725 


472801 


1694.M24.gz43_2 14455 


F 


MO0055851C:HO5 


UC2-NonnColon 


726 


646633 


1695.P03.gz43 214910 


F 


M00055875B:E09 


UC2-NormColon 


727 


509027 


1695.A04.gz43_21491 1 


F 


M00055855D:D12 


UC2-NormColon 


728 


596882 


1695.D05.gz43_214930 


F 


M00055860D:E04 


UC2-NormColon 


729 


644442 


1695 A07.gz43_2 14959 


F 


M00055855D:G08 


UC2-NormColon 


730 


644047 


1695.F07.gz43_214964 


F 


M00055863D:D09 


UC2-NormColon 


731 


397399 I 


1695 .H07.gz43_2 14966 


F 


M00055866AG10 


UC2-NormColon 


732 


564854 


1695 A08.gz43_2 14975 


F 


M00055856AC06 


UC2-NormColon 


733 


646372 


1695 JB08.gz43_2 14976 


F 


M00055857C:D09 


UC2-NormColon 


734 


645848 


1695.D08.gz43_214978 


F 


M00055861B:F04 


UC2-NormColon 


735 


447035 


1695 .M08.gz43_2 14987 


F 


M00055872D:D12 


UC2-NormColon 


l 736 


644149 


1695 A09.gz43_2 14991 


F 


M00055856AD12 


UC2-NormColon 


737 


640147 


1695.E09.gz43 214995 


F 


M00055862D:B02 


UC2-NonnColon 


738 


524261 


1695.K09.gz43_215001 


F 


M00055871AH06 


UC2-NormColon 


739 


449500 


1695.Ell.gz43 215027 


F 


M00055862DD06 


UC2-NormColon 


740 


648819 


1695.Kll.gz43_2 15033 


F 


M00055871B:BO3 


UC2-NormColon 


741 


! 644755 


1695.Lll.gz43_215034 


F 


M00055872A:C08 


UC2-NormColon 


742 


516512 


1695 .Pll.gz43_2 15038 


F 


M00055875C.C07 


UC2-NormColon 


743 


380550 


1695.L12.gz43_215050 


F 


M00055872A:D08 


UC2-NormColon 


744 


645538 


1695.D13.gz43_215058 


F 


M00055861C:GO3 


UC2-NormColon 


745 


644047 


1695.F13.gz43__215060 


F 


M00055864AC09 


UC2-NormColon 


746 


650773 


1695.F14.gz43__2 15076 


F 


M00055864AE11 


UC2-NormColon 


i 747 


449936 


1695.J14.gz43 215080 


F 


M00055869C:G06 


UC2-NormColon 


748 


554646 


1695.O14.gz43_215085 


F 


M00055875AGO5 


UC2-NormColon 


749 


640534 


1695.L15.gz43 215098 


F 


M00055872AE11 


UC2-NormColon 


750 


642411 


1695.J16.gz43 215112 


F 


M00055869DA07 


UC2-NormColon 


751 


643451. 


1695.M16.gz43_215115 


F 


M00055873A.E03 


UC2-NormColon 


752 


644435 


1695.H17.gz43_215126 


F 


M00055866C:G09 


UC2-NoraiColon 


753 


561877 


1695.F18.gz43_215140 


F 


M00055864B:C09 


UC2-NormColon 


754 


645497 


1695.N18.gz43 215148 


F 


M00055874A:F06 


UC2-NormColon 


755 


644030 


1695.E20.gz43 215171 


F 


M00055863B:C07 


UC2-NonnColon 


756 


140224 


1695.D21.gz43_215186 


F 


M00055862A:C01 


UC2-NoraiColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


757 


643099 


1 695 K2 1 .gz43_2 1 5 1 93 


F 


M00055871C:C07 


UC2-NormColon 


758 


643333 


1695.P22.gz43_215214 


F 


M00055875D:D01 


UC2-NormColon 


759 


649660 


1 695 .G23 .gz43_2 1 5 22 1 


F 


M00055865C:G11 


UC2-NormColon 


760 


645288 


1 695 .H23 .gz43_2 1 5 222 


F 


M00055867A:B02 


UC2-NoraiColon 


761 


650517 


1695.G24.gz43 215237 


F 


M00055865CH06 


UC2-NormColon 


762 


552201 


1 696 JO 1 .gz43_2 1 5256 


F 


M00055885C:B07 


UC2-NonnColon 


763 


643513 


1696.P02.gz43 215278 


F 


M00055891B:F09 


UC2-NormColon 


764 


447807 


1696.D04.gz43 215298 


F 


M00055880B:A06 


UC2-NormColon 


765 


639256 j 


1696.M04.gz43 215307 


F 


M00055888C:F07 


UC2-NormColon 


766 


179760 


1696.G05.gz43_215317 


F 


M00055883A:C02 


UC2-NormColon 


767 


459274 


1696.G06.gz43 215333 


F 


M00055883A:C10 


UC2-NormColon 


768 


345761 


1696.106.gz43_2 15335 


F 


M00055884D:F07 


UC2-NormColon 


769 


640356 


1696.M06.gz43_215339 


F 


M00055888C:G09 


UC2-NoxmColon 


770 


645638 


1696.A07.gz43_215343 


F 


M00055877A:H04 


UC2-NonnColon 


771 


464171 


1696.Nll.gz43 215420 


F 


M00055890A:A06 


UC2-NormColon 


772 


446225 


1696.E13.gz43 215443 


F 


M00055881A:G07 


UC2-NormColon 


j 773 


450559 


1696.F14.gz43 215460 


F 


M00055882C:A06 


UC2-NormColon 


774 


643248 


1696L14ez43 215466 


F 


M00055887DC11 


UC2-NormColon 

^sAw X T VA lAIV/ VX VIA 


775 


549114 


1696.N14.gz43 215468 


F 


M00055890A:D01 


UC2-NormColon 


776 


645215 


1696.D15.gz43 215474 


F 


M00055880C:F07 


UC2-NormColon 


777 


555172 


1696.I15.gz43 215479 


F 


M00055885B:A11 


UC2-NormColon 


778 


593715 


1696.B16.gz43 215488 


F 


M00055878C:C02 


UC2-NormColon 


779 


586992 


1696.I16.gz43 215495 


F 


M00055885B:B04 


UC2-NormColon \ 


780 


451670 


1696.H17.gz43 215510 


F 


M00055884B:F10 


UC2-NormColon 


781 


552628 


1696.H18.gz43 215526 


F 


M00055884B:H07 


UC2-NormColon 


782 


643233 


1696.A21.gz43 215567 


F 


M00055877D:C05 


UC2-NonnColo'? 


783 


640400 


1696F21ez43 215572 


F 


M0005 5 8 82D B02 


UC2-NormCol< rti 


784 


644468 


1696G21ez43 215573 


F 


M00055883DB06 


UC2-NormColon 


785 


643046 


1696H21ez43 215574 


F 


M00055884CB07 


UC2-NormColon 


786 


649068 


1696N21ez43 215580 


F 


M00055890AG12 


UC2-NonnColon 


787 


648063 


1696.E22.gz43 215587 


F 


M00055881D:A02 


UC2-NormColon 


788 


570248 


1696M23sz43 215611 


F 


M00055889BE12 


UC2-NormColon 


789 


643724 


! 1696N23az43 215612 


F 


M00055890AH11 


UC2-NormColon 


790 


649027 


1696.P23.gz43 215614 


F 


M000558.92A:F04 


UC2-NormColon 


791 


643366 


1696P24cz43 215630 


F 


M00055892BD02 


UC2-NormColon 


792 


526575 


1705 .E01.gz43__2 15635 


F 


M00055900D:D03 


UC2-NormColon 


793 


449737 


1705.O02.gz43_215661 


F 


M00055914A:A03 


UC2-NormColon 


794 


394373 


1705.B03.gz43_215664 


F 


M00055894B:E09 


UC2-NormColon 


795 


557361 


1 1705.C03.gz43_215665 


F 


M00055896D:G10 


UC2-NoraiColon 


796 


644364 


1705.K04.gz43_215689 


F 


M00055908D:F09 


UC2-NormColon 


797 


644242 


1705.M05.gz43_215707 


F 


M00055911B-.E06 


UC2-NormColon 


! 798 


644407 


! 1705.N05.gz43_215708 


F 


M00055912D:F04 


UC2-NormColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


7Q0 

/ ^7^7 


643513 


1705 F06 sz43 215716 


F 


M00055902A:H11 


UC2-NormColon 


ROO 


639194 


1705H06ez43 215718 

X / yJ -J .X XV/ VJ » jj>» ^» X # XI* 


F 


M00055905B.H01 


UC2-NormColon 


i R01 


454790 


1705 106 ffz43 215719 


F 


M00055906D:G03 


UC2-NonnColon 


1 R07 

Ol/Z 


OtJ J VI v 


1705 O09ffz43 215773 


F 


M00055914CD12 


UC2-NormCoIon 


R03 


647522 


1705E10ffz43 215779 


F 


M00055901B:A02 


UC2-NormColon 


! ROA 


64RS76 


1705 Fill oz43 215792 

X / V/~7 . XJ X X . gZjT ^ ■£» X~J I S 


F 

X 


M00055896AG01 


UC2-NormColon 


ROS 


4RS737 

'rOJZJ / 


1705 Cll oz43 215793 


X 


M00055898BE07 


UC2-NormColon 




AAA 1 OR 


170S 01 1 0743 71SR05 


F 

X 


M00055914CG01 


UC2-NormColon 

V^/Z-# 1 l vl lii W Vl v/xx 


RH7 


6A3Q14 

OHOi? 1*+ 


1 70S 01 7 oz43 215813 


F 

X 


M0005 5 904B BO 1 


UC2-NortnColon 


ror 


641645 


1705 012 oz43 215821 


F 


M00055914CG11 


UC2-NonnColon 


ROQ 


60SSQ6 ! 


1705 HI 3 o:z43 215830 


F 


M00055905DB06 


UC2-NormColon 


ri n 

O 1U 


S641 34. 


1705 Ol 3 o743 215837 


F 


M00055914DA08 


UC2-NormColon 


RI 1 

, Oil 


643 04R 1 


1705 R14 0-743 215840 


F 


M00055896B:C06 


UC2-NormColon 


R17 
O 1Z 


SS7Q47 


170SF14az43 215844 


F 


M00055902DA07 


UC2-NonnColoxi 


Rl 3 


1 S001 


170SO1Soz43 715861 


F 

X 


M00055904CA07 


UC2-NormColon 


CIA 


6AQ 1 70 


170SP1SO743 715R70 

1 / U-/ .x 1 J .gZtJ Z. JLJO/w 


F 

X 


M00055916BC02 


UC2-NormColon 


RI s 
513 


6AA3Q0 


170SD16o743 715R74 


F 

X 


M00055900AF08 


UC2-NormColon 

V_/Z> 1 1 villi W/lv/ii 


olO 


AA^ORQ 


170SK16o743 715RR1 

1 /UJ.XVIU.^Z'tO zuoox 


F 

X 


M00055909BG10 


UC2-NormColon 


01 / 


SS3SR7 


170SO16o743 715R85 

1 /UJ.U10.ftZ*tJ Z1JOOJ 


F 

X 


M00055914DG06 


UC2-NormColon 


SIR 
0 1 0 


SS07A7 


1 70S W1 7 o743 7 1 5R94 

1 /UJ.lll / .gZT"J Z1J07T 


F 

X 


M00055905DH02 

± YX V/ \y \J~S ^ \J ~f X^ »X X\J 


UC2-NormColon 


RI Q 
0 iy 


63Q41 3 


170SO17o743 215901 


F 

X 


M00055915AA06 


UC2-NonnColon 


R70 


64SR44 


170SP1Ro743 715905 


F 


M00055899AB03 


UC2-NormColon 


oZl 


6A10R7 


1 70S F 1 R o743 7 1 5907 


F 

X 


M00055901CC10 


UC2-NormColon 


R77 

5ZZ 


AR7 1 63 


170SF1Ro743 715908 


F 

X 


M00055902DH10 


UC2-NormColon 


R73 


643R0R 


170S CtIR o743 215909 


F 


M00055904D:A06 


UC2-NonnColon 


oZ*f 


A6R 1 47 

*rOO 1*t / 


170S T 19 o743 215930 


F 

X 


M00055910CG01 


UC2-NortnColon 


R7S 


AA677R 


170S T170 o^43 215938 


F 

X 


M00055900BE12 


UC2-NormColon 


oZO 


A47376 

*+*t / OZO 


1 70S 1^70 o743 7 1 5945 


F 

X 


M00055909CE08 


UC2-NoimColon 


oZ / 


6RS001 

OOJvU 1 


170S T 70 o743 215946 


F 


M00055910CG04 


UC2-NormColon 


0Z0 


6AS7R1 

OtJ / O 1 


170S 1C71 0743 715961 

1 /UJ.X\ZX.gZ t tJ J&XJ7UX 


F 

X 


M00055909C*G09 


UC2-NormColon 


ozy 


AA3R7S 


1 70S T 7 1 07-43 7 1 SQ67 

1 / UJ.iwZX.gZ^rJ ^U^Vx£r 


F 

X 


M000559 1 OD" A03 

IVlVVV*' *J X \J X** .A. X\J ~J 


UC2-NonnColon 


8^A 


37A3A0 


170S A 77 0743 715967 


F 

X 


M00055894AH08 


UC2-NormColon 


8^5 1 


£An7S6 


170S "M77 cr-7'43 71SQ79 

1 / U J .1V1ZZ .gZ'r J Z X J 7 / 7 


F 

X 


M00055912CC08 


UC2-NormColon 


8^0 


AHH7A 1 


17HS R73 ot43 71S0R4 


F 

X 


M00055896CH10 

i.VX V/ v/ \J *J *J KJ *S \J \»X . A XXV/ 


UC2-NonnColon 






1 70S T77A CT743 7 1 6004 


F 

X 


M00055903BC10 


UC2-NormColon 


834 


644354 


1705.G24.gz43_2160O5 


F 


M00055905A:F11 


UC2-NormColon 


835 


89082 


1705.K24.gz43_216009 


F 


M00055909D:E01 


UC2-NortnColon 


| 836 


643991 


1705.O24.gz43 216013 


F 


M00055915C:B09 


UC2-NormColon 


837 


459881 


1706.B01.gz43 216028 


F 


M00055918B:B10 


UC2-NoraiColon 


838 


644554 


1706.C01.gz43_216029 


F 


M00055919D:H07 


UC2-NormColon 


839 


557175 


1706.B03.gz43_216060 


F 


M00055918B:F10 


UC2-NormColon 


840 


641484 


1706.F03.gz43 216064 


F 


M00055924C:A09 


UC2-NormColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


0RIEN 
T 


CLONE ID 


LIBRARY 

1*11 IXvfilV -L 


841 

OT X 


651 131 

UJ IXJl 


1706 TtfiS cyr43 916009 

1 /UO.DV/J .gZtJ Z IOU7Z 


A* 


tv/tooo^ ^ o i ft r 1 • r* 04 


1TP0 

u uz-jn oimi^oion 


842 


644690 


1706 T0S <rz43 916100 


V 
X* 


A>TO0O^^O3O A -fi04 
IVIUU l/JJ 7J U/V. \ Ju*fr 


u i^z-jn ormL>o ion 


843 


410000 


1 706 F06 0^4 3 916111 ; 

A /UU.AjUU.gZ*tJ ZIOAil 


r 


lvyfooo^^ooon-n-i 0 
iviuuu j j yzzu.vjr 1 u 


ucz-JNorrni^oioii 


844 


4467Q5 


1 706 A/f 06 o-t-43 9161 1 Q 
1 /UO.JYJUO.gZ'rJ ZlOliy 


Jr 


^^000^^034/^-^06 
XVIU UUj j y j*f .JdUO 


u L/Z-jn ormuoioii 


R4S 

OtJ 


638808 


1 706 T 07 otA'X 916134 
1 /UO.A^U / .gZ'f J_Z A 0 1 J*t 


JT 


AyfOOO>^Q33 A «T7A*; 

jviu uu j j y j j j\ . JC/U j 


ITfO "XTrxT-»^-» /-*1 y-»-M 


£46 1 


6465 Q6 


1 706 "M07 c*t43 916136 
1 /UO.iNU / ZlOl JO 


T7 

r 


A/f OOO^ ^Q3 61^ -TIO^ 
JLVaUUU JJ7J OVy . J3 U j 


U \^Z-1N OH1IV-/O1OII 


£47 


630001 


1 706 P07 fyz43 9 1 61 38 
1 /UO.JrU / .gZ 4 * j_Z 1 0 1 jo 


A* 


A/f000^^030Tl- All 
IVIUU Uj J7J7I) .AX 1 


u cz-iN ornu^oion 


£48 


648370 


1 706 "WOK a-743 9 1 6 1 S9 




A/TO0O5 5036P-PI03 
iVXU U U J J ^ J O V- . JL^ U j 


TT^O \Tnrm r>A rw\ 

U l^Z-iN 01im_/0 JOI1 


840 


650038 


1 706 TOO <r^43 9 1 6 1 64 

A /uOJU^.gZtJ ZAOIO 4 * 


17 


A/T0005 5 030^-1^03 
IVIUUU J J y j U . JLIU j 


T T^O MArmpAlAti 

u i^z-jn onncoion 


850 


643 1 70 

UtJ A / V/ 


1706F10cr743 91617S 


p 


A/T0005 5 093I*-r , O7 


U \ssL -IN OIlTl\-/0 JOlA 




643930 


1 706 141 0 a743 916178 


P 


A^00O55097TI-F1 1 
IVaUUUj Ji*Z / U.CL X 


u v^z -in onriv^o ion 


859 

O JZ, 


630506 


1706 T10 o^A'X 916180 
A /uu.J lU.gZTJ Z lOloU 


17 


A/f000^Q30r*-F06 
1V1U UU J J zf j Ul^ . JT U O 


1T#^0 \TAtW»PAlA« 


853 

OJJ 


605 114 

OUJ 


1 706 T 1 0 0^43 9 1 6 1 R9 




AA000^5Q33'R-'R1 1 
IYIUUUj Jf jjJj.JO 1 1 


IT^O "M^t-rM/^^1/-*-r» 

u v^z-in orniv^oion 


854 


456090 


1706"M11 cyz43 916900 
A /UO.IN A 1 -gZ 4 * J_Z JOZUU 


17 

A* 


A^O0O55Q36n-PO3 
IYIUUUj j 7j OJj .r U j 


TTPO T^m-mC'ryArxry 


85S 


6470^ O 
0*t /yjz 


1706 P1 19 rr-r43 016004 
A /UO.UlZ.gZ*fj_Z10ZU*l' 


T7 

r 


A^OOO^^QI O A • AO6 

jvluuu j j y 1 va. auo 


UL/Z-JNOIlTlvOlOll 


! 856 


6^0470 
OjU** /U 


1 706 A/11 9 0^43 01601^ 
1 /UO.lvllZ.gZf J__Z1DZ1 J 


p 
r 


A/TOOO^ ^IQ34ri'UOO 

iviuuu j j y j'f u . xiuy 


ucz-jn oimuoion 


5157 


46*1^76 


1 706 FIT 3 niA.'X 0 1 6999 
1 /UO.J-J1 J.gZ*t-j_Z10ZZZ 


17 

A 4 


AVfOAO^QOO A .T7n<; 
IVIUUU J j 7ZZ A. r Uj 


U uz-jn ormuolon 


OJO 


4^41 3 


rt-r/l^ 016034 
1 /UO.lr 1 J.gZ^j_Zi0Z:>4 


r 


Ayf000^^030T^'"TI07 

JVIUUUj jyjyJJ.JLJU / 


U L/Z-1N ormColon 






nn^iiH iytA^ oi60^a 

1 /UO.IT.1 J.gZHO__ZlOZjo 


17 


A>TOOO^^OOft A */^06 
JVLUUU J j yzo A. UO 


ITfO XT^t-*v»^^1^*% 

U uz-JN ormColon 


SAO 


JJ07ZJ 


1706 T1 ^ rr-741 01606O 
1 / UO . J 1 J .gZ^f J__Z 1 0Z0U 


P 


Ayrooo ^ ^ 03 0 .un ^ 
JVLUUU j j y jUI^.JtIU j 


u Uz-jn ormcoion 


£61 


6443 Ol 
0*FK>U1 


1 706 A 1 6 crrrAI O 1 6067 


p 

Jr 


AvfOOO^ < 0 1 71VPA7 
JVLUUUjjyi /D.tLV)/ 


ucz-JNormcoion 


869 
oOZ 


447^90 


1706 T16 cttAI 916076 
1 /UO.J lO.gZ4j_Z10Z/O 


p 
r 


AVTOOO^ ^ Q3 f\T\ • "DO < 

jviuuu j j y j uu .dUj 


TTr'o >lt r*. -i 

ui^z-jn oimuolon 


' 863 
ODj 


447496 


1 706 1 7 <v*43 916956 
1 /UO.L/1 /.gZ*fj__Z10Z50 


p 
r 


A/rono j ^<iooo'Q* a 06 
jviuuu j j yzzu . AUO 


TTr*7 \Ta«mPa1a« 


864 


61 oao3 

OJLUoyj 


1 706 Ml 7 rr-743 0 1 60OO 

i /uo.rii /.gz<+j_z iozyu 


p 
r 


AyfOOO^^IOOft A *P1 1 
JVIUUUj jyZoA.lil 1 


u cz- jn ontiL/Oion 


86S 
ouj 


613696 


1706 T17 016000 
1 / UO . J 1 / .gZ*f j Z A OZyZ 


p 

Jr 


AAOOO*\ ^030TVHnO 

iviuu UJJ7J kjxj .13 uy 


u i^z-jn ormv>oioti 


©66 


64^009 
D4JU7Z 


1 706 >J1 7 o-t-43 016906 

i /uo.jn I /.gz*to_ziozyo 


p 


AyfOOO^ ^037 A 'PA7 
JVIUUU JJ7J / A.JrU / 


TTr*0 ATAmipAlAri 

UL/Z-JNormcoioii 


867 
oO / 


64Q873 


1 706 P 1 Q rr-rAI 9 1 6330 
A /UO.A^iy.gZ^f j_Z10jjU 


P 

Jr 


Ayr000^^040*R« A Oft 
IVIUUU J J y*fUJ3 .AUo 


ITPO \TArm /^rt 

u i^z-JN orm^oion 


86R 
0O0 


64^098 


1706 AOO rr^4^ 016^^1 
A /UO j^VZU.gZH- j^ZIOj j 1 


p 


AyTOOO^OI 8 A«P1 O 
JVIUUUj J7 1 5A.r 1U 


UCZ-JNOiniL/OlOIl 


860 


6301 54 


1 706 POO otAI 9 1 6336 
A /UO.rZU.gZH-j ZIOjjO 


P 
r 


a^ooo^ ^ 00 • noo 
iviuuu j j yz jj3 . uuy 


u v_,z-iN onnv^ojoii 


870 

O / U 


640085 


1706 FI91 ot43 9163^0 
A / UO. l^Z a. gZHj_Z AO j jU 


P 


A^OOO^ ^ Q99T* -r^OQ 
IVIUU Uj j y ZZaj) . vjv y 


U l^Z-l>l OnTlv^OJOAA 


871 


643339 

UtJJJZ. 


1706 F91 crr43 9163^9 

A / UO.FZ 1 -gZt J ZlUJJZ 


p 


A100055095R*T11 0 

IVIUUU J J "ZjJj .JL/ A U 


u i^z-jn orniv^oioii 


877 


643383 


1 706 W9 1 or743 9 1 63^4 
1 /UO.riZl.gZH-J^jZ JLOjj** 


P 

r 


A^"OOO^^Q9»'R-P1 1 
IVIUUUj jyZoJD.JvA 1 


U V^Z-IN OJJAll^OlOIl 


873 

O / J 


904 


1706 T91 a-743 9163S6 
1 / UO.JZ A .gZ*f j_Z IOjjO 


p 
r 


Ayf0O0^^030n-PO^ 
XVaUUU j J y j U J-/ . J7 Uj 


u v^z-rs onriv^oion 


874 


64171 6 


1 706 TSJ9 1 <yy43 9 1 6360 
1 /U0«iNZl.gZ < TJ_Z10j0U 


p 
r 


A/TOOO^ 5 037^^051 
IVJLUUUJjyj /L^XUo 


u v^z-in oniiv^oioii 


87 S 

O / J 


643697 


1 706 F99 crrz43 9 1 6367 
A /UO.XlfZZ.gZI'J^ZlOjO / 


p 


A^00O^^094*R*TlO9 
IVIUU Uj j yz*f 13 . JjUZ 


u L/Z-jn orrni_/0 Jon 


876 


640672 


1706 M23 ffz43 216391 


P 

A 


M0005 5 93 5 C E03 


Ur^-'MormColoii 

VJ V>i< J. > V/X X 11V_/ WXl-ZAX 


877 


550237 


1707.J02.gz43_216453 


F 


M00055956D:B01 


UC2-NomiColon 


| 878 


646317 


1707.B04.gz43_216477 


F 


M00055942B:H10 


UC2-NormColon 


879 


643968 


1707.K05.gz43_216502 


F 


M00055958C:E03 


UC2-NormColon 


880 


649549 


1707.F06.gz43_216513 


F 


1 M00055949D:B07 


UC2-NormColon 


881 


513619 


1707.O07.gz43_216538 


F 


M00055967A:F11 


UC2-NormColon 


882 


460245 


1707J08.gz43_216548 


F 


M00055955B:C06 


UC2-NormColon 
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ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


883 


646061 


1707.O08.gz43_216554 


F 


M00055967A:G01 


UC2-NormColon 


884 


645273 


1707.O09.gz43_216570 


F 


M00055967B:B12 


UC2-NormColon 


885 


524546 


1707.A12.gz43_216604 


F 


M00055941B:A04 


UC2-NormColon 


886 


648782 


1707.B12.gz43__216605 


F 


M00055942D:A01 


UC2-NormColon 


887 


645900 


1707.0 1 3 .gz43_2 16634 


F 


M00055967B:F07 


UC2-NormColon 


888 


648555 


1707.D14.gz43_216639 


F 


M00055945C:D09 


UC2-NormColon 


889 


452316 


1707J14.gz43_216645 


F 


M00055957B:F11 


UC2-NormColon 


890 


461363 


1707.D17.gz43_216687 


F 


M00055945D:D01 


UC2-NormColon 


891 


644720 


1707.D18.gz43_216703 


F 


M00055945D:E08 


UC2-NormColon 


892 


647312 


1707.L18.gz43_216711 


F 


M00055961D:F06 


UC2-NormColon 


893 


646190 


1707.N18.gz43_216713 


F 


M00055966B:H09 


UC2-NormColon 


894 


531529 


1707.H19.gz43J216723 


F 


M00055954C:G07 


UC2-NormColon 


895 


393599 


1707.E20.gz43_216736 


F ! 


M00055947C:E09 


UC2-NonnColon 


896 


645508 


1707.N20.gz43_216745 


F 


M00055966C:D03 


UC2-NormColon 


897 


383609 


1707.O23.gz43_216794 


F 


M00055967D:C05 


UC2-NormColon 


898 


447805 


1707.B24.gz43_216797 


F 


M00055943B:E01 


UC2-NormColon 


899 


694643 


1 707 .P24.gz43_2 16811 


F 


M00055969A:F02 


UC2-NormColon 


900 


647924 ; 


1708.10 Lgz43_216820 


F 


M00055980B:B07 


UC2-NormColon 


901 


645920 


1708.P01.gz43_216827 


F 


M00055990A:F07 


UC2-NormColon 


902 


641484 


1708.M02.gz43_216840 


F 


M00055985D.E09 


UC2-NormColon 


903 


640672 


1708.A04.gz43_2 16860 


F 


M00055969B:B05 


UC2-NormColon 


904 


644226 


1708K04.gz43 216870 


F 


M00055982C:H01 


UC2-NormColon 


905 


557852 


1708.C06.gz43_216894 


F 


M00055972C:C09 


UC2-NormColon 


906 


558052 


1708.C07.gz43_2 1 69 10 


F 


M00055972C:F02 


UC2-NormColon 


907 


644354 


1708.I08.gz43 216932 


F 


M00055980C:G12 


UC2-NormColon 


908 


645746 


1708.L08.gz43_216935 


F 


M00055984D:E04 


UC2-NormColon 


909 


647539 


1708.M09.gz43J216952 


F 


M00055986A:F05 


UC2-NormColon 


910 


645288 


i 1708.L10.gz43_216967 


F 


M00055985A:B06 


UC2-NormColon 


911 


645289 


1708.All.gz43_216972 


F 


M00055969D:B08 


UC2-NormColon 


912 


642058 


1708 .G 1 1 .gz43_2 1 6978 


F 


M00055978A:H03 


UC2-NormColon 


913 


642955 


1708.A12.gz43_216988 


F 


M00055969D:F08 


UC2-NortnColon 


914 


640092 


1708.H12.gz43_2 16995 


F 


M00055979B:G07 


UC2-NormColon 


915 


558452 


1708.P 12.gz43_2 17003 


F 


M00055990D:B02 


UC2-NormColon 


916 


645427 


1708.L13.gz43 217015 


F 


M00055985B:C02 


UC2-NormColon 


917 


649082 


1708.B14.gz43_217021 


F 


M00055971B:A11 


UC2-NormColon 


I 918 


451709 


1708.K14.gz43 217030 


F 


M00055983C:C09 


UC2-NormColon 


919 


640204 


1708.H15.gz43_217043 


F 


M00055979C:B07 


UC2-NormColon 


920 


640179 


1708.N15.gz43_217049 


F 


M00055988A:A12 


UC2-NormColon 


921 


582549 


1708.P15.gz43 217051 


F 


M00055990D:G09 


UC2-NormColon 


922 


484355 


1708.G16.gz43 217058 


F 


M00055978B:F01 


UC2-NormColon 


923 


646013 


1708.J17.gz43_217077 


F 


M00055982A:G06 


UC2-NormColon 


924 


| 649309 


1708.P18.gz43_217099 


F 


M00055991A:D10 


UC2-NormColon 
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ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


925 


647333 


1708.F19.gz43_217105 


F 


M00055977A:G09 


UC2-NormColon 


926 


647058 


1708.A21.gz43 217132 


F 


M00055970CA05 


UC2-NormColon 

x ^ vyx xxiv^v^xvsxx 


927 


643723 i 


1708.F21.gz43 217137 


F 


M00055977B:F06 


UC2-NormColon 


928 


639359 


1708.N21.gz43 217145 


F 


M00055988AE05 


UC2-NormColon 


929 


647539 i 


1708H23ez43 217171 


F 


M00055980AH06 

x t xv v v/ fc/ *s v vx a*a jjlx/ x/ 


UC2-NormColon 


930 


647211 


1708C24sz43 217182 


F 


M00055973CF10 


UC2-NormColon 

W V^Xm X 1 V/X 111 WvXVU 


931 


650018 


1708 J24 ez43 217189 


F 


M00055982CA12 

A FA W X/ *J **J XJ X^ »Ajh, X ^* 


UC2-NormColon 

v*' x ^ vyx xxx^rv/xvyxx 


932 


645848 


1708.K24.gz43 217190 


F 


M00055984A:F05 


UC2-NormColon 


933 


646212 


1708.P24.gz43_2 17195 


F 


M00055991C:H11 


UC2-NormColon 


934 


406436 


1588.N01.gz43 217262 


F 


M00055553A.D08 


UC2-NormColon 


935 


553651 


1588.E02.gz43 217269 


F 


M00055542C:H05 


UC2-NormColon 


936 


466092 


1588.H02.gz43 217272 


F 


M00055546B:H03 


UC2-NormColon 


937 


645891 


1588.N02.gz43 217278 


F 


M00055553A:F11 


UC2-NormColon 


938 


649390 


1588A03ez43 217281 


F 


M00055536B:H11 


UC2-NormColon 


939 


448358 

1 ■ \J +J s 


1588H04az43 217304 


F 


M00055546C:B01 


UC2-NormColon 


940 


650982 


1588.I04.gz43 217305 


F 


M00055547C:B07 


UC2-NormColon 


941 


644012 


1588.O05.gz43 217327 


F 


M00055554C:B04 


UC2-NormColon 


942 


649099 


1588P06ez43 217344 

X X/X/ • A V/ V/ ig^T*/ X* X # I F 


F 


M00055556AE07 


UC2-NormColon 

Wxv X ^ V/X All x>wAwXX 


943 


551662 


1588.M07.gz43 217357 


F 


M00055551C:C08 


UC2-NormColon 


944 


639316 


1588.K08.gz43_2 17371 


F 


M00055549C:E10 


UC2-NormColon 


945 


639480 


1588J09.gz43 217386 


F 


M00055548B:G06 


UC2-NormColon 


946 


1318 


1588Allffz43 217409 

X *— ' V v-7 « A A X X J~l* — X / TV/ — ^ 


F 


M00055537CA01 


UC2-NormColon 


947 


457146 


1588.Lll.gz43 217420 


F 


M00055550D:D02 


UC2-NonnColon 


948 


451671 


1588 A15 ez43 217473 


F 


M00055537C:E04 


UC2-NormColon 


949 


641401 


1588 115 ez43 217481 


F 


M00055547DG06 


UC2-NormColon 

W*» X 1 VA AAA VVVAVAl 


950 


453606 


1588D17ez43 217508 


F 


M00055542BB11 


UC2-NormColon 

V-/X< X ^ V/A 111 V/ V/l V/ll 


951 


642414 


1588G17ez43 217511 


F 


. M00055545D C03 


UC2-NormColon 

X v V/X AilVvAV/li 


952 


649564 


1588H18ez43 217528 


F 


M00055547AH1 1 

If jLvv \J w/ i # JL X • X X X X 


UC2-NonnColon 

V_y V/m X 1 V/A AA A WAV/AA 


953 


529799 


1588 118 ez43 217529 


F 


M00055548AD07 


UC2-NormColon 

V-/ V*^ A ^ V/A AAA V/\/ AwAl 


954 


645781 


1588N18ffz43 217534 


F 


M00055553DE06 


UC2-NormColon 


955 


447379 


1588 M20 cz43 217565 


F 


M00055552CG03 


UC2-NormColon 


956 


639461 


1588A21ez43 217569 


F 


M00055538BG09 


UC2-NormColon 

X-/ X— ^^rf X T VA A A Aw V/AVAA 


957 


451361 


1588D21ez43 217572 


F 


M00055542CB02 


UC2-NormColon 


958 


449891 


1588 122 ez43 217593 


F 


M00055548A-F04 


UC2-NorxnColon 


959 


639395 


1588 022 ez43 217599 


F 


M00055555D-B05 


UC2-NormColon 

V/ V^X* J. ^ V/X UlWlvll 


960 


644173 


1 588.F23 .gz43_2 17606 


F 


M00055545A.C01 


UC2-NormColon 


961 


648494 ' 


1588.H24.gz43j2 17624 


F 


M00055547B:G09 


UC2-NormColOxi 


962 


477064 


1588.P24.gz43_217632 


F 


M00055557B:B10 


UC2-NormColon 


963 


535436 


1729.L02.gz43_217660 


F 


M00056092B:D10 


UC2-NormColon 


964 


650900 


! 1729.B04.gz43_217682 


F 


M00056078B:G02 


UC2-NomaColon 


965 


649149 


! 1729.D04.gz43_2 17684 


F 


M00056081A:C05 


UC2-NormColon 


966 


644054 


1729.K04.gz43__217691 


F 


M00056091A:E04 


UC2-NormColon 
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SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T j 


CLONE ID 


LIBRARY 


967 


649082 


1729.P04.gz43_J217696 


F 


M00056097B:C04 


UC2-NonnColon 


968 


454176 


1729.J05.gz43_217706 


F 


M00056089D:A09 


UC2-NormColon 


969 


448485 


1729.A06.gz43_217713 


F 


M00056077B:E03 


UC2-NonnColon 


970 


643440 


1729.N06.gz43_2 17726 


F 


M00056095B:A07 


UC2-NormColon 


971 


649356 


1729J08.gz43_217754 


F 


M00056089D:E07 


UC2-NormColon 


972 


710155 


1729.L08.gz43_217756 


F 


M00056092C:E12 


UC2-NormColon 


973 


560054 


1729.Ell.gz43_217797 


F 


M00056082A:E12 


UC2-NormColon 


974 


644712 


1729.Kll.gz43_2 17803 


F 


M00056091C:C06 


UC2-NormColon 


975 


646696 


1729.A12.gz43_217809 


F 


M00056077C:H02 


UC2-NormColon 


976 


644715 


1729.H12.gz43_217816 


F 


M00056087B:G12 


UC2-NormColon 


977 


644965 


1729.N12.gz43_2 17822 


F 


M00056095C:E02 


UC2-NorniColon 


978 


645050 


1 729 .Kl 3 .gz43_2 1 7835 


F 


M00056091C:D09 


UC2-NormColon 


979 


649514 


1729 .N14.gz43_2 17854 


F 


M00056095C:G07 


UC2-NormColon 


980 


570939 


1729.E15.gz43_217861 


F 


M00056082B:G07 


UC2-NonnColon 


981 

X 


647437 


1729.F 1 6.gz43_2 1 7878 


F 


M00056083D-.D11 


UC2-NonnColon 


08? 


650547 


1729 .L16.gz43_2 17884 


F 


M00056093A-.B12 


UC2-NomiColon 


983 


649054 


1729.P16.gz43_2 17888 


F 


M00056098B:C04 


UC2-NormColon 


984 1 


645409 


1729K17.gz43 217899 


F 


M00056091D:C12 


UC2-NormColon 


985 


613722 


1729.B18.gz43 217906 


F 


M00056079A:B01 


UC2-NoimColori 


986 


418763 


1729.L18.gz43 217916 


F 


M00056093A-.F08 


UC2-NormColon 


987 


619635 


1729.K20.gz43_2 1 7947 


F 


M00056091D-.E10 


UC2-NormColon 


| 988 


645131 


1729.L20.gz43 217948 


F 


M00056093B:D03 


UC2-NonnColon 


989 


451899 


1 729.M22 .gz43_2 1 798 1 


F 


M00056095AC02 


UC2-NonnColon 


990 


597647 


1729.P22.gz43_2 17984 


F 


M00056098DA08 


UC2-NormColon 


991 


560860 


17 1 7 .F0 1 .gz43_2 1 8406 


F 


M00055998B:D06 


UC2-NonnColon 


992 


640709 


17 1 7.P0 1 .gz43_2 18416 


F 


M00056009D:C12 


UC2-NormColon 


993 


447802 


| 1717.O02.gz43 218431 


F 


M00056008B:G05 


UC2-NormColon 


994 


641736 


1717.103.gz43_218441 


F 


M00056001D:B06 


UC2-NoimColon 


995 


464498 


1717.J03.gz43_2 1 8442 


F 


M00056003B:G11 


UC2-NormColon 


996 


452775 


1717K03.gz43 218443 


F 


M00056004D:F03 


UC2-NormColon 


997 


561836 


| 1717.L03.gz43 218444 


F 


M00056005C:F11 


UC2-NormColon 


998 


444454 


1717.K04.gz43 218459 


F 


M00056004D:F12 


UC2-NormColon 


999 

-7 ^ J7 


639178 


17 17.O04.gz43_2 1 8463 


F 


M00056008C:D04 


UC2-NonnColon 


1000 

M.\J\J\J 


513888 


1717F05gz43 218470 


F 


M00055998C:D04 


UC2-NoraiColon 


1001 

±\J\J x 


64^971 


1717 105 sz43 218473 


F 


M00056001D:G12 


UC2-NormColon 


! 1002 


644824 


1717.J05.gz43 218474 


F 


M00056003C:C09 


UC2-NomiColon 


1003 


448418 


1717.N05.gz43 218478 


F 


M00056007CA03 


UC2-NormColon 


1004 


| 644030 


1717.O05.gz43 218479 


F 


M00056008C:D08 


UC2-NormColon 


1005 


644972 


1717.P05.gz43 218480 


1 F 


M00056010AE05 


UC2-NonnColon 


1006 


649556 


1717.B06.gz43 218482 


F 


M00055993D:F12 


UC2-NonnColon 


1007 


644611 


1717.I06.gz43 218489 


F 


M00056002AA03 


UC2-NormColon 


1008 


645162 


1717.J07.gz43 218506 


1 F 


M00056003C:H10 


UC2-NonnColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


1009 


470462 


1717L07ez43 218508 


F 


M00056005DC04 




1010 


645662 


1717A08ez43 218513 


F 


M00055992AD08 


T TP 9 -Norm C nl on 


1011 


649259 


1717.H08.gz43 218520 


F 


M00056000CD09 


UC2-NormColon 


1012 


650018 


1717D09gz43 218532 


F 


M00055996C-A02 


UC2-NormColon 


1013 


640645 


1717.D10.gz43 218548 


F 


M00055996C:A08 


UC2-NormColon 


1014 


643477 


1717.M10.gz43_218557 


F 


M00056006D:E03 


UC2-NormColon 


1015 


541214 


1717.ElLgz43 218565 


F 


M00055997D:A11 


UC2-NoimColon 


1016 


651083 


1717.Mll.gz43 218573 


F 


M00056006D:F05 


UC2-NormColon 


1017 


549889 


1717.G12.gz43 218583 


F 


M00055999DC05 


UC2-NormColon 

V^r X ^ V^JI liAV/\/X\/XX 


1018 


558856 


1717D13sz43 ^18596 


F 


M00055996DD07 


UC2-NormColon 

\J \*s MVJiill V>WlWll 


1019 


644022 


1717N13ez43 218606 

X # X'.X^XaJ. *"* 


F 


M00056007DF07 


UC2-NormColon 

»— ' V> ^# X ^1 Ul XXX WlWll 


1020 


644612 


1717 013 ez43 218607 


F 


M00056008D-D02 


UC2-NonnColon 


1021 


644949 


1717 P13 ez43 218608 


F 


M000560 1 1 AC1 1 


UC2-NormColon 


1022 


643808 


1717C14ez43 218611 


F I 


M00055995AG09 


UC2-NormColon 

U V/A> X^l V7XAXJLV^WXVXXX 


1023 


642260 


1717 014 cz43 218623 


F 


M00056008DD03 


UC2-NonnColon 


1024 


651119 


1717B15ez43 218626 


F 


M00055994BB12 

ITlvv T X-/ j X^* 


UC2-NomiColon 


1025 


477521 


1717 C15 ffz43 218627 


F 


M00055995BA02 


UC2-NormColon 

V7 1 > Wl 1 1 1 V-/UIWU 


1026 


587106 


1717D15ffz43 218628 


F 


M00055996D*GK)2 


UC2-NonnColon 


1027 


419443 

"T J. 7 I I J 


1717 115 ffz43 218633 


F 


M00056002DE09 


UC2-NormColon 

VJ V>/^< X 1 V/X XXXVjVJXV/XX 


1098 




1717L15az43 218636 


F 

X 


M00056006AE04 


VJ V/A X > VIX X1XV> v/xv/xx 


1029 


555736 


1717 M16 ffz43 218653 


F 


M00056007AB02 

1T1VV/V/J VVU # ZV.XJWZ< 


UC2-NormColon 

X^I\JXXXXV^VJXV/XX 






1717G17ffz43 218663 

X / X / . VJ X / .££ZjT J a- X OUUJ 


F 

X 


M00055999D-H04 


VJ V-/ ^> X l Wl 111W1VJ11 


1031 


449261 


1717 117 ez43 218665 

x / x # .x x / »ji - x vjvjvtw' 


F 


M00056003AA08 


UC2-NoimColon 

VJ V^^« X ^ Vl IIIV^VIVI X 


1032 


643498 


1717J17ez43 218666 


F 


M00056004BF04 


UC2-NormColon 


1033 


517274 


1717P17ez43 218672 


F 


M0005601 1BA06 


UC2-NonnColon 


1034 


532307 


1717L18ez43 218684 


F 


M00056006AG09 


UC2-NormColon 


1035 


643288 


1717A19sz43 218689 


F 


M00055993AE02 


UC2-NormColon 


1036 


478025 


1717F19ez43 218694 


F 


M00055999AA04 


UC2-NormColon 

V-/ X_J-A*# X l V/l 111\/VA wAA 


1037 


447822 


1717G19sz43 218695 


F 


M00056000AB02 


UC2-NoiitiColon 


1038 


641580 


1717 119 ez43 218697 


F 


M00056003A-B07 


UC2-NormColon 


1039 


489426 


1717 P19 ez43 218704 

X / X / .XT 17.g^*TJ X> X <J # \/~T 


F 


M0005601 1BE10 

lYlw w w^ ww 1 V 


UC2-NormColon 


1040 


650534 

VJ*JV/*J«J*T 


1717C20ffz43 218707 


F 


M00055995CF05 


UC2-NormColon 

VJ V^Z* X V*l XllV^r VJXV/XX 


1041 

IvT X 


644990 


1717E20ffz43 218709 


F 

X 


M00055998AF06 

1V1UUUJJ770A.1 WW 


UC2-NormColon 

VV/Xf X l Wl 111 V/V/1WU 


1047 

X V/*T^» 


207530 


1717F20az43 218710 

X / X # .X ^U.^ZitJ Z» X O / XV/ 


F 

X 


M00055999AB07 


UC2-NormColon 

VJ v^z* X ^ V/l 1X1V^V/XV/XX 


1043 


645197 


1717 120 sz43 218713 


F 


M00056003 AGO 1 


UC2-NormColon 


1044 


414739 


1717.K20.gz43_218715 


F 


M00O56005B:HO8 


UC2-NormColon 


1045 


592941 


1717.L20.gz43_218716 


F 


M00056006B:B05 


UC2-.NormColon 


1046 


447556 


1717.H21.gz43_218728 


F 


M00056001C:F07 


UC2-NormColon 


1047 


238586 


1717.N21.gz43_218734 


F 


M00056008B:E04 


UC2-NonnColoii 


1048 


644612 


1717.E22.gz43_218741 


F 


M00055998B:A04 


UC2-NonnColon 


1049 


646387 


1717.F22.gz43 218742 


F 


M00055999A:H10 


UC2-NonnColon 


I 1050 


647443 


1717.G22.gz43_218743 


F 


M00056000A:E11 


UC2-NomiColon 
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SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


1051 


555172 


1717.P22.gz43_J2 18752 


F 


M00056011C:D04 


UC2-NormColon 


1052 


552846 


1 7 1 7.D23 .gz43_2 1 8756 


F 


M00055997A:F04 


UC2-NormColon 


1053 


639377 


17 17 .B24.gz43 2 1 8770 


F 


M00055994C:F05 


UC2-NormColon 


1054 


449713 


1 7 1 7.M24.gz43_2 1 878 1 


F ! 


M00056007B:C10 


UC2-NormColon 


1055 


456627 


1717.N24.gz43_2 18782 


F 


M00056008B:G01 


UC2-NormColon 


1056 


646143 


1717.024.gz43_2 1 8783 


F 


M00056009D:A02 


UC2-NormColon 


1057 


639629 


1 7 1 8.H03 .gz43_2 1 8827 


F 


M00056021C:H08 


UC2-NormColon 


1058 


643744 


171 8.D04.gz43_2 1 8839 


F 


M00056016C:F11 


UC2-NormColon 


1059 


397399 


1718.P04.gz43_218851 


F 


M00056032B:H08 


UC2-NonnColon 


1060 


650391 


1718.G06.gz43_218874 


F 


M00056020C-.H03 


UC2-NormColon 


1061 


447659 


1718.N06.gz43_218881 


F 


M00056029D:D08 


UC2-NormColon 


1062 


644967 


1718.D07.gz43_2 18887 


F 


M00056016D:E02 


UC2-NormColon 


1063 


545509 


1718.G08.gz43_218906 


F 


M00056020D:D03 


UC2-NonnColon 


1064 


643975 


1718.D09.gz43 218919 


F 


M00056017A:B08 


UC2-NormColon 


1065 


645883 


1718.Mll.gz43_218960 


F 


M00056028D:F06 


UC2-NonnColon 


1066 


538808 


1 7 1 8 .B 1 2.gz43_2 1 8965 


F 


M00056013A.C09 


UC2-NormColon 


1067 


425455 


1718.P13.gz43_218995 


F 


M00056033AC11 


UC2-NormColon 


1068 


467057 


1718.M20.gz43_219104 


F 


M00056029B:H03 


UC2-NormColon 


1069 


640181 


1718.B21.gz43_219109 


F 


M00056013D:D07 


UC2-NormColon 


1070 


650348 


1718.F23.gz43 219145 


F 


M00056020B-.C09 


UC2-NormColon 


1071 


451993 


1718.P24.gz43_219171 


F 


M00056033C:H09 


UC2-NormColon 


1072 


647976 


1719.E01.gz43_219176 


F 


M00056039A:F04 


UC2-NormColon 


1073 


557676 


1719.101.gz43_219180 


F 


M00056045A:D03 


UC2-NormColon 


1074 


644639 


1719.N01.gz43_219185 


F 


M00056050C:A09 


UC2-NormColon 


1075 


644738 


1719J02.gz43_219197 


F 


M00056046AB04 


UC2-NormColon 


1076 


645973 


1719 . A03 .gz43_2 1 9204 


F 


M00056033D:F06 


UC2-NonnColon 


1077 


550376 


1719.B03.gz43_219205 


F 


M00056035BA.04 


UC2-NormColon 


1078 


646088 


1 7 19.D03 .gz43_2 1 9207 


F 


M00056037B:G02 


UC2-NonnColon 


1079 


506920 


1719.P03.gz43_219219 


F 


M00056053A:F01 


UC2-NormColon 


1080 


642198 


1719.A04.gz43_219220 


F 


M00056033D-.G07 


UC2-NonnColon 


1081 


649293 


1719.E04.gz43_219224 


F 


M00056039B:C03 


UC2-NonnColon 


1082 


641193 


17 19.H04.gz43_2 19227 


F 


M00056043D:E03 


UC2-NormColon 


1083 


461517 


1719J04.gz43_219229 


F 


M00056046A:B12 


UC2-NormColon 


1084 


645070 


1719.K04.gz43_2 19230 


F 


M00056046D:C11 


UC2-NormColon 


1085 


645942 


1719M04gz43 219232 


F 


M00056049B:E11 


UC2-NonnColon 


1086 


645151 


1719.O05.gz43_219250 


F 


M00056051DJ102 


UC2-NonnColon 


1087 


566745 


1719.B06.gz43_2 19253 


F 


M00056035B:D11 


UC2-NormColon 


1088 


642184 


1719.M06.gz43_219264 


F 


M00056049C:A08 


UC2-NormColon 


1089 


649429 


1719.O06.gz43_219266 


F 


M00056052AA11 


UC2-NormColon 


1090 


549786 


1719.B07.gz43 219269 


F 


M00056035B-.E10 


UC2-NormColon 


1091 


452102 


1719.J07.gz43_2 19277 


F 


M00056O46A:GO8 


UC2-NonnColon 


1092 


644819 


1719.O07.gz43_219282 


F 


M00056052AC07 


UC2-NormColon 
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SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 

T | 


CLONE ID 


LIBRARY 


1093 


453804 


1719 E08 ez43 219288 


X 


MOOO*>6039C -D05 


T TP9-T\InrmPr»1r»n 


1094 


650348 


1719K09ez43 219310 


F 

X 


M000S6047AF09 


T Tf*9 JMnrmr^nlnri 
U OiUlv^OiOxL 


1095 


644894 


1719 L09 ez43 219311 

X / X 7 +A~I\J *S .jrr t r >T~J X 77 X X 


i 


M0OOS604RAD19 


T TP9_"Mr»nnr^n1r»n 
O IwZ, -IN Ollilv^Olun 


1096 


646048 


1719 A10 ez43 219316 


i 


M000S60^4A-f*Tin 

\XX\J\J\J *j V7Vy*7*TX^.. VJ 1V7 


T TP9 JMnrtnPnlnn 


1097 


638912 


1719DlO0z43 219319 


F 

X 


Monns60^7rrri i 


T Tr ,| 9JMr»rmPn1r»Ti 


1098 


644817 


1719 F10 0z43 219321 

X / 17,1 ll/.gZi'TJ Z<17J^il 


F 
i 


Mnnnsfi04 1 tvfoi 

lYMAJyj \JJ \j\J*t JL MJ JT WO 


vj v-r^"i>i uiiiiv^oion 


1099 


645973 


1719 Ell &z43 219336 

X / X 7 . J— * X X ■^Zili * Xf X 7a7«7 IS 


F ! 

X 


1V1V7 17 V/«7 W J 7 V> . VJ v/7 


T Tr , 9-.'MnrmPr»lriTi 

U V>^.~1N Ul HI V^VJlUil 


1100 


643748 


1719 Mil &z43 219344 

X/ 17.1T11 l.gLTJ £(17J~" 


F 


M00056049P H04 


vj -in ui in V/ uivjii 


1101 


649259 


1719 Nil 0z43 219345 

X r 17.ll X X . X 77 I 7 


F 


1VXWV7 VyJ V7V/«7 Xxtl. V-/V7 J 


VJ v- Zr— IN C/l III V^UKJIl 


1102 


642852 


1719 E12 0^43 219^59 


F 
i 


IVlUUl/JUuJ-'V'.Xlv X 


T TP 9 ->JrkrmPr»1 rtn 
VJ VvZr— IN QI III v^OlOU 


1103 


451544 


1719 L12 ez43 219359 


F 




T TP9 -NTormPnlnn 
VJ v^a»— IN UIlIlv^wlUII 


1104 


528616 


1719N12ez43 219361 


F 


M0005605 1 A-Hl 1 

lVlv w v/ J V/ V J 1/V.lll X 


1 TP 9 -Norm P r»1 on 

w XN VI lllXw'VJIUll 


1105 


460727 


1719 G13 erz43 219370 


F 


M00056043A-A09 


T TP9-MnrmPn1rm 


1106 


468689 


1719C14ez43 219382 


F 


M00056036DB06 

ItIv/V/Vi^/ W/»7 \J X*t . XJ\J\J 


T IP9-MnrmPn1nn 

W V/Ar"l > CI 1X1 V-y VJ1U1X 


1107 


452729 


1719 C15 ez43 219398 

X # 1/iW1v/.KIjtJ -4^X77 7U 


F 


M00056036DD01 

J-tXV7 \J\JmJ \J\J~J VJX7 .1/U X 


1 TP9 -MnrmPrilon 

17 Vs4& — X^l d 111 V^*JA\J1X 


1108 


481614 


1719D15ffz43 219399 


F 


M0005603 8B -G08 


T TP 9 -TsTnrm Pol nn 

U V^ iw 1 t| V7l lHV_yUl VJl 1 


1109 


649900 


1719 HI 5 ez43 219403 


F 


M00056044C-C0 1 


T TP9 -"MnrmPnlnti 

Uv>L, 1 > V/l 111 V>\Jiwll 


1110 


645375 


1719N15ffz43 219409 

X / l^il l l«/«K&iT>7 ^»X7~V/7 


F 

X 


M00056051B-B03 


T TP9-lMnrmPo1oTi 


1111 

X X X X 


497101 

"T7 # x\j x 


1719 P15 0^43 219411 

X / 17.1 J- 7 .^Z-, k T^7 A,17tl X 


F 
i 


MOOft ^>60S^Tj T>07 

AVXWWW J\JV7_/«7XJ .X7W / 


T TP 9 -Norm P ol nn 
VJ V/>£— IN <JI HlVvOiUIl 


1112 

X X xz* 


556325 


1719 Ct16 0z43 219418 

1/17 . VJ 1 VJ ,g^,*T^ X, X 7*T X O 


F 

X | 


M000S604^ A Kl 9 

1VXV/\JW-71JVJt , ~7^V.XXXZ # 


T TP 9 -Nnrm Pol r»n 
-IN VJl III vAJlUIl 


1113 


645431 


1 7 1 9 Ml 6 0743 21 9424 

1 / 17. 1VX X v * ^ 1 7"X»" 


F 
i 


IVlV vu J UU^7i/ .i/V J 


TTP9JMormPo1on 
VJ V^Z, - 1N Ullllv^t/lOIl 


1114 


452325 


1719 A17 &z43 219428 

X / 17.A1 / . c£.ZjT7 iil7t^O 


F 


M00056034C-D07 


T TP9-ATormPo1on 

VJ Vv^— IN OI IIlv^ LJHJI1 


1115 


447645 


1719 B17 0z43 219429 

1 / 17.19 X I J il7til7 


F 

i 


lVlWV/J V/x/J J xy . V-/ V/ o 


T TP9 «T\I orm P ol on 
vj v,^— in vji in vvOiuix 


1116 

X X X\J 


644054 


1719 T17 0743 9194^0 

1 / I7,ul /.gLTJ Ail7*tJu 


F 
i 


1VX W/V/ J U V7«7 OA*/ . 17 V/J 


T TP9 JWr»rmPo1rm 


1117 


641715 


1719 M17 0743 219440 

X / 17.1VXX / .gyCsT^J 4rl7lTU 


F 
i 


M000560491> F0 1 

XVXWWVJ JWV/*T7X7.X-fV/ X 


T TP 9 -Nnrm Pol on 

VJ V>Z»~"1N Ol II IV^OlOIl 


1118 

X X lw 


644692 


1719P17 0z43 219443 

X # 17.1 X / .gi<TJ 4.17TTJ 


F 

X 


Mn00S60 4 S4 A ROfi 

XVxVy V/Vl J V> VI *7 L T^Y . Dvu 


T TP9 -Nnrm P ol on 
VJ Z,— IN Ol lllv^OlOll 


1119 

1117 


465589 


1 71 0 PI 8 074^ 9 1 Q45Q 

1 / 17.110^^ AA7tJ7 


F 

r 


Mnfln^6ft^4 A nno 


T TP9 -MnrmPnlnn 


1 170 

X 1Z.U 


fwttR97 

UtJ07 / 


171Q Cx\Q vrrAI 910466 

X / I7.\jl7.g/.T , J Z. 1 7 t TV7VJ 


F 


lVlV/\y\J .JOVJHOij .I2AJ D 


T TP9 JsJnrmrrvl r\n 
Uv/Z -IN OlIIlv-zOlOIl 


1121 

X XJL, X 




1710TlQa74^ 910471 


F 


A/f nno ^ An4K"n -rho 


T TP9-KTnmiPr*1on 
VJ Vv^-xN omiv^oion 


1122 




1 71 Q P 1 Q 074^ 9 1 Q47S 

X / 17.1 X 7 .gitJ Z. 177 / 7 


F 

JT 


lvlUUU^QUjH'A.iiv/j 


TTP9 >JrkrmPnlr*n 
VJ V^Z-iN 0llIlV_^O10Il 


1123 


645530 


1719 P20 0743 919478 


F 

1 


MOOO^O^A-TjI 1 


T TP9 -"MormPolon 
vj v^x. -in oi mv^oioii 


1124 


648390 


1719 A21 0743 219492 


F 
i 


M000^60^4r , H[09 

lVlUV/V/JUV/J'tV-'.l Xv/7 


T TP9 -Norm P ol on 

VJ \sZ. -IN Ol XllVw'OlOXl 


1125 


467597 


1719 B21 ez43 219493 

X # 17.17Ll.gLTJ L X 7™ 77 


F 


M00056035Tj-niO 


T TP9 -Norm Pol on 

VJ VL XN Vji- illV/VJlUll 


1126 


468330 


1719 D21 0743 219495 

X / 17,L/Ll,gZjtJ L17t7J 


F 
i 


IVlV/UvfJUvJOL/.l 


T TP 9 -"Morm P o! on 

VJ V_/Z» IN OI 111 V^OlOIl 


1127 


454438 


1719 E21 0z43 219496 

X * 17.JjL1.^LtJ L17T7V 


F 
i 


M000^f>04nr -B03 

X vxvj \j \j j v/vfrv* Vw . XJ \j «J 


T TP9 -TMormPolon 


1128 


402476 


17 19.12 l.gz43_2 19500 


F 


M00056045D:C09 


UC2-NonnColon 


1129 


558439 


1719.M21.gz43_219504 


F 


M00056050A:D02 


UC2-NormColon 


1130 


595066 


1719.N21.gz43_219505 


F 


M00056051C:C09 


UC2-NormColon 


1131 


646687 


1719.E22.gz43_219512 


F 


M00056040C:C12 


UC2-NonnColon 


1132 


452026 


1719.H22.gz43_219515 


F 


M00056044D:F01 


UC2-NormColon 


1133 


639255 


1719.K22.gz43_2195 18 


F 


M00056047C:E03 


UC2-NonnColon 


1134 


612572 


1719.L22.gz43_219519 


F 


M00056049A:C11 


UC2-NontiColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


1135 


641786 


1719.022.gz43_219522 


F 


M00056052D:G01 


UC2-NormColon 


1136 


647306 


1719.P22.gz43_219523 


F 


M00056054A:G07 


UC2-NonnColon 


1137 


640559 


1719.A23.gz43_219524 


F 


M00056034D:E09 


UC2-NormColon 


1138 


550365 


1719.D23.gz43_219527 


F 


M00056039A:A02 


UC2-NonnColon 


1139 


462293 


1719.123.gz43_219532 


F 


M00056045DE06 


UC2-NormColon 


1140 


587106 


1719.N23.gz43_219537 


F 


M00056051C:H09 


UC2-NonnColon 


1141 


483266 


1719E24.gz43_219545 


F 


M00056042B:B05 


UC2-NormColon 


1142 


490898 


1719.G24.gz43_219546 


F 


M00056043C:G03 


UC2-NormColon 


1143 


640603 


1719.N24.gz43_219553 


F 


M00056051D:A07 


UC2-NormColon 


1144 


643510 


1720.G01.gz43_219562 


F 


M00056062D:F12 


UC2-NormColon 


1145 


645049 


1720.A02.gz43_219572 


F 


M00056054B:G05 


UC2-NormColon 


1146 


466795 


1720.C02.gz43_219574 


F 


M00056057B:D01 


UC2-NormColon 


1147 


14157 


1720.102.gz43_219580 


F 


M00056066B:H04 


UC2-NormColon 


1148 


558494 


1720. A04.gz43_2 19604 


F 


M00056054C:C09 


UC2-NormColon 


1149 


644927 


1720.C04.gz43_219606 


F 


M00056057B:E12 


UC2-NormColon 


1150 


648688 


1720.C05 .gz43_2 19622 


F 


M00056057C:B02 


UC2-NormColon 


1151 


454906 


1720.A06.gz43_219636 


F 


M00056054C:E12 


UC2-NormColon 


1152 


644597 


1720.D06.gz43_2 19639 


F 


M00056059A:A02 


UC2-NormColon 


1153 


460047 


1720.K06.gz43_219646 


F 


M00056069A:F01 


UC2-NontiColon 


1154 


62053 


1720.C08.gz43_219670 


F 


M00056057C-.D06 


UC2-NormColon 


1155 


649054 


1720.D08.gz43_219671 


F 


M00056059A:B01 


UC2-NormColon 


1156 


644225 


1720.K08.gz43_219678 


F 


M00056069A:F06 


UC2-NormColon 


1157 


447635 


1720.C09.gz43_2 19686 


F 


M00056057C:E01 


UC2-NonnColon 


1158 


642166 


1720.E10.gz43_219704 


F 


M00056060D:H11 


UC2-NormColon 


1159 


473007 


1720.Dll.gz43_219719 


F 


M00056059B.D10 


UC2-NormColon 


1160 


551181 


1720.M 1 1 .gz43_2 1 9728 


F 


M00056072B:C06 


UC2-NormColon 


1161 


644928 


1720.C12.gz43_219734 


! F 


M00056057C:E12 


UC2-NotmColon 


1162 


644916 


1720.E12.gz43_219736 


F 


M00056061A:E05 


UC2-NormColon 


1163 


644701 


1720.L12.gz43_219743 


F 


M00056070C:B09 


UC2-NormColon 


1164 


487567 


1720.O12.gz43_219746 


F 


M00056074C:H08 


UC2-NonnColon 


1165 


641801 


1720.E13.gz43_219752 


F 


M00056061B:A03 


UC2-NonnColon 


1166 


651108 


1720.O13.gz43_219762 


F 


M00056074C:H09 


UC2-NotmColon 


1167 


| 447676 


1720.A14.gz43_2 19764 


F 


M00056055A:E04 


UC2-NormColon 


1168 


645076 


1720.K14.gz43_2 19774 


F 


M00056069BE04 


UC2-NormColon 


1169 


641469 


1720.C15.gz43_219782 


i F 


M00056057D:E11 


UC2-NormColon 


1170 


375577 


1720.G15.gz43_219786 


F 


M00056063C:F10 


UC2-NormColon 


1171 


558154 


1720.C16.gz43_219798 


F 


M00056058A:B07 


UC2-NormColoa 


1172 


650549 


1720.G16.gz43_219802 


F 


M00056063C:G11 


UC2-NoimColon 


1173 


461351 


1720.116.gz43_219804 


F 


M00056066D:G04 


UC2-NormColon 


1174 


640818 


1720.A18.gz43_219828 


F 


M00056055B:B06 


UC2-NormColon 


1175 


455716 


1720.11 8.gz43_2 19836 


F 


M00056067AA12 


UC2-NormColon 


1176 


644822 


1720J18.gz43_219837 


F 


M00056068DA06 


UC2-NonnColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


OWEN 
T 


CLONE ID 


LIBRARY 


1 177 

XX// 


467780 


1720 C19 ez43 219846 


F 


M00056058AF08 


UC2-NormColon 


1 178 

11 / o 


446415 


1720 F20 ez43 219865 


F 


M00056062D-D08 


UC2-NonnColon 

Vv/m x i vixiXw v/xv/xx 


1 17Q 

x x / ~ 


553993 


1720 H20 az43 219867 


F 


M00056066AD03 


UC2-NormColon 

V_/ v_/-^— • x ^ v/x xjlx%^/v/xv/xx 


1 1 80 


645 Q00 

V/*T^/ 7V/V 


1720121 £z43 219884 


F 


M00056067BD08 


UC2-NormColon 


1 181 


640364 

VT7JVT 


1720 M21 ez43 219888 


F 


M00056072DE08 


UC2-NormColon 


1189 


643348 


1720N23ez43 219921 


F 


M00056074BD 12 


UC2-NormColon 


1 183 


648654 


1731 A01 oz43 219940 


F 


M00056120CH04 


UC2-NormColon 


1 1 84 

X X 0*T 


648547 


1731 C01 ffz43 219942 


F 


M00056123BG05 


UC2-NoimColon 

V/ v/iv X 1 Villi V^vxvxa 


1 IBS 

X lOJ 


463894 


1731 101 ez43 219948 

1 / J 1 .XV/ X • n 


F 


M00056133DD09 


UC2-NormColoii 


1 1 86 

X X OvJ 


648791 


1731 N01 ffz43 219953 


F 


M00056 140CE04 


UC2-NormColon 


1 187 

X X o / 


648481 


1731 O01 cz43 219954 


F 


M00056142A:F03 ! 


UC2-NormColon 


1 188 

X X oo 


644017 


1731A02ez43 219956 


F 


M00056120DF01 


UC2-NormColon 


1 189 

X 107 


635354 


1731 H02 ez43 219963 


F 


M00056132BC12 


UC2-NormColon 


i ion 

X xzf\J 


71 1403 


1731 T09 <r743 910965 


F 


M00056 1 34D *G07 


UC2-NormColon 


1 1 Ol 


648749 


1 73 1 T 09 <*743 9 1 9067 


F 


M00056137DH05 


UC2-NonnColon 

W wAf X i Vl *11 V/V* Vll 


1 1 Q9 


476380 


1731 F03 az43 910977 

1 / Jl ,X^ uJ .g4it«7 A X / / 


F 


M00056129CF09 


UC2-NormColon 


1 1 Q3 
x iyo 


640617 


1731 103 <T743 910081 

X I J X.J uJ .gZrt J X 770 X 


F 

X 


M00056 134DG1 1 

XVXV/ \J\J -~J \J X »^~J-/ . VJ X X 


UC2-NormColon 


1 1 OA 


6^Q^5 


1 73 1 R04 a743 9 1 0080 


F 

XT 


M00056122B-A07 

i v i""" hiiaax/, n * * / 


Ur2-NormColon 


1 1 


46-5197 
*fOJ iz / 


1731 H04 a^43 910001 


J? 

X 


M00056 1 2513 F0 1 

J. VXV/ V/uJUl ^>«/ X-/ . X V/ X 


UC2-NormColon 


1 1 Q6 




1 73 1 1 04 ar43 9 1 0000 

X / J X.X*AJ*+.gw£*T«7 ^17777 


X 


M00056138AB1 1 


UC2-NormColon 


1 1Q7 


453796 


1731 (t05 cr^43 990010 


X 


M0005 613 1 A * E09 


UC2-NormColon 


1 1 Q£ 


557096 


1731 TT06 <*743 990097 


F 

X 


M00056132CF04 

XVXV/V/V/»/V/ 1 JZrVr.l V/T 


UC2-NoimColon 


1 1 QQ 


499949 


1731 D07 az43 990039 


F 

X 


M00056127AA10 


UC2-NonnColon 


i 9no 


456Q85 


1731 F07 CT743 990040 


F 

X 


M00056128BA07 


UC2-NonnColon 


1901 


640605 


1 73 1 T07 <rz43 920044 


F 


M00056134AD04 


UC2-NoimColon 


1 909 


69751 5 


1731 R08 (7743 990053 


F 


M00056122B-G09 


UC2-NormColon 

V*' ^ta/A^ X ^ X^X XXX V/VlVAl 


1903 


648341 


1 73 1 M08 <*743 920064 


F 

X 


M0005 6 1 3 9D E05 

XVXV/V/ V/ X «✓ ^l/.-I^Vk/ 


UC2-NormColon 


19H4 


464000 


\ 1731 A 00 <7743 990068 


F 


M00056 12 1 AE05 

XYXV/v/V/«^V/ X ^# lxi.XJU J 


UC2-NormColon 


1 9H5 


450591 


1731 Tl 0(7743 990003 

X / Jl.J X V/.eyfci*T J £*£*\J\JZ7J 


F 

X 


M00056135BE03 

XVXv/ \J\J*s \J X-J -s X\J , x^i\j 


UC2-NormColon 




540601 


1731 1,10 oz43 220095 


F 


M0005 6 1 3 8B F05 


UC2-NormColon 


1907 


514838 


1731 M10az43 220096 

X / J X «XtX X V/ igfa~J £*&f\J\J^\J 


F 


M0005 6 13 9DH04 


UC2-NormColon 


1908 


459738 


1731 O10 ffz43 220098 


F 


M00056142CA09 


UC2-NormColon 


19H0 


644098 


1731 nil a743 990103 

1 / J 1 .X-/ X X . i^Z/TJ Z*£*\f X V/-7 


F 

X 


M00056127A-H03 

XVXV/V/V/— ' V/ X ^> / xx^X iv./ 


UC2-NormColon 


1910 
1Z1U 


460 1 Q0 


1731 R19 a743 990117 

1 / Jl .D14..g6tJ jLZ*\J x X / 


F 

x^ 


M0005 6 1 22C-H12 


UC2-NormColon 


xZx 1 


^69378 


1731 019 (7743 990199 


X 


MO0056131B-E01 

i VXV/ V» V/ VJ X -J X X-/ .J-/V X 


UC2-NormColon 


1212 


647949 


1 173 1 P 12.gz43_22013 1 


F 


M00056144D:C05 


UC2-NormColon 


1213 


557419 


1731.B13.gz43__220133 


F 


M00056122D:B07 


UC2-NormColon 


1214 


638934 


1731.D13.gz43_220135 


F 


M00056127B-.E11 


UC2-NormColon 


1215 


646785 


1731.F13.gz43_220137 


F 


M00056130B:E11 


UC2-NormColon 


1216 


647248 


1731.E14.gz43_220152 


F 


M00056128C:F02 


UC2-NonnColon 


1217 


470006 


1731.114.gz43_220156 


F 


M00056134C.C06 


UC2-NormColon 


1218 


648311 


1731.K14.gz43_220158 


F 


M00056137B:F07 


UC2-NonnColon 
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SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


1219 


649299 ! 


1 73 1 .F 1 5 .gz43_220 1 69 


F 


M00056130C:C12 


UC2-NormColon 


1220 


478393 


1731.K15.gz43 220174 


F 


M00056137B:F11 I 


UC2-NormColon 


\TL\ 


647290 


1731.F16.gz43 220185 


F 


M00056130C-.F08 


UC2-NormColon 


1222 


649735 


1731.G16.gz43 220186 


F 


M00056131C:H03 


UC2-NonnColon 


1223 


646039 


1731.J16.gz43 220189 


F 


M00056135C:C05 


UC2-NormColon 


1224 


646581 


1 73 1 .M 1 6.gz43_220 1 92 


F 


M00056140A:F12 


UC2-NormColon 


1225 


470667 


1731.N16.gz43 220193 


F 


M00056141C:H01 


UC2-NormColon 


1226 


642650 


1 73 1J 1 7 .gz43_220205 


F 


M00056135C:C08 


UC2-NormColon 


1227 


642275 


1731.K17.gz43 220206 


F 


M00056137C:A01 


UC2-NormColon 


1228 


651015 


1731 .L17.gz43_220207 


F 


M00056138D:G08 


UC2-NormColon 


1229 


642936 


1 73 1 .HI 8 .gz43_2202 1 9 


F 


M00056133B:F01 


UC2-NormColon 


1230 


642204 


1731.A19.gz43_220228 


F 


M00056121D:A12 


UC2-NormColon 


1231 


557867 


1731.J19.gz43__220237 


F 


M00056135C:H06 


UC2-NonnColon 


1232 


639395 


1731.K19.gz43 220238 


F 


M00056137C:E01 


UC2-NonnColon 


1233 


645367 


173 1 .K20.gz43_220254 


F 


M00056137C:G02 


UC2-NormColon 


1234 


647232 


173 1 .L20.gz43_220255 


F 


M00056139A:D12 


UC2-NoraiColon 




64743 1 


173LE21.gz43 220264 


F 


M00056129AG01 


UC2-NormColon 


1236 


708175 


1731.I21.gz43 220268 


F 


M00056134D.B07 


UC2-NormColon 


1737 


503122 


173 1 .A22.gz43_220276 


F 


M00056122A:A05 


UC2-NormColon 


1238 


515350 


1731.C22.gz43 220278 


F 


M00056124D:F06 


UC2-NormColon 


1239 


556336 


173LF22.gz43_220281 | 


F 


M00056130D:E06 


UC2-NonnColon 


1240 


589098 

-J yj *s \J *s \j 


1731A23az43 220292 


F 


M00056122A:B05 


UC2-NormColon 


1241 


531461 


173 1 .B23 .gz43 J220293 


F 


M00056123B:F02 


UC2-NonnColon 


1247 


455808 


1731 C23 sz43 220294 


F 


M00056124D:G03 


UC2-NonnColon 


1243 


643510 


1731.D23.gz43 220295 


F 


M00056127D:G10 


UC2-NonnColon 


1244 


648996 


1731.J23.gz43 220301 


F 


M00056136AB11 


UC2-NormColon 


1245 


446164 


1731 N23 ez43 220305 


F 


M00056141D:E08 


UC2-NoimColon 


1246 


419114 


1731 023 ez43 220306 . 


F 


M00056143A:H08 


UC2-NormColon 


1247 


648076 


1731.A24.gz43_220308 


F 


M00056122AD02 


UC2-NonnColon 


1248 


649482 


173 1 .124.gz43_2203 1 6 


F 


M00056134D:F02 


UC2-NormColon 


1249 


646323 


1732.H01.gz43_220331 


F 


M00056155B:A02 


UC2-NormColon 


1250 


550714 


1732.A05 .gz43_220388 


F 


M00056146AA09 


UC2-NonnColon 


1251 


649482 


1732.J05.gz43_220397 


F 


M00056158C:D11 


UC2-NormColon 


1252 


650987 


! 1732J06.gz43_220413 


I F 


M00056158C:F09 


UC2-NormColon 


1253 


644221 


1732.F07.gz43_220425 


F 


M00056153A:G04 


UC2-NoraiColcai 


1254 


647412 


1732.M07.gz43j220432 


F 


M00056162C:F02 


UC2-NormColon 


1255 


416624 


1732.108.gz43_220444 


F 


M00056157A-.B11 


UC2-NonnColon 


1256 


644801 


1732.F09.gz43 220457 


F 


M00056153A:H07 


UC2-NormColon 


1257 


650249 


! 1732.G10.gz43_220474 


F 


M00056154B:F11 


UC2-NonnColon 


1258 


465734 


1732.Cll.gz43_220486 


F 


M00056149CA02 


UC2-NormColon 


1259 


605761 


1732.Ell.gz43_220488 


F 


M00056152A:B11 


UC2-NormColon 


1260 


| 649782 


1732.Mll.gz43_220496 


F 


M00056162DA01 


UC2-NormColon 
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INSOOCID. <WO 0214500A2_L> 



WO 02/14500 



PCT/US01/25840 



Table 2 



SEQ 
ID NO 


CLUSTER 


SEONAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


1261 


4499 J O 


1 /32.A12.gz43_220500 


T? 

r 


A/TAAAC/ll A/Z A .T_TAA 

M00056146A:xl09 


TTA1 "Mr , , i ■ , n 1 

U C2-JN ormColon 


1 OilT 


642244 


1732.112.gz43_22050o 


r 


A/TAAAC^ 1 CTXJ. A AO 

M00056157B:A08 


UC2-NormColon 


1263 


650345 


1 TO O T 1 O A O OlA< 1 1 

1732.Lrl2.gz43_2205 1 1 


r 


A /TA A AC 1 1 /^.i^»A/C 

M00056161C:Cj06 


U C2-JN ormColon 


1 T<£ A 

1264 


yj/- 1 IOC 

461135 


1 732 .£ 1 3 ,gz43_220520 


T7 

F 


A jfAAAC/^ 1 CO A .T\AT 

M00056152A:D07 


U C2- JN ormColon 


1265 


1 ZTAAO 

16092 


1 732.L 13 .gz43_220527 


F 


TViTAAAC^ 1 £L 1 A.U1 A 

M00056161C:rllO 


UC2-JNormColon 


i o 

1266 


650116 


1 732 . M 1 3 .gz43_2205 2 8 


F 


tv jCAAAC/C 1 /COT\.T\AO 

M00056162D:D03 


TT/^O ALl.uliAjlLl.i 


1 T^T 

1267 


650534 


7 TOO nil 10AC5A 

1 73 2 .O 1 3 .gz43_2205 3 0 


r 


A/TAAAC/C 1 iCC/^.O 1 1 

M00056165C:.bll 


U C2-1N ormlJoloii 


1 T zTO 

1268 


/T /I Ai£"0 A 

640634 


i too a i a /io ooacoo 

1 732 . A 1 4.gz43_220532 


r 


A iTAAACZT 1 A £Ti .T?(\C 

M00056 146B:jb05 


U C2-1N ormColon 


1269 


A A HI OA 

446184 


1TOO Y1A*~ A 1 OO AC A A 

1 732.1 14.gz43_220540 


F 


TV X A A AC £ 1 gnn.TTA/r 

M00056157B:H06 


U C2-iNonnUolon 


•% TTA 

1270 


380477 


1 TOO ~\A1A ~ A OOAC/1/1 

1 7 3 2 .Ml 4 .gz43_220544 


x? 
r 


A>f AAAC/C 1 /:0"r\.T\A/l 

M00056 1 62D:D06 


UC2-lNorm\Joion 


1271 


463966 


1 TOO ^„/lO 

1 732 . ivl 5 .gz43_22055o 


T7 

r 


M00056160C:r 12 


u i^z-in otmv^oion 


1 OTO 

1272 


649578 


1TOO AK >t o OOAC/^O 

1732.O15.gz43_220562 


r 


A AAAC/C 1 /CCA.AAO 

M0005 6 1 65D :D09 


U C2-1N ormColon 


1273 


650877 


1 TOO /"** 1 ylO OO AC/I^T 

1732.C16.gz43_220566 


r 


AjTAAAC/CI /I A/^.T7 1 A 

M00056149C:lilO 


U C2-JNormColon 


1274 


449882 


1 TO O T7 1 ^ a 1 OOAClTA 

1732.F16.gz43_220569 


r 


AvfAAAC/CI C *3TD .T7 1 1 

M00056 153 J3:r 1 1 ; 


u C2-JN ormColon 


1275 


650909 


1TOO ~ yi"J OO AC TO 

1732.J 16.gz43 220573 


T7 

r 


A jTAAAC/C 1 CA A .EAO 

M0005 6 1 5 9 A:h.03 


u L.2-JN ormuoion 


1 OT/" 

1276 


f A St f\f\ 

645100 


1 TOO 1 /T ~. ylO OOACT/f 

1732.K16.gz43_220574 


r 


AVTAAAC/C 1 £LC\(~^ .t~±.r\ 1 

M00056 160C ajO 1 


U C2-i\ormUoloii 


vm 


649883 


1 TOO A AT 1 T - >IO OTACrfcO 

1 732 .M 1 7 .gz43 2205 92 


f ; 


A A" A A AC 1 /TO A ."D 1 A 

M0005 6 1 63 A:B 1 0 


f T/^O TS-T .l.T./" 1 - In,, 

U C2-JN ormColon 


i no 

1278 


554581 


1 TOT T1 A jIO OOAjTOA 

1 732.1 1 9.gz43_220620 


r 


M00056157C:£10 


U C2-JN onnColon 


i o Tr\ 

1279 


467972 


1 TOT T 1 A O OOA/COO 

1732.JL/19.gz43_220623 


Jr 


ATAAAC/C 1 /CO A ."DA/C 

M00056162A:l>0o 


u L/2-JN ormcoion 


1280 


>• O O C AO 

488592 


1 TOO A OA - /I *5 OOACOO 

1732 . A20 .gz43 22062 8 


r 


A/f AAAC/Cl ACT\*T\C\A 

M00056 146U:D04 


uc2-jfNorniv^olon 


1281 


651075 


1 TOO OA — A O OOA./TOO 

1732.E20.gz43J220632 


F 


a/taaac/c i <or.nAo 
M00056 152C:Ci0o 


UC2-1N otmColon 


1282 


649991 


1 TOT V T/\ /f O OTA^OO 

1732 .K20.gz43_22063 8 


F 


A jTAAAC/C 1 /CAT\.PA/C 

M00056 160D:C06 


U C2-JN ormColon 


1283 


649947 


1TOO T *%/\ itO OOZ\yOA 

1732.L20.gz43_220639 


T7 

F 


a a r\r\f\c i zro a .PAn 

M0005 6 1 62A: C09 


U C2-IN ormColon 


1 T O A 

1284 


224092 


"1 TO O VTAA A O OOA^Al 

1732.N20.gz43_220641 


F 


A/TAAAC£1 £/l A .T-IAO 

M00056 1 64A:ii03 


u C2-JN onriL/Oion 


1285 


555634 


1 TOO T Ol _ yiO OO/X^^C 

1 732 .L2 1 .gz43_220655 


F 


A fAAAC^ 1 ZTO A .T7AA 

M00056162A:b09 


U C2-JN ormColon 


1286 


639485 


1732.C22.gz43_220662 


F 


M00056 149D:F06 


UC2-JN ormColon 


1287 


646596 


1TOO TOO yf 00/\^Oyl 

1732.123 .gz43_220684 


F 


XJAAAC^I CTTX-TJAO 

M0O056157D±I08 


U C2-JN ormColon 


1288 


459260 


i tot too yii nn/"of 

1732.J23.gz43_220685 


F 


M00056159C:C1 1 


U C2-INormColon 


1289 


634409 


1 TOT T OO /« O OOA^OT 

1732.Lr23.gz43_220687 


F 


A/1AAAC/C1 /CO A .T7A1 

M0005 6 1 62 Air 0 1 


TT/^O Al^^«-.A rt 1^« 


1290 


650820 


i too /*>a >i yio Tin zrr\ /t 

1732.C24.gz43_220694 


ii T7 

F 


A if AAAC/C 1 C A A -H? A/1 

M0005 615 0 A:E04 


U sJJ. -IN ormColon 


i t a i 

1291 


£ f\*> Af\ 

559349 


tv k r\ i ^. ai oo/\TOn 

1741 .M0 1 .gz43 220720 


F 


A if A A AC 1 O /t PtPArf 

M00056 1 84C:C07 


U JN ormUolon 


i tat 
1292 


650063 


1 ft A ■% T>/\0 OOATOT 

1741 .D02.gz43 220727 


F 


A /f A AAC/C 1 T1 A AO 

M00056171C:A0o 


UC2-JNonnColon 


1293 


492982 


1T>I1 T/\T >IO OOZ\TOO 

1 74 1 . J02.gz43_220733 


F 


M00056 1 80B:D05 


U C2-rsormColon 


1294 


^ jTTO E o 

557353 


1 A t T^"/\0 yi O OT/\TO j! 

1 74 1 .K02 .gz43_220734 


T? 

F 


A JAAAv /T 1 OO A .T3A>I 

M00056 1 82A:B04 


UC2-1N ormColon 


1 T A£T 

1295 


649299 


IT/11 /"*»/"\0 ^ ilO OT/\Tviy 

1 74 1 .G03 .gz43_220746 


F 


A if AA AC/T 1 TCTA.T7AC 

M00056175D:b05 


UC2-lNormColon 






T7HA cttA.'X OOmAt 
I ft l.X*U*t.gZ*f D^LZA) /Ox 


r 




T TP 9 ->J Arm Pol nn 


1297 


641193 


1741.J04.gz43_220765 


i F 


i M00056180B:F11 


UC2-NonnColon 


1298 


561856 


1741 .L04.gz43_220767 


F 


M00056183B-.G11 


UC2-NormColon 


1299 


642631 


1741.M04.gz43_220768 


F 


M00056184C:E08 


UC2-NormColon 


1300 


648344 


1741.P04.gz43_220771 


F 


M00056188B:E07 


UC2-NormColon 


1301 


458085 


1741.B05.gz43_220773 


F 


M00056169C:F07 


UC2-NormColon 


1302 


394772 


1741.C06.gz43_220790 


F 


M00056170C:C09 


UC2-NormColon 
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3NSDOCID: <WO 0214500A2_I_> 



WO 02/14500 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


1303 


646699 


174LE07.gz43_220808 


F 


M00056173D:C05 


UC2-NonnColon 


1304 


593173 


1741.G07.gz43_220810 


F 


M00056176A:D06 


UC2-NormColon 


1305 


643206 


174 1 .K07 .gz43_2208 14 


F 


M00056182A:G11 


UC2-NormColon 


1306 


646309 


1741 .M07.gz43_2208 16 


F 


M00056184C:H03 


UC2-NormColon 


1307 


553023 


1 74 1 .N07 .gz43_2208 1 7 


F 


M00056185D:C01 


UC2-NormColon 


1308 


649705 


1741.O07.gz43_220818 


F 


M00056186D:G05 


UC2-NormColon 


1309 


555771 


1 74 1 .B08 .gz43_22082 1 


F 


M00056169D:A03 


UC2-NormColon 


1310 


379805 


1741.108.gz43_220828 


F 


M00056179A:D12 


UC2-NormColon 


1311 


710362 


174 1 .L08 .gz43_22083 1 


F 


M00056183C:F01 


UC2-NonnColon 


1312 


650188 


1741.P08.gz43_220835 


F 


M00056188C:D02 


UC2-NormColon 


1313 


649795 


1741.A09.gz43_220836 


F 


M00056168CA05 


UC2-NormColon 


1314 


460506 


174 1 .K09 ,gz43_220846 


F 


M00056182B-.G12 


UC2-NormColon 


1315 


502683 


1 74 1 .N09 .gz43_220849 


F 


M00056185D:D06 


UC2-NormColon 


1316 


453768 


1741.F 1 0.gz43_22085 7 


F 


M00056175AA06 


UC2-NormColon 


1317 


457272 


1741.G10.gz43_220858 


F 


M00056176B:B05 


UC2-NormColon 


1318 


378694 


1741.J10.gz43 220861 


F 


M00056180D:E06 


UC2-NormColon 


1319 


649136 


1 74 1 .F 1 1 .gz43_220873 


F 


M00056175AC04 


UC2-NormColon 


1320 


555655 


1741 .Ml 1 .gz43_220880 


F 


M00056184D:D06 


UC2-NormColon 


1321 


648777 


1 74 1 .B 12.gz43_220885 


F 


M00056169D:D05 


UC2-NonnColon 


1322 


446289 


1 74 1 .Gl 2 .gz43_220890 


F 


M00056176B:E10 


UC2-NormColon 


1323 


449452 


1741J12.gz43_220893 


F 


M00056180D:F05 


UC2-NormColon 


1324 


648665 


1741.P12.gz43_220899 


F 


M00056188CJI04 


UC2-NormColon 


1325 


561747 


1 74 1 .D 1 4 .gz43_2209 1 9 


F 


M00056172AH06 


UC2-NormColon 


1326 


646915 


1741.E14.gz43_220920 


F 


M00056174B:D02 


UC2-NonnColon 


1327 


648865 


174LG14.gz43_220922 


F 


M00056176C:B11 


UC2-NormColon 


1328 


639886 


174LI14.gz43J220924 


F 


M00056179B:H12 


UC2-NonnColon 


1329 


183842 


174 1 . J14.gz43_220925 


F 


M00056180D:G07 


UC2-NonnColon 


1330 


469409 


1741 .K14.gz43_220926 


F 


M00056182C:F08 


UC2-NormColon 


1331 


649702 


1 74 1 .L 1 4 .gz43_220927 


F 


M00056184AB11 


UC2-NormColon 


1332 


646581 


1 74 1 .N 14.gz43 220929 


F 


M00056186AB09 


UC2-NonnColon 


1333 


638730 


1741 .H15 .gz43_220939 


F 


M00056178BA11 


UC2-NormColon 


1334 


641525 


i 1741.K15.gz43_220942 


F 


M00056182DA07 


UC2-NormColon 


1335 


650800 


i 1741.B16.gz43 220949 


F 


M00056169D:H05 


UC2-NormColon 


1336 


647086 


1741 E16 gz43 220952 


F 


M00056174B:E02 


UC2-NoraiColon 


1337 


471836 


\ 1741 J16 az43 220957 


F 


M00056181A.B05 


UC2-NormColon 


1338 


640285 


j 1741.E17.gz43_220968 


F 


M00056174B:E04 


UC2-NonnColon 


1339 


459923 


1741.H17.gz43_220971 


F 


M00056178B:H08 


UC2-NormColon 


1340 


452729 


1741.J17.gz43_220973 


F 


M00056181AE07 


UC2-NormColon 


1341 


209414 


1741.H18.gz43_220987 


F 


M00056178C:C02 


UC2-NonnColon 


1342 


463487 


1741.L18.gz43_220991 


F 


M00056184B:D08 


UC2-NonnColon 


1343 


645288 


1741.F19.gz43_221001 


F 


M00056175CA10 


UC2-NonnColon 


1344 


640514 


1741.J19.gz43_221005 


F 


M00056181B:B02 


UC2-NonnColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


1345 


459974 


1741 Kl 9 ez43 221006 


F 


M00056183A:B09 


UC2-NoimColon 


1346 


465594 


1741P19ffz43 221011 

X / "T 1 • X 17 — X\J X X 


F 


M00056189B:A01 


UC2-NormColon 


1347 


646173 


1741 C20 ez43 221014 


F 


M00056171A:H01 


UC2-NormColon 


134K 


557010 


1741 E20 ez43 221016 


F 


M00056174B:H11 


UC2-NomaColon 


1349 


650146 


1741 F20 £z43 221017 


F 


M00056175C:B11 


UC2-NormColon 


1350 

M.DD vr 


499690 


1741 G20 ez43 221018 


F 


M00056177A:D12 


UC2-NoniiColoii 


1351 


5X4745 


1741 H20 ez43 221019 


F 


M00056178C:E12 


UC2-NormColon 


1359 


446987 


1741 L20 ez43 221023 


F 


M00056184B:H08 


UC2-NormColon 


1353 


T'JOUOJ 


1741 A21 ez43 221028 


F 


M00056169B:C02 


UC2-NormColon 


1 354 

X33*t 


640992 


1741 H21 ffz43 221035 

X / » X .X 1 r X « ft ft X \J ^ 


F 


M00056178C.F02 


UC2-NoraiColon 


1355 
X33 D 


561 793 


1741 J21 ffz43 221037 


F 


M00056181B:E11 


UC2-NonnColon 


1356 


641945 


1741 021 2z43 221042 


F 


M00056188A:E05 


UC2-NonnColon 


1357 
13.} / 


645578 


1741 J22 ffz43 221053 


F 


M00056181C.D02 


UC2-NonnColoii 


lJJO 


6476 88 


1741 T 29 o-t43 221055 


F 


M00056184C:A09 


UC2-NormColon 


135Q 


610986 


1741 G23 oz43 221066 


F 


M00056177A:H01 


UC2-NomiColoxi 


i 


644H53 1 
vrrtl/JJ 


1741 T93 ot43 221068 


F 


M00056180B:A06 


UC2-NormColon 


1 ^£1 


6301 39 

03~X3^ 


1741 M93 o*743 221079 


F 


M00056185B:B06 


UC2-NormColon 


13DZ 


649956 


1741 T394 a^43 221077 


F 


M00056170BB09 


UC2-NormColon 


13£3 


648890 


1741F24az43 221081 


F 


M00056175DA06 


UC2-NormColon 




650075 


1741 T 94 a^43 221087 


F 


M00056 1 84C C03 


UC2-NormColon 


X 300 


£/lzL&01 
0*f*toUx 


1741 TM94 o-z43 991089 

1 /*Tl.X^Zr*T.git*T3 lvOy 


F 

X 


M00056 1 86BH09 


UC2-NonnColon 


1300 


03 / do 1 


1749^01 ot43 921098 


F 

X 


M00056 1 98 AB05 


UC2-NormColon 


1 30/ 


OH J. 1 lo 


1749 P01 cr743 991 107 


F 


M00056212CC09 


UC2-NonnColon 


130o 


^07363 


1 749 F03 ay43 991199 


F 


M00056196CB03 


UC2-NonTiColon 


1 

i joy 


46^ 1 68 
*f03 lOO 


1749 T03 ay43 221 133 


F 


M00056203DF05 


UC2-NonnColon 


13 /U 


179843 


1749 M03 a743 221 136 

X / Hi. ,XVX\J3 .gZitJ Z-Z, X X D \J 


F 


M00056208BC10 


UC2-NormColon 


13 / 1 


570030 


1749 T04 <*743 221 149 


F 


M00056203D:H03 


UC2-NormColon 


1379 


463545 


1742 L04 ffz43 221151 


F 


M00056206D:F07 


UC2-NormColon 


1 ^73 
13 /3 


456090 


1749 T05 **743 221165 


F 


M00056204A:B08 


UC2-NormColon 


1 374 


644916 


1 742 F06 ^74^ 22 1 1 76 


F 


M00056195C:F12 


UC2-NormColon 


1 11^ 

ID ID 


550055 


1749 M06 ot43 921 184 


F 


M00056208CA10 


UC2-NormColon 


ID /O 




1749 R08 ot43 921905 


F 


M00056191BE08 


UC2-NoraiColon 


ID 1 1 


£4001 3 


1749 T08 cr^43 991919 


F 

X 


M00056201CC03 


UC2-NormColon 


13 /O 


648 5 OO 


1749 F00 o-74^ 921925 
1 /T-z.r u7.j5z < tj zz x zzj 


F 


M00056196CG12 


UC2-NonnColon 


13 /y 


*f /UoU 1 


1 749 1NI00 ot43 99 1 933 


F 

X 


M00056210BC11 


UC2-NormColon 


1380 


645223 


1742.M1 l.gz43^221264 


F 


M00056208D:A12 


UC2-NormColon 


1381 


649402 


1742.E12.gz43„221272 


F 


M00056195D:D07 


UC2-NormColon 


1382 


649354 


1742.N12.gz43 221281 


F 


M00056210B:E03 


UC2-NormColon 


1383 


650756 


1742.K15.gz43 221326 


F 


M00056205D:G04 


UC2-NormColon 


1384 


648754 


1742.J18.gz43_221373 


F 


M00056204C:H12 


UC2-NormColon 


1385 


645177 


1742.K19.gz43 221390 


F 


| M00056206BA10 


UC2-NoimColoii 


1386 


507050 


1742.M19.gz43_221392 


F 


M00056209B:D02 


UC2-NormColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


1 ^£7 
loo / 


646975 


174?P19(*743 291395 


X 


M00056213C"C01 


UC2-NormColon 

v/v/£> 1 1 wi iu\> vtvii 


1 QRR 


^7A1 66 


1 743 H0 1 o743 ?? 1 483 


F 

X 


M00056223 A*D 1 0 


UC2-NormColon 


1 QfiO 
1 


♦ 6AQ70Q 
04V / 


1 743 FT03 cy-743 991515 


17 


M00056293A-H07 


1 11^9 -NormCol on 

Uv^r X ^( V7X AllV_/V7i.V/i.X 




6AR1 55 


1743 P04 07AI 991596 




M00056217A-H11 


UC2-NormColon 


1 'SOI 


57B71 51 


1 743 K04 07A3 99 1 53 A 


F 

X 


M00056227D-A02 


UC2-NormColon 


1 3Q9 


6A9979 


1 743 "N04 CT743 991 537 


■ X 


M00056232CE06 


UC2-NonnColon 


1 ^Q3 


6AR351 

DtOJJ 1 


1 743 Tj05 ot-43 99 1 543 


X 


M0005 62 1 8C* C02 


UC2-NormColoii 




61 8670 


1743 T06 0743 991564 


F 


M00056225BA1 1 


UC2-NormColbn 




7067A5 


1 743 P07 »743 99 1 587 


F 

X 


M0005 623 6A*G 1 2 


UC2-NormColon 


1 *5Q6 


OjOOu / 


1743 A 08 0743 291588 


F 

X 


M000562 14B H03 


UC2-NormColon 


1 ^Q7 


569359 


1743 CO 9 **743 991606 


F 


M00056217BG03 


UC2-NoimColon 




447899 


1743 K09 k43 221614 


F 


M00056228A:A02 


UC2-NormColon 




507A13 


1743>J09a743 991617 


F 

X 


M00056232D*C08 


UC2-NoraiColon 


1 AOO 


A99517 


1743 A10o743 991620 


F 


M00056214C-A04 


UC2-NormColon 


i Am 

l*tUl 


570057 


1743 fVIl <*743 991642 


F 

X 


M00056222AE05 

ITlV U V«/ V^MM X»A^W»^ 


UC2-NormColon 


1 A07 
1HUZ 


6AQ351 


1743 Ml 1 0743 991648 


F 

X 


M00056231BE01 


UC2-NonnColon 


1 AO^ 


650Q75 


1743 A1 6o743 991716 


F 

X 


M00056214DF05 

ITXv vV^V/M X iX/*X VSw* 


UC2-NormColon 


1 /I HA 
14U*t 


567675 


1 7A3 P 1 6 ot43 991718 

1 /H > 3.\^lU.gZ*t3 ZZ1 / AO 


T? 

.r 


M00056217DB07 


UC2-NormColon 


14U3 


044y ID 


1 7A3 171 6 cr«yA3 99 1 790 
1 /'fO.xilD.gZ'tJ ZZ1 /ZU 


F 


M00056220A-H04 


UC2-NormColon 

^/ Z— X l V/A lllV/Vlvll 


i /in£ 
14Uo 


DD / JO 


17A3 T 16 <*tA3 971797 


X 


M00056229CF05 


UC2-NormColon 


1 A (VI 

I4U/ 


6A55AQ 
043 3 4V 


1 7A3 HI 7 cfyA3 99 1 739 
1 /*r3.ni / .gZ*iO ZZ1 


IT 


M00056224BFO 1 ' 


UC2-NormColon 


14Uo 


43 145 / 


17A3 1M71 »^A3 991808 
1 / *K> .1V1Z 1 .gZT-->_ZZ 1 OUO 




M00056232B* Al 1 


UC2-NoimColon 


1 A AO 


AAQ7QO 
445 /5U 


17A^ P77 crrrA'X 99 181 A 
1 / 4 L/ZZ. gZ4 O _ZZ lolH 


F 


M0005621 8A-G1 1 


UC2-NormColon 


141U 


0455 ID 


17A^ n77 o-rA3 991896 
1 /4j.V^ZZ.gZ^3 ZZIOZO 


F 


M00056235A-C12 


UC2-NormColon 


1 A 1 1 
141 1 


O45yoy 


17A3 rx93 cr7A3 991834 
1 /*t j.vjzo.gz*TO ^iojt 


F 


M00056223 A'B 1 1 


UC2-NormColon 


1 A 1 7 

141Z 


A5^A70 
4304 /U 


1743 H93 cr-743 99 1 835 


1 F 

X 


M00056224DE08 


UC2-NormColon 


1 A1 *\ 
141 j 


6AQ030 


1 743 M93 cr^43 991 840 


F 

X 


M00056232BC04 


UC2-NonnColon 


1 A 1 A 
1414 


AA7^A7 
44ZJ4 / 


1 7A3 fr94 <y743 99 1 850 


F 

X 


M00056223A:C03 


UC2-NonnColon 


1 A1 ^ 
1413 


6A5771 
D43Z / 1 


17AA001 rr7-A3 991874 


F 

X 


M00056256AC02 


UC2-NormColon 


1/116 
1410 


6A5S77 
0435 /Z 


1744 T09 <y-743 991885 


F 

X 


M00056249A*H07 


UC2-NonnColon 


1 All 
141 / 


651 051 
03 1U3 1 


1 7AA TSJ09 o-tA3 99 1 889 


F 

X 


M00056254AF04 


UC2-NormColon 


1 Al 8 
1415 


650A63 
03U4OJ 


1 7AA F03 o743 991 897 


F 

X 


M00056244AA0 1 

lVlVV/V J UiiTTXX,JtXV X 


UC2-NormColon 


1 A1 O 

i4iy 


50671 0 

3uo / iy 


1 744 T03 cr743 99 1 901 


F 

X 


M00056249BC02 


UC2-NormColon 


14ZU 


045^5y 


1 7AA "EOA n-7A3 99 1019 
1 /44.£iU4.gZ < f j ZZ171Z 


F 
x^ 


M00056242DA02 


UC2-NormColon 


1 AO 1 
14Z1 




1 7AA A06 <yyA3 99 1 940 


F 

X 


M00056237BD04 

iviuuvy j / xj»x/v/t^ 


UC2-NormColon 


1422 


650661 


1744.K07.gz43_221966 


F 


M00056250C.D05 


UC2-NonnColon 


1423 


419255 


1744.H08.gz43_221979 


F 


M00056246CG07 


UC2-NormColon 


1424 


648996 


1744.M08.gz43_ 221984 


F 


M00056253A:F12 


UC2-NoraiColon 


1425 


451361 


1744.N09.gz43 22200 1 


F 


M00056254D:C04 


UC2-NormColon 


1426 


15296 


1744.A10.gz43_222004 


F 


M00056237C:C05 


UC2-NonnColon 


1427 


! 649396 


1744.C10.gz43 222006 


F 


M00056240B:F08 


UC2-NormColon 


1428 


650860 


1744.J10.gz43_222013 


F 


M00056249C:E06 


UC2-NormColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEONAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


1 A7Q 


AA7.077 

0433Z / 


17AA /I n^Al 777 A7£ 
1 /4^.JVl £ KgZ4.3__ZZZU /o 


P 


IVIUUUJOZ J 1X>./\1Z 


T T07 -."KTr*Tinr n r*lr»ri 
U V/Z"IN OlIIlV^OlOIl 


143U 


o3yi iy 


17*AA A/fIA n*yA1 777 Aft A 
1 /44 JM14.gZ43_ZZZUoU 


r 


A/rnnn 5 A7 5 ^ "R -mha 


T T/^7 "\T/MtnPn1r<r» 


1 A O 1 

1431 


/I A HIT*) 

44/2 /2 


1*7/14. P 1 ^ rvrr/I O 777AS/C 

1 /44.L,10.gZ43_2220o6 


17 

r 


IVIUUU30ZHUX^.xjLU / 


%j \»/Z-iN ormi^oion 


1 /I07 


4jZo4o 


11 A A 1^1^ rrrr/lO 077A0A 

I /44.J\.13.gZ*tJ__2Z2Uy4 


Jr 


A/ffinn5^75 1 "R«"RA7 
IVXUUUjOZj 1D.DU / 


U L/Z -IN Olllll^OiOIl 


1/10*2 

1433 


63 /3o / 


17/f/l nil r*-,A1 OOOl 
1 /44.VJ1 /.gZ43_222122 


r 


\/TAnn^^7A/^ A *T7ni 


T T/^O \T/\rtviP/\lAn 

u \^z-in oriii^oioii 


1 /1 O A 

1434 


lo2yoi 


1*7 A A TJT1 7 r*rr,A1 OOO TOO 

1 /44.rll /.gZ43_222123 


r 




u i^z -in ormi^oxon 


1435 


/I C'TFO1 1 

45 7221 


17/1/1 XJ1 Q ~.~A 1 T70T2A 

1 /44.rllo.gZ43_222139 


r 


Ayrnnn^^7A7 a 'TynT 


U v> Z-IN OrmV^OlOIl 


1 Alii 

1436 


7A07 1 7 


1 /44.1vllo.gZ43_2ZZ144 


17 

r 


1VLU U U-5 OZD j \^ . \^U4 


TT^*7 ATArmPnlAti 

u i^z-in orm^oion 


i a in 
143 / 


559324 


1 7/1 A "ATI Q ArrrAO 077 1 A^ 

1 /44.JN lo.gZ43_Z2Z14D 


r 


A4"nnn 5 #^7 5 5 "R • T?no 

1Y1UUU3 OZ -) . X* U7 


u v^z-xn ormv^oion 


1 A 'JO 

1438 


70 /4o3 


17/|yi A 1A /y«/l^ OOO 1 /I Q 

1 /44.A19.gZ43_22214o 


■p 


JYIUUUD 0Z3 5A.AUj 


1TP7 Mrti-rnrV.Irm 


\ Aid 

I43y 


aha am 
464091 


17/1/1 D1Q 1 777 1 /CO 

1 / 44. It 19.gZ43_2221o3 


r 


A/innn^A75ft A • A 1 1 


1 TP* 7 7\Ir»t-n-i rs\ r\r» 

u v^z-in oirnv^oioii 


1 A A€\ 

1440 


6454 /2 


17/1/1 XT7A r»*»AO 777177 
1 /44.JN20.gZ43_Z2Zl / / 


T? 

r 


AAnnn5£755r , -r , nA 

1V1UU KJD OZjjV/ . \^U*t 


T Tr**7 JWrvrm C* a1 rtn 


1 A A 1 

1441 


6500 lo 


17/1/1 TT7 1 /vpt/12 777 157 

1 / 44 .1x2 1 .gZ43_222 1 o / 


T? 

r 


A/f nnn 5 A7ATR-r*i i 


u -in orrnU/Oion 


1 A A 1 

1442 


6510/3 


17/1/1 1777 /*^/12 7777AA 

1 /44.Jc,22.gZ43_Z222UU 


17 

r 


Ayrnnn^^7A^r**n-i n 




1443 


£LA 1 OT/I 

6418/4 


17/1/1 A/170 *v*»/10 77777/1 

1 /44.JV123.gZ43_222224 


r 


A/fnnn^^7 5^ri»r^nA 

IVIUU UD OZ D j U . VJrv4 


T TP* 7 .Mrtrm P 1 rJ ah 


1444 


1 A7 /CO 

610269 


1 7AA "D70 <y^/I2 077777 
1 /44..rZ3.gZ43_ZZZZZ / 


X* 


AAfinn.5A75£ A*T71 H 


T TP*7 -NnrmPnl ati 


1 /I /I c 

1443 


/CCA/I zTQ 

650463 


1 7/1/1 177 A rr^-AO 007700 
1 /44.r 24. gZ43_22225 3 


T7 

r 




1 TP , 7_KTAT , TV*lP*n1 A« 


i /i a a 
1446 


/-f AA1 O 

650919 


17*AA T7A rr^AO 7777^^ 
1 /44.1Z4.gZ43__ZZZZ3o 


r 


Avfnnn5£7AQA*171 1 


1 TP^ 7 JMnrm C*ri\r\n 


l a ah 
1447 


586794 


11 A A Tf)A crwAI 70772Q 
1 /44.iS k Z4.gZ43_Z2ZZ3o 


p» 
r 


AvrnnfK£75 1 p-mha 
iviuuu d oz .xiuh- 


1 TP 7 JM/xrm P*rVI An 


1 A AO 

1446 


642200 


170 TZ"A7 77777A 

1/53 . JsAJ2 .gZ43_ZZZZ / U 


T7 

r 




T TP 7 Jvlnrm P a1 An 


1 A ACS 

1449 


64527 1 


1 7<ZO PAQ ^._/| o 077770 

1 /53.L/03.gZ43_2222/o 


17 
r 


A>rnnn5^7^i A*1717 

!VlUUU30Z01/\.Jr 1Z 


1 TP7 JWnrmrnl An 


i450 


r 70 1 £7 

528162 


17CO 1A/1 ~~/t O 0770m 

1 /5 3 . J 04 .gZ43__2223 U 1 


17 


IVIUUIOOZ / lVy.UUD 


T TP 7 .MattmPaI An 

Uv-/Z-in oixii^/Oion 


1451 


648609 


1 7CO "MAC /v«/l2 777071 

1 /53.JN05.gZ43_222321 


17 

r 


XVLUUUjOZ / J D.KJKjJ 


TTP7 MArmPnlnti 

u ^/Z-in orinv^oioii 


1 Jen 

1452 


213 


1 7CO T\t\£L e^rwAI 717077 

1 /53.L>06.gZ43_2223Z/ 


17 

r 


JVlUUU^OZOZU.VJUo 


1 TP 7 -MArm P a1 An 


1 /l CO 

1453 


648063 


into T7A/C /v-ylO 77707Q 

1 75 3 .±}0 o .gZ43_Z2232o 


17 

Jr 


iVlUUU 3 OZ 0*fx> . LUj 


TTP7 "M ATm P a1 An 
U V^Z -IN OIlTl^OIOIl 


1 /I C A 

1454 


647828 


170 TA/C «-»/IO 777007 

1 /53.106.gZ43_222332 


17 


AifAAA^ £SV1(\ A •T^fkO 


1 TP 7 JNfATm P a1 An 


i -iff 
1455 


1 4/6380 


iafQ "r A/C svrr/IO O7700C 

1 /53.LiOo.gZ43__222335 


17 

r 


A/THrt A ^ 7 7 AP> ■ 17 O SI 
IVLUUU JOZ /Hl^.lwUo 


T TP7 -NT attm P a1 An 

u v^z -in oirni^oi on 


1 A 

1456 


/i/i a7*£7 
649262 


1 7<0 T\l A -_._./! O 777001 

1 /53.U10.gz43__222391 


17 


X>f aaac/:7i;o A -rifiR 
JMUUU30Z03/\.JUfUo 


T TP7 .Mattti P a1 An 


1 A CI 

1457 


648590 


1 7CO A 17 077/17A 

1 753 .A 1 2 ,gz43_222420 


17 

r 


JVIUU0DOZ3 o Li .xlUO 


T TP7 MArmPAlAn 


i if © 
1458 


557857 


17CO T17 /w»/10 777/17© 

1753.112.gz43_222428 


17 

r 


A /f A A A ^ ^7 7HXI *17n7 

iviuuoDoz /ou.ruz 


1TP7 AlArmpAlAti 


1459 


*C/f AO /I A 

649349 


1TCO A 1 O 0 077/1C.1 

1 75 3 .P 1 3 .gz43 22245 1 


17 

r 


AyfAAA^/J7CATl''PAO 
JYLUU 03 OZ o UJj . XlAJzr 


1TP7 "NTrM-rvi P a1 r\r\ 


1460 


460245 


17S.O D1 4 *>-»»/10 777/1^0 

1 /53.ol4.gZ43_222453 


17 

r 


AifAAA^A7^n^'T71 1 
JVIUUUjOZOU^/.III 1 


T TP 7 JM Arm P a1 An 


1 A ZT 1 

1461 


648442 


1 7C3 AJI /I _._/1 0 707/1/C/1 

1 753 JM 14.gz43_222464 


17 

r 


AyTAAA^/C77#£ A »17A^ 
1V10U03OZ / OA.r KID 


TTP7 ATnimPAlAn 

u^z-rsoniiV-/Oioii 


1462 


646609 


1TCO T?1 f /IO 077/177 

1753.E15.gz43_2224 /2 


17 

r 


AiTAAA<<0 A «T7 A 1 

MU005oZ05A.iiU 1 


TTP7 \TArmPAlAti 


1463 


562173 


1 ns 1 nif a i 177 /I7 /I 

1 753 .G15 ,gz43_222474 


17 

r 


"\yTAAA^/COiC7TVQAO 

M00056267L>.x>Oy 


TTP7 ATAnviPAl An 


1464 


SSS071 


1 753 A 1 7 ar43 222500 


F 


M00056259AE08 


UC2-NormColon 


1465 


486452 


1753.D17.gz43_222503 


F 


M00056263B:E10 


UC2-NormColon 


1466 


507066 


1753.M17.gz43 _222512 


F 


M00056276D:B12 


UC2-NormColon 


1467 


649592 


1753.O18.gz43_222530 


F 


M00056279B:G06 


UC2-NonnColon 


1468 


268336 


1753.C23.gz43_222598 


F 


M00056262B:B08 


UC2-NormColon 


1469 


509202 


1753.P23.gz43_222611 


F 


M00056280D:C06 


UC2-NormColon 


1470 


! 459914 


1753.A24.gz43_222612 


F 


M00056259C:E03 


UC2-NormColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORJEN 
T 


CLONE ID 


LIBRARY 


1471 


466894 


1753 .I24.gz43_222620 


F 


MOOO56271A:G04 


UC2-NormColon 


1472 


647839 


1754.D07.gz43_222727 


F 


M00056285B:B01 


UC2-NonnColon 


1473 


644862 


1754.107 .gz43_222732 


F 


M00056292D:C06 


UC2-NormColon 


1474 


492587 


1754.K07.gz43_222734 


F i 


M00056295C:F12 


UC2-NormColon 


1475 


651057 


1754.M07.gz43_222736 


F 


M00056298AH02 


UC2-NormColon 


1476 


554276 


1 754.E 1 0 .gz43_222776 


F 


M00056286C:H08 


UC2-NonnColon 


1477 


643984 


1754.N1 0.gz43_222785 


F 


M00056299C:F06 


UC2-NormColon 


1478 


646060 


1754.H12 .gz43_2228 1 1 


F 


M00056291B-.G01 


UC2-NormColon 


1479 


646387 


1754.K12.gz43_222814 


F 


M00056296A.A07 


UC2-NormColon 


1480 


462986 


1754.N12.gz43_222817 


F 


M00056299DA02 


UC2-NoraaColon 


1481 


598471 


1754.H19.gz43_222923 


F 


M00056291C:C11 


UC2-NormColon 


1482 


449882 


1 754.1 19.gz43_222924 


F 


M00056293B:F02 


UC2-NortnColon 


1483 


555998 


1754.F20.gz43_222937 


F 


M00056289B.E05 


UC2-NormColon 


1484 


419801 


1 754 J20.gz43_22294 1 


F 


M00056294B:G06 


UC2-NormColon 


1485 


647462 1 


1754.E21.gz43_222952 


F 


M00056287C.H08 


UC2-NormColon 


I486 


648524 


1 754.F2 1 .gz43_22295 3 


F 


M00056289B:F09 


UC2-NonnColon 


1487 


462865 


1754.M22.gz43_222976 


F 


M00056298D-.G01 


UC2-NormColon 


1488 


645844 


1754.022.gz43_222978 


F 


M00056301C:H08 


UC2-NormColon 


1489 


556497 


1 75 4 . C23 .gz43_222982 


F 


M00056284C:A11 


UC2-NormColon 


1490 


467709 


1 754X23 .gz43_22299 1 


F 


M00056297D:B10 


UC2-NormColon 


1491 


559857 


1754.D24.gz43_222999 


F 


M00056286BA04 


UC2-NonnColon 


1492 


651115 


1754.N24.gz43_223009 


F 


M00056300BA11 


UC2-NormColon 


1493 


644660 


1755 . DOS .gz43_223079 


F 


M00056307AF07 


UC2-NormColon 


1494 


647248 


1 755 .N05 .gz43_223089 


F 


M00056329C:B09 


UC2-NormColon 


1495 


524261 


1755 .B06.gz43_223093 


F 


M00056305AA02 


UC2-NormColon 


1496 


556444 


1755.K06.gz43_223102 


F 


M00056320C:E08 


UC2-NormColon 


1497 


639378 


1755.M08.gz43_223136 


F 


M00056326C:G07 


UC2-NonnColon 


1498 


460929 


1755 .009 .gz43_223 154 


F 


M00056333AE09 


UC2-NormColon 


1499 


642260 


1 755 .Kl 0.gz43_223 1 66 


F 


M00056320D:F07 


UC2-NormColon 


1500 

JL*J \J\J 


635062 


1755 .111 .gz43_223 180 


F 


M00056319AA11 


UC2-NonnColon 


1501 

Uv X 


644880 


1755 B14 ez43 223221 


F 


M00056305B-.F09 


UC2-NormColon 


150? 


452075 


1755 N14 sz43 223233 


F 


M00056330B:C02 


UC2-NormColon 




557052 


1755 El 5 ez43 223240 


F 


M00056310A-.C10 


UC2-NormColon 


1504 


569195 


1755 K15 ez43 223246 


F 


M00056323AG03 


UC2-NonnColon 


150S 


444972 


1755 M16 sz43 223264 


F 


M00056328B:E07 


UC2-NormColon 


1506 


467364 


1755.A17.gz43_223268 


F 


M00056304AH04 


UC2-NormColon 


1507 


446910 


1755A18.gz43_223284 


F 


M00056304B:C05 


UC2-NormColon 


1508 


447126 


1755.M19.gz43 223312 


F 


M00056328D:B01 


UC2-NormColon 


1509 


650639 


1755.D20.gz43 223319 


F 


M00056308A.C01 


UC2-NormColon 


1510 


641681 


1755.B22.gz43 223349 


F 


M00056305D:E08 


UC2-NonnColon 


1511 


606129 


1755.P24.gz43_223395 


F 


M00056338B:B08 


UC2-NormColon 


1512 


448412 


1766.A01 .gz43_224099 


F 


M00042346AB04 


UC2-PrimColon 
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1513 


485237 


1766.E01.gz43 224103 


F 


M00042432D:G02 


UC2-PrimColon 


1514 


479131 


1766 G01 ez43 224105 


F 


M00042437BA11 

x. txu vy w ■ r * x/4* xxx 


UC2-PrimColon 


1515 


481457 


1766.H03.gz43 224138 


F 


M00042439A:D09 


UC2-PrimColoii 


1516 


448892 


1766A09ez43 224227 


F 


M00042346CG12 


UC2-PrimColon 


1517 


477797 


1766 E13 sz43 224295 


F 


M00042433CH05 


UC2-PrimColon 


1518 


450212 


1766 119 az43 224395 


F 


M00042462A-F12 


UC2-PrimColon 


1519 


3781 


1766.J21.gz43 224428 


F 


M00042464A:F06 


UC2-PrimColoxi 


1520 


447377 


1766.M22.gz43 224447 


F 


M00042470B:H1 1 


UC2-PrimColon 


1521 


453606 


1767 .H01. gz43 224490 


F 


M00042764B:B10 


UC2-PrimColon 


1522 


472101 


1767.O02.gz43 224513 


F 


M00042777B:B05 


UC2-PrimColon 


1523 


284586 


1767D03ez43 224518 


F 


M00042756B:B01 


UC2-PrimColon 


1524 


449689 


1767.M03.gz43 224527 


F 


M00042773A:H11 


UC2-PrimColon 


1525 


449215 


1767.A04.gz43 224531 


F 


M00042515D:B11 


UC2-PrimColon 


1526 


128749 


1767 104 sz43 224539 


F 


M00042766AD07 


UC2-PrimColon 


1527 


451982 


1767 A05 ez43 224547 


F 


M000425 15D.G01 


UC2-PrimCoIon 


1528 


448251 


1767 H05 ez43 224554 


F 


M00042764C.B10 


UC2-PrimColon 


1529 


449437 


1767J05ez43 224556 


F 


M00042767D.D02 


UC2-PiimColon 


1530 


487876 


1767 K05 oz43 224557 


F 

X 


M00042769B-E12 

ITlvUXr l «< » V/^'X^.X^X^W 


UC2-PrimColon 

W V-^X# X X XL IV-ZVlV/xA 


1531 

X*J^J X 


481614 

tO X VJ XT 


1761CQ6ez43 224565 


F 


M00042755AD09 

ITlv v w / */«/AX«X^ V</ 


UC2-PrimColon 


1537 


486547 


1767M06az43 224575 


F 


M00042773BH08 

XVXV/ v/ # / s^XV.XXW 


UC2-PrimColon 


1533 


1 79760 


1767A07ffz43 224579 


F 


M0004275 1BC07 


UC2-PrimColon 


1534 


484633 


1767 108 ffz43 224603 


F 


M00042766BG11 


UC2-PrimColon 


ijjj 


47Q851 


1767 K08 ffz43 224605 


F 

X 


M00042769CB0 1 


UC2-PrimColon 


1536 


470801 


1767G09ez43 224617 


F 


M00042763B.A03 


UC2-PrimColon 


1537 


486452 


1767J09ffz43 224620 


F 


M00042768AH09 


UC2-PrimColon 


1538 

X <J«J O 


479868 


1767K09sz43 224621 


F 


M00042769CB06 


UC2-PrimColon 


1539 

1JJ7 


485880 

TOJOOU 


1767 C10 ez43 224629 


F 


M00042755A:H04 


UC2-PrimColon 


1540 


456513 

•t»j »j«j x tj 


1767Jllez43 224652 


F 


M00042768BF02 


UC2-PrimColon 


1*^41 


473433 


1161 Oil ez43 224657 


F 


M00042778AC04 


UC2-PrimColon 


1542 


481594 


1767 CI 2 2z43 224661 


F 


M00042755BD01 


UC2-PrimColon 


1543 


477521 


1767 F12 bz43 224664 


F 


M00042761CG11 


UC2-PrimColon 


1544 


485073 


1767 L12 ffz43 224670 


F 


M00042771BG05 


UC2-PrimColon 


X J*T J 


480960 


1767B13ffz43 224676 


F 


M00042753BC08 


UC2-PrimColon 




TT7L X«J 


1767 A 14 ez43 224691 


F 


M0004275 1DD03 


UC2-PrimColon 


1 547 


450551 


1767T14P743 224699 


F 


M00042766DC05 


UC2-PrimColon 


1548 


481057 


1767.M14.gz43_224703 


F 


M00042774B:C01 


UC2-PrimColon 


1549 


449076 


1767.D15.gz43_224710 


F 


M00042757A:H07 


UC2-PximColon 


1550 


477098 


1767.E15.gz43_224711 


F 


M00042760A:G12 


UC2-PrimColon 


1551 


482486 


1767J15.gz43_224716 


F 


M00042768C.E06 


UC2-PrimCoIon 


1552 


449438 


1767.K15.gz43_224717 


F 


M00042770A:D02 


UC2-PrimColon 


1553 


448260 


1767.015.gz43_224721 


F 


M00042778B:B07 


UC2-PrimColon 


1554 


453385 


1767.P15.gz43_224722 


F 


M00042779D:D04 


UC2-PrimColon 
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ID NO 


CLUSTER 


SEO NAME 


ORIEN 
T 


CLONED) 


LIBRARY 




4o43D-> 


17£7 A1£ rr-yAO 77A770 
1 /O / .AXO.gZHO_ZZ*f / Zj 


Jr 


A/ffin04?7S 1 D-F0Q 


T IC2-PrimColon 


Iff/ 

1556 


/I ocooo 

485882 


1 nan n A 0 77A77^ 
1 /6 /.L,10.gZ*IO_zZ^ /Z3 


17 

r 






1557 


449592 


17£7 T rr^AO nOAniA 
I/O /.Xjl0.gZ £ tO_ZZ*r / 


•p 
r 




T Tf*9 -P rim Pol on 


ire o 

1558 


449261 


17£7 A/F1£ cfrAI 77A70^ 
1 /0/.JVxi0.gZ'IO_ZZ4/ 


r 


Tv/rnnn ao 77 AC • roq 


T TP 9 -Prim f!n1 nn 


1559 


48590 / 


1 nan ri7 /y-T/tn 77A7A1 
1 /O/.L/l / .gZ^J_ZZ*f /Hi 


r 




T TP9 -PrimPnlnn 


15ov) 


4/45oU 


17£7 17 IT n^A^ 77A7AA 
1 / O / .1* 1 / -gZH 0_ZZ*t / HH 


r 


iv/rnnoA7 76 1 tyt* i 7 


1 JP2-PrimColon 


1561 


/I /1 0 1 <o 
448152 


1 7ATf \T1 7 rt-r/l'J 77A7^7 
1 /O / .IN 1 / .gZ^f 3_ZZ*f /3Z 


T? 

Sr 


Tv/rnnnA777^ri*Tj0 1 

iVXVJVJU'+Z/ 1 \jU .xJ\J x. 


T JP2-PrimPolon 

KJ V^Z. ""X X XXXXv^WX^FXX 


i can 
1562 


An A QOO 
4/4825 


17^7 D17 ft-rA^ 77A7^A 
l/O/.rl / .gZHJ_ZZH / 


r 


M"nnO49770n*F06 


T IP9 -PrimPolon 


i ca 
1563 


453024 


17^7 T718 rt-rA^ 77A7^0 
1 /0/.r!/lo.gZ'f^_ZZ z f/^y 


T7 
T 


TVf nnfi4.?7fi(7R*F 1 ? 


T JP2-PrimPolon 


1 CCA 

1564 


479880 


1 /0/.llo.gZHO_ZZH/0^ 


V 


"M*0n049767 A *R 1 0 


Uv^i X XXXllV>V/XwXI. 


1565 


477399 


17X7\T1fi 77A7/££ 
1 /O /.JN lo.gZ43_ZZH /Oo 


X* 


IVIvUU't^ / / UJ_y .VJ J. v/ 


T TP2-Prim Polon 


1 CCa 

1566 


485441 


17A7 A 10 crrrAI 77 All F 1 

1 /O / .Al y .gZ*f 0_ZZh / / 1 


T7 


M0004975 1 D*G09 


UC2-PrimColoii 


i can 
1567 


/toe 1 on 
485183 


1 7£7 DIO rr-yAO 77A777 

l /o / .x5iy.gz*fo_zzH/ /z 


r 




UC2-PrimColon 


156o 


/KO 1 f\A 

452104 


17A7 T1Q 77A773 
1 /O /.l_/l:7.gZHO__ZZ*t/ ID 


V 




UC2-PrimColon 


1569 


449275 


17A7 A OH n"rA0 77A7R7 
1/0/ .AZU.gZ*tO_ZZ*f / 5 / 




1V10004?7^2A*B04 


UC2-PrimColon 

<J V^^< X 1 XU1 V/V/l wJU 


1570 


A A TO 1 O 

4472 1 8 


1 7/C7 T 70 ft^/ll 77A7Q8 

1 /o / .LiZU.gZHj_zzH- / yo 


r 


A/T00049779f ,, n 2 


I JP2-PrimColon 


1571 


472119 


1 7/C7 XT7fi r^AO 77Aftnn 
1 /O / .JNZU.gZ £ fO_ZZ'foUU 


T7 


A/T00047777A-R1 0 


IJC2-PiimColoxi 


1 Cn*> 

1572 


482868 


1 7/C7 TO 1 rr™10 77 AS 11 
1 /O / .12 1 .gZ45_ZZ*f o 1 1 


r 


A^0004?767R-F 1 0 


T JP2-PrimColon 

V^Vi/^ X XXXIXV^V/XVVXX 


1 f 

1573 


yjff 1 A"} 

455143 j 


man X7nn n^A o OOACOA 
1 767 .r 22 .gz43_2z4oZ4 


X* 


A/r000477fi9 A '(709 


T IP2-PrimColon 

Uv-'jtf Jl X XXXXV^UXVJ'XX 


1574 


A O AC\H A 

484964 


i nan too c*rwAi OOA807 
1 /o / .12z.gZ43_ZZ45Z / 


17 


A/f OO 0 A9 767R • f*¥04 


T TP2-PrimPolon 

\J\^dZ* X XXXXXV^V/XV/XX 


1 f Tf 

1575 


A A OOTf 

448325 


i nan t\oo *+rrAi nnAQ.no 
1 767 .L/23 .gZ45_2Z4o5 o 


p 

X* 


TV/f OOOA7 7 S * P06 


T JP2-PrimColon 

\J\^r£, X X XXXXV^ UlvXl 


1576 


AO A AO*7 

484987 


i i nun TOO wWlO 7745/11 

1767.123 .gz43_224845 


p 
X* 


M00049 767T* • G0Q 


T IP2-PrimColon 

W V>X» X X XXXl\_/^/XWXX 


1577 


A^IC HA1 

475797 


1 767 .U23 .gZ45_ZZ4o49 


p 


1VA00049778TJF1 1 


T JP2-PrimColon 

UV/Ai X X XXXXVs v/xv/xx 


1578 


477757 


1 /67.F24.gz43_224oDO 


p 
r 


A/TOOO 49 7X 0 A -HO 5 


T JP2-PrimColon 

V>X» X XXXXIV^V/XWXX 


1579 


448233 


ino tv/\1 ___yi o 77 r 7 r A 
; 1 / / 8 .DO 1 .gZ45_2Z jZ34 


p 


AA0004.9 R49 A *R 1 2 


1 TP2-PrimColon 


1 f OA 

1580 


AHC 111 

475 111 


mnQ DH7 r»*»AO 77 f 7^8 
I 1 / /o.I3UZ.gZHJ_ZZ^Z05 


p 


A/fO 0049 8^ 8 A -F02 


UP2-PrimColon 


1581 


449403 


i nno xtao ^rr/io 00<0£/i 
1 / /o.lN0z.gz43_ZZDZoU 


p 


AA000A9 8^QR-Tj06 


T JP2-PrimColon 

\^ V^X# X XXXXIV^V/XV/XX 


1582 


449604 


mno r\f\n r***Ai 77 r 78i 
1778.U02.gZ45_2Z3Zol 


p 

X* 


lV>f OOOA9 X60T1-F09 


T TP2-PrimPolon 

vJ V/Zi X X XXlXv^rVXV/XX 


1583 


447983 


i nno 17 no r*-,A o oo^OfiQ 
1778.r05.gz45_Z23Zo<$ 


p 
r 


Ayf OOOA9 X AS P • A 0Q 


T TP2-PrimColon 


1584 


AO A AA 1 

484091 


i nno r\(\l r*~A1 77^007 
1 / /8.U05.gZ43_Z2j29/ 


r 


TV>TOOOA9 R^OTl -F0S 


T JP2-PrimColon 

WV^Z/ X X XXXlX^VXXVXX 


1 ^ 0 fr 

1585 


480900 


i nno jf\A ~-wA i oo<OAft 
1778.J04.gz43_2253Uo 


17 

r 




T IP2-PrimColon 

V^Z> X X XXXIV'VSXVXIX 


1 f O /T 

1586 


ff 101 1 

551811 


\nnO TST\A r***A1 oo^ono 

1 / /8.rv04.gz45 zzo^uy 


p 


AA00049R^4A / R1 1 


T JP2-PrimColon 

U v^i X XXXXLV^V/XV/XX 


1 f O ^7 

1587 


A O C O A 1 

485841 


i nno t r\<. **-,A i OO^OOA 
17 78.L<05.gZ452Z55ZO 


T? 

r 


A/rn nnA9 R s STVWOfi 

JYXU U U*tZ OJJD .XxV/ U 


T TP 2 -Prim Colon 

V_-f ^« XT X XXXIV^V^XV/XX 


1 f 0 0 
1588 


476256 


inno f~*r\n n ~Al OO^O/IO 
1778. CO / .gz43_2ZD54y 


T7 

r 


AAnOOA7 0TJ-F09 


T TP2-PrimPolon 

\J\*sAl* X X lXIXVsVJXVJXX 


1 C OA 

1589 


447983 


inno T2(\n nrrAI 07^0^1 

1 / /8.xi0 /.gZ45_ZZD3Dl 


p 

X* 


AA00049 R44 A -D09 


I TP2-PrimColon 

\J \**A* X X XXXl^'V/XV/XX 


1590 


725641 


1778.K07.gz43_225357 


F 


M00042854A:D05 


UC2-PrimColon 


1591 


484789 


1778.L08.gz43_225374 


F 


M00042855C:G11 


UC2~PrimCo!on 


1592 


449110 


1778.L09.gz43_225390 


F 


M00042855D:A12 


UC2-PrimColon 


1593 


451368 


1778.O09.gz43_225393 


F 


! M00042861A:F04 


UC2-PrimColon 


1594 


449521 


1778.P09.gz43_225394 


F 


M00042862C:E02 


UC2-PrimColon 


1595 


727875 


1778.G10.gz43_225401 


F 


M00042847A:F04 


UC2-PriniColoii 


1596 


449273 


1778.111.gz43_225419 


F 


M00042851AB08 


UC2-PrimColon 
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44Q659 


1778 Ml 1 <rr43 225423 


X 


M00042857C-G04 


UC2-PrimColon 


isq8 


481664 


1778 Nil oWtt 99S494 


T? 


M000498 iQD-Dl 9 


T TC9~Prrmf % nlnn 


1SQQ 


4489^0 


17755 F19 99S431 


x^ 


M00049844r-B03 


TTr9-Pr7TnPn1nn 

\J X 1 XLliVydUil 


1600 
low 


48S09Q 


1778 FT19 99S434 


F 


M00049 840R -O06 




1601 


48^974 


1778 T19 cr-74^ 99^4^ 

1 / / O.XlZ.gZHD__ZZJ t tJJ 


X* 


M000498S1 API 1 


T TP 9 -Prim P nl nn 
u vy^-jr i xniv^uiuxx 


lOUZ 


644. 1 Q0 


1778 T19 &-74^ 99^4^6 

1/ / O.J IZ-.gZH-J ZZ.J'-IOVJ 


F 

x^ 


M000498S^A-n0^ 


TJP9-Prii«Colnn 

\J X 1 Uli.V_xVJJ.vJIi 


1603 

10U3 


79^966 

/ Z JZOD 


1778 TCI 9 ot43 99S4^7 

X / / O.lVlZ.gZtO /.ZJtj / 


TT 
17 


M000498S4RF0S 


T TP9 -PrfmPnlnn 

VJ V>-C/ X 1 Xli I V_/ UXVJX1 


1604 


48000S 


1778 T 19 <*74^ 99S4^8 


r 


M00049 85 5D R 1 9 


T TP 9 -P ri m Pnl nn 
vj r i uxiVAjxv/ix 


160^ 


4R4841 


1778 Ml 9 ov4^ 99S4^0 

1 / / O.xVlxZ.g^TO z.JLJr'+jy 


r 


M00049857r*riO5 


T TP9 -Prim Pol nn 

VJ V/Z« I X Xlll V_/VJ1 VJXi 


1 606 


480641 


1778 1M19 <T74^ 99S440 


F 

X 


M00049860 A -P0 1 


UV/i- I 1 IXtXwwXVJXI. 


1UU / 


48131Q 


1778 01^^74^ 99S4S7 


F 

X 


M00042861CC03 


UC2-PriiriColon 

Uv*» X 1 1111 V^V/XV/Xl 


1608 


^87077 


1778 PI 4 ot4^ 99S461 


F 


M00049840C*Tn 9 


T TP9-PrimPolorj 

VJ V_/Z< X 1 XlllV-»wAWJX 


1 60Q 


4^67^7 


1778 T 14 cr743 99^470 

1/ / 0.1vX*T.gZ*TO Zx.JH/U 


F 

17 


M00049855DD06 


VJ V/^> XT 1 lXllVvVJlVJll 


1 61 A 
xOiU 


47QOR4 


1778 Ml 4 <T-r4^ 99<\471 


F 


A/T00049 8 5 8 A • A 08 


T TP9 -PrimPol nn 

UV/Z< X 1 XXLlV>UiUXl 


161 1 
lOl 1 


484468 


1778 OIS <r«74^ 99548Q 


F 

r 


M00049 861 C-F07 


I IP9-PrimCo1on 

VJvL X A JLXIXV-'XJAVXIX 


1610 
iOlZ 


4^067^ 


1778 PI 8 &t4^ 99S^9S 


F 

Mr 


M00042841 RH05 

1VXV/ V/ UtZi Otl X* .X XV/ -J 


T IP2-PrimColon 

Uva IXilXV^VrXVJXI 


1 61 0. 


474^07 
/*rDy 1 


1778 FI18 <r-74^ 99^^96 

1 / / O.i/1 0.gZH J_ZZJ JZD 


F 


M00049 84^ A *F07 


T JP9 -PrimPnlon 

\J \*s£* X X XLllVw/\Ji.VIXl 


1 61 A 


477^0^ 


1778 F18 99SS97 


F 


MOO 049 844D -FT02 


T JP2-Primrolon 

L/V/^> X 1 IXXJLVw'V/XVJXl 


161 ^ 


4^7000 


1778 T18 ar4^ 99SS^1 


F 


M000498S1DR08 


T JP2-PrimCo1on 


lOlu 


44Q40/7 


1778 018 rr«74^ 99^^^7 


F 


M00049 86 1 TV H09 

1VX V/V/ \-r*T^. OU X XV . 1-/ \J -7 


1 JP 2-P iHtTiColon 

U V^^> X XXXllV^V/lV/lX 


1 61 7 
lOl / 


448 104 


1778F1Qar4^ 995^4^ 


F 


M00049845 A *B05 


UC2-PilmColoti 

VJ X 1 JLX1X V^Vyl WJJ. 


161 ft 
lOlo 


4^856 


1 778 Ti99 <xt4^ 99S^Q0 

1 / / O .l/Zit ,gZ*t J_ZZ J J 7 w 


F 

x^ 


M00049 84^R FO 5 


UC2-PrimColon 


161 Q 


47^Q49 


1778 T*9^ o-r4^ 99^604 


F 

17 


M000498^QBF05 


t JP2-PrimColon 


1 £7fi 
lOZU 


xyDyyyi. 


177» F9^ crr4^ 99S608 


F 

17 


M00049 846P T)0Q 


T TP? -Prim Pol on 

L/V/X. X 1 XL11V_/V71VJ1X 


1 671 


480640 


1778 fi9^ 995617 


F 


M00049869 A -C0 1 


T Jr2-PriinColon 


1 677 
lOZZ 


477^07 
*f 1 Zd\) / 


1778 P94 <yr4^ 99 S 691 
1/ / O.^Zf.gZ^J^ZZJOZ 1 


F 


M00049 849 A R04 


UC2-PrimColon 


1 £JT\ 
10Z.3 




17SfiTf)1 rr-74^ 996097 
1 / 5U.1U 1 .gZ*rO_ZZDUZ / 


F 

JT 


M00056^61 A-r*01 

IVIUUU J OJ vJ 1a. \-/U X 


T TP?-PrimPolnn 

LJ \s£t X X XXllV^V/XViXX 


1674 


7^7 i 


1 780 T709 crr4^ 9960^0 


F 


M00056^56r*H07 


T IC2-PriiiiColon 

U 1 X 1.11 lV^UlVJli 


1 £7^ 


77AQH7 


17ftA \t(YJ nvA.1, 99604^ 


F 


M00056^6^ A *P06 


T TP 9 -Prim Pol on 

KJ VyZ. X X XIXIV^VJX^JJX 


1 AOA 


TO /I TOO 
/Z^t/yZ 


1 7RH T H7 r»-r4^ 07^n46 
1 / 5U .lwUZ.gZ £ f^__ZZOU*f O 


F 

JT 


M00056^ 64 A -"R04 
1V1UUU jOj O i tf\.D\J e T 


T TP9 -Prim Pol on 

\J V-'Z. X X XlliVxvllVJXl 


1 £97 

loz / 


/.30ZxU 


I /oU.X5UH-.gZ £ rj_ZZOUOO 


F 
17 


M000S6^5^P-F19 

1V1UUU D OD J O . JCi 1 Z 


T TP9 -PrimPnlon 

VJ V^Z« XT X ILL LVwsVJXVJXL 


1 AOQ 
loZo 


£0*2 07£ 


1 / oU. OU3 .gZ'rO^ZZOUo?' 


F 


MOOO56^50A-PO8 

1V1UUU J Oj J 


T TP9-Prim Pol on 

KJ X X XXXXV^VJXUXX 


1 /COO 




170A tac rt-^A^ 00AHO1 
1 /oU.IU3.gZ*fO_ZZOUyi 


F 

17 


1V1UUU D Dj O l/V.lxUU 


T TP9 -Prim Pol on 

\J V/Z. 1 1 UliV^V/XVXIX 


i 




17ftffc Tff\^ OO^HO^ 

1 /oU.JvU-).gZ £ fO_ZZOUi'J> 


F 

! r 


M00056^6^ A *F 1 0 


T TP? -Prim Pol on 

VJ V»y^» X X XlXlVsUXUlX 


i £7i 


1 1 OA 


1 78H PH^ rr-r4^ 79^0.02 
1 /oU.JrU*>.gZ'r^>_ZZOU5'o 


F 
r 


M00056^68RTT00 


T TP ? -Pri m Po1 on 

KJ X X XXXAV^VXIV/XX 


1632 


416326 


1780.G06.gz43 226105 


F 


M00056359AF06 


UC2-PrimColon 


1633 


727129 


1780.D07.gz43_226118 


F 


M00056355D:A05 


UC2-PrimColon 


1634 


560859 


1780.O07.gz43__226129 


F 


M00056367C:D06 


UC2-PrimColon 


1635 


! 482831 


1780.P07.gz43_226130 


F 


M00056368C:E07 


UC2-PrimColon 


1636 


503862 


1780.L08.gz43_226142 


F 


M00056364C:B01 


UC2-PrimColon 


1637 


468467 


1780.D09.gz43_226150 


F 


M00056355D:B08 


UC2-PrimColon 


1638 


594994 


1780.A10.gz43_226163 


F 


M00056352D:H02 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEONAME 


ORIEN 
T 


CLONE ED 


LIBRARY 


loo)? 


AA7GAA 
40«5yOO 


1 /oU.v^xU.gZ^r-J ZZDIOj 


r 




UC2-PrimColon 


1 A A A 

164U 


A7 1 A/1 A 
4/1 04O 


nsn Tin 99£i7i 

1 /oU.llU.gZ43_ZZ01 fx 


r 




Tir , 9-PriTnPn1nn 

KJ\-t£* X7 X XXIXV-^UXVJXX 


1 AA 1 
1641 


1 $ 1 yzj 


1 / 5U.D 1 1 .gZ*f j_ZZD 1 OU 






v^V_/-t* X7 XXXXX\_svJXUXX 


1 A/IT 

164Z 


/IOAQOA 

4oUyZ4 


ITOATV/Tll rr^A^ 99A1Q1 
1 /oU.iVxx l.gZ43_ZZOiy 1 


r 


MO0nS6^6SR*m 0 


T TP? -PrimPnlon 

\J Vv^« X X XXXXV^v^XV/XX 


1 A/f 1 

io4J 


31 


nftn pi i cfrAi 99A1QA 

1 / 5U .r 1 1 .gZ43 ZZO 1 


p 


MO00S 63 68D 


UC2-PrimColon 

V^*» X X XXXXV^\JXv/Xl 


1644 




178ft 1719 crrrAI 99A9fHl 
1 /OU.r !Z.gZ43 ZZ-OZUU 


P 


M0n0S63S8AG05 


UP2-PrimColon 

\Jy^j£* XT X XXXXV^v/Xv/Xl 


1 A/1 ^ 

1o4j 




17ftft W19 rr-rA^ 996909 

l /ou.riiz.gz^3_zzozuz 


P 


M000S 6360R D09 


UP2-PriiTiColon 


1 A/1 A 

xo4o 


1 Q^'a AA 


1 7ftft T19 crvA.'l 99A,9fV3 
1 /oU.llZ.gZ*+3 ZZOZUj 


p 


M0005 63 6 1 C ' C06 


UC2-PrimColon 


1 A/IT 
l04/ 


^11 AHA 
311 OUO 


178ft T19 rr-7A3 996904. 


p 


M00056362CC02 


UC2-PrimColon 


1 A/1 Q 

lo4o 




1 7«ft 1 4 rr-7A3 996937 


p 


M00056363CC12 


UC2-PrimColon 


1 A/IO 




1 78ft P1A rvrA^ 99A9A9 
1 /5U..T 14.gZ*f3_ZZOZ i tZ 


P 

r 


IVX\Jv/V/»^ UJVJOL/.VJuJ 


T IP2-Prim Colon 

VJ X X XXXXV^V/Xv/XX 


1 ACA 


/i qaaoc 


1 78ft Ml <J nvAl 99A9^ft 
1 /oU.xll3.gZ43_ZZOZ3U 


p 


JLVAVW/fc/VI J\J\J\** .Xj\J X 


T TP2-PrimColon 

\J X X XXXXV^vSXVAXX 


1651 




170AT 1 ^ 99A9*!A 

1 /oU.JL13.gZ43_ZZOZ3*t 


p 


iviuv/vjyju'Tv/.i xv/ o 


T TP2-Pri mColon 

\J v/Z) XT X XXXXV^V/Xv/XX 


1652 


33 /UZy 


179H ft-TA^ 99A9A1 
1 /oU.^10.gZ43_ZZOZ01 


p 
r 


LVX\J\J\JiJ UJ J JJJ.L/U^ 


T IP2-PrimColon 

VJ X X XXXlXw/ Ulvli 


1653 


533244 


1 /oU.J-fl / .gZ45_ZZOZoO 


p 


lv/rnoo S 63 64D • 


T TP?-PriTTiPo1on 

UV/Xi X X XXXXV^v/XVJXX 


1654 


456533 


1 7ftA Hon ft-jzl'J 99A^37 
1 /oU.UZU.gZ45_ZZO^o / 


p 

JT 


A4"00nS636RA-r , 1 1 

1V1v\/UJOJ\jO^\..v-'X X 


T TP2-PrimPolon 

vj ^ X X XXXXVsVJXv/XX 


1655 


725/ /4 


1 7ftft iV*99 /yt/I^ 99 A"3A5 
1 /oU.JsZZ.gZ4^_ZZO303 


P 


A/Tnn0S6363rjH03 


T TP2-PrimPolon 

UV<b x xxxxxv^v/xisxx 


1656 


724039 


1 /oU.liZ3.gZ43__ZZ03 #3 


p 




T TP2-PrimColon 

Uv/i. X X XXXX v^VJXVJXX 


1 f r 

1657 


T"> / ^ 1 A A 

732144 


1 TOA UTOO r*T,A1 OOAQTQ 

1 /5U.ri23.gz43_ZZo5 /<S 


p 
r 


1V1UUU3 O 3 O \JxJ .xx 1 \J 


T TP9 -Prim Pol nn 

U V^Z> —X 1 XiliV^VJXvJXX 


1658 


724257 


1TQO nAI Q OOA/1AO 

1 /oy.OrU 1 .gZ43_ZZo4l/y 


p 


AAOOOS A37Sr>-D 1 9 
XVxv/UV/JOJ / Jl/.i/l^ 


TTP9-PrimPn1nn 

VJ V_/^# X X XXXXVsVJXVJXX 


1659 


474346 


1700 A/TA1 r*<-,A 1 OO A/1 1^ 

1 /oy.MUl.gZ45_ZZo413 


p 


TVAonn^A3RAP-prnA 

xVlUUyJOOO*TV^.xxv^*T 


T TP? -Prim Pol on 

VJ V-/.£- X X XXXXV^VJXUXX 


1 /CAA 

looU 


3 /0342 


17ftO"Mm cttAI 99Av11A 
1 /oy.INUl.gZ*l-3_ZZO z tlO 


p 


M000S63R6A-F08 


UC2-PrimColon 


1 iCA 1 

1661 


4756o2 


1 /oy.rUl.gZ43__ZZ041o 


p 


1VJLU J OJ O O J_> .nvJ 


T TP2-PrimPolon 

VJ V^^> X X XXXX V^V7Xv/XX 


1662 


/34o4o 


1 75Q RAO 99AA9H 
i /oy.x3UZ.gZ43 ZZOH-ZU 


p 


A/f000S6370R-f , 1 0 


I TP2-PrimPolon 

U V/^ X A XX1L V_>VJXv-/XX 


i aa*2 
IOOj 


1 QA^79 
xOOJ /Z 


1 750 lVyfH9 wtZI^ 9964^1 
1 /oy.lVlUZ.gZ 4 t'3 ZZOHjI 


P 


M000S 63 84D • A04 


UC2-PrimColon 


l AA/i 
loo4 


/3oUUl 


1 /5y.AU3.gZ43_ZZO*t33 


P 


M000S636QRF03 


UC2-PrimColon 

VJ V^<Z-« X X JX11\_/ v/Xv/XX 


1 AA< 
lOOD 


4o /u /y 


1 /oy.l:U3.gZH3 ZZOH'fU 


P 


1V1UUWJUJ / ^V^.VJvO 


UC2-PrimColon 

L/W^ X X XXXXX^ v/XV/XX 


1 AAA 

looo 


AAA.9SA 


17OQ TJAQ rr^/t^ 99AAA9 

1 /oy.riU3.gZ'+3 ^zzo^+^fz 


Mr 


i M00nS6377A-r01 


UC2-PrimColon 


1 AA*7 

loo/ 


/232/4 


17SOTA'? otA^ 99AAA/2 
1 /oy.lU3.gZ43_ZZ0443 


P 


A/f000S6377rj'F0X 


T TP2-PrimPolon 

UVy^i X X 1XXXV^VJXV/XX 


1 AAQ 
lOQo 


333U33 


1 7SO "MA^ fT7/l^ 99 A/MR 
1 /oy.INUj.gZ'rO ZZO*H-0 


F 

Mr 


A/T00056386A-G01 


UC2-PrimColon 

*J V-i' X X XX 1 Iv/VlVU 


1669 


7240Uy 


1 /oy."Uj.gZ4«3 ZZ043U 


P 


IVIUwJDjOOD.DuJ 


TJP2-PrimColon 

VJ V^^> X X XEXLV^rv/AVJXX 


lo/U 


<AQ/1Q/1 

3oo42$4 


1 7fiO T5fiA cttA'X 99AA<J9 
1 /oy.JjU4.gZ4J__ZZ043Z 


p 


M000S637fl , R'F07 


t TP2-PrimPolon 

UV/Ai X X AXXXVsV/Av/XX 


16/1 


/23321 


1TQO T HA crwA'X 99AAA9 
1 /oy.l->v4.gZ43_ZZ040Z 


p 

Mr 


M000563R3T3-F0K 


T TP2-PrimColon 

UV/A> X X XXXt\_^V/XVJXX 


1672 


731542 


17QA V"AC rt-rAQ TTA/1TT 

1 /oy.JS.U3.gZ43_ZZo4/ / 


P 


A/ffinn^A389P*F1 1 
xVlUUU303oZV^.r 1 1 


T TP? -Prim Pol on 

VJ \-r£* X X xxxxv^vjxvjxx 


1673 


72544 0 


1 TOO T AA cwtAI lOSZAQA 
1 /oy.LfUO.gZ43_ZZu4y4 


p 
r 


xYIU UVJ J OO o ^ D . VJ V/ o 


T TP ? -Prim Pol on 

VJ V^jw X X XXXXV^VJXVJXX 


1674 


452806 


1789.N06.gz43 226496 


F 


! M00056386B:A07 


UC2-PrimColon 


1675 


549790 


1789.L07.gz43_226510 


F 


M00056383C:E07 


UC2-PrimColon 


1676 


730660 


1789.N07.gz43_226512 


F 


M00056386B:A11 


UC2-PrimColon 


1677 


503625 


1789.B08.gz43_226516 


F 


M00056370C:F01 


UC2-PriniColon 


1678 


725381 


! 1789.E09.gz43_226535 


F 


M00056373D-.GO1 


UC2-PrimColon 


1679 


649411 


1789.G09.gz43_226537 


F 


M00056376A:C10 


UC2-PrimColon 


1680 


729993 


1789.109.gz43__226539 


F 


M00056378C:E10 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


0RIEN 
T 


CLONE ED 


LIBRARY 


1681 

IvO X 


727523 


1789 M10 ez43 226559 


F 

X 


M00056385AA07 


I TC*9~PrimPn1nn 

KJ V^^> "X X XIII V> VJlVJXl 


1682 


459918 


1 789 N 1 0 z43 2265 60 


X 


M0005 63 86R -D07 


T TP 9 -Prim Pnl on 

V-J V^Zr X X XXXXV>V/XV/X1 


1683 

X UOJ 


735649 


1 789 R1 1 j*z43 226564 


X 


M0005 63 70D C06 


T TP9 -PrimPnlrm 

\J X X XXXiVsVJXUXX 


1684 


734348 


1789F11 P743 226568 


X 


M00056375A-D1 1 

XYX\J\J\J-J \J-7 / ^X^..XV X X 


U X X XX1XV>^JXU1X 


1685 


55841? 


1789 OH az43 226577 

X / 07.V X X , {*£*\J-J 1 / 


F 

X 


M0005 63 87C -DO 1 


I Tf*9-PrirnCYilr>n 

V V/Zr X 1 JXllV^rlFXUXX 


1686 


724694 


1789 H12 ez43 226586 


F 

X 


M00056377C* AO 1 


UC2-PrimColfin 

KJ\^>£* X X iXlXKs\JX\JIX 


X uo / 


723972 


1789 J12 ffz43 226588 


F 


M00056382A-A12 


UC2-PrimColon 

X X IXXXV^V/X^XX 


1688 

1UOO 


623408 


1789N12ffz43 226592 


F 


M 0005 63 86B E09 


UC2-PrimColon 


1689 


640158 


1790F01&z43 226792 


F 


M000563 94CD07 


UC2-PrimColon 


1690 


725095 


1790 J02 ez43 226812 


F 


M00056398CD 1 0 


UC2-PriTnColon 


1691 


562769 


1790 002 ffz43 226817 


F 


M00056404DG03 


UC2-PrimColon 


1692 


47985 1 


1790L03sz43 226830 


p 


M00056400OG04 


UC2-PriiTiColon 

\_</ V^^* X E JXXX W1U11 


169*3 

X\J^J 


481077 

TO XV// / 


1 790 N03 CT743 226832 


F 

X 


M00056403CF02 


T IC2-Prini Colon 

V>* V*/Z* X X XXXX\^x7XWXX 


1694. 


656968 
ujuz>uo 


1790 K04 0743 226845 


F 

X 


M00056399D-C1 1 


IJC2-PriiiiCo1on 

V^>o X X XXXX \JM\JXX 


1695 


79441 1 

/ JLHrx X X 


1 790 "NT04 cr 7 43 226848 


F 

X 


M00056403CF03 


T JC2-Prirrir , olon 

V> V/^> X X XXXX V^UXV/XX 


1 696 


640961 


1790"M05a743 996864 


F 

X i 


M00056403C-F1 9 


TJP9 .Prirn Colon 

V V<rZ« X X XXXX V> VJXV/XX 


1 697 


496997 


1790 H07 a^43 996886 


F 


M00056399r>r05 


I IC2 -Prim Col on 

UV/Ai X X XXXXV^wXVJXX 


1 AQ£ 
lOi/o 


£40^01 


1 700 M07 cr743 99680S 

1 /^U.xVJLU / .gZ*TJ Z*£,\JQZJ D 


F 


M 0005 6409 C-R09 


I IC9-PrimCo1on 

V7 \—r£- X X XXXXV^V/XUXX 


1 AQQ 


7901 1 1 


1700 TiOR cr-743 996O09 


F 


M00056399n*FT09 
ivxv/v/v/j\j j yz*xj j^xv/Zr 


T TC9 -PrimColon 
u v/ii r x xxxiVsvuvux 


i §\j\j 


/ JJJUJ 


1700 (t08 ot43 996905 


F 

x 


M00056396 A 1103 


T TC2-PrimColon 

V^V/Z» X X XXXX V> vJXVJXX 


1701 


790637 


1 700 A 1 0 <«r43 99693 1 


F 


M 0005 63 89FI H03 

i.VXV/V/V/ J vJ J 07U.11UJ 


T TC9 -PrimColon 

v_j v_^^ xt x xx i x\~> \jxyjxx 




796494 


1 790 K 1 0 <T743 99694 1 

X f y\J.l\~l v/.gZ*rJ ^.^rVJ^*tX 


F 


M00056400AD08 

1\X\J \J\J J VJ"T W^V.XV V/ o 


t TC2-PrimColon 

V/ \-/Z« X IJXUV^VJlUii 


X /VJ 


794714 


1 700 A/T1 0 ot43 996043 


F 


M00056409H-A01 


I TC9 -PrimColon 

VJ VsAf X XXXXXV^AJXwXX 






1700 PIO cr'rAI 99^046 
x / 7U.r lU.gZ*r J_ZZD7HO 


F 


IV/f 000^ 6406P ♦TjOO 


1 TC9-PrimCo1on 

IJ Vw^Z. X XXXlXV_yVJXvJXX 


1 70^ 


^56760 


1700R11 ot43 996948 


F 


M000 5 63 90C -R0 8 


I JC2-PrimColon 

KJ\^*£* X X XXXX v/xv/xx 


17H£ 
x / UO 


909 

ZU7 


1700 Mil ctt43 996959 
1 / j/u.iyi l l.gi&H'J j*z*\jyjy 


F 


M00056402D1307 


T TC2-PrimCo1on 

V/V^Z> X XXXLXV^V/XVJXX 




994014 


\ 1 700 U 1 9 0^4*3 996064 


F 
r 


M00056390C-C04 

1VX v/V/V/ J V7 J j7V/ V/.VyV/*T 


T TC9-PrimCo1on 

KJ V^Z> XT X XXXX Vy VJXVJXX 


1 7na 


Oi^Oj 


1700ni9o-z43 996066 


F 


M00056303 A 006 


I TC9 -PrimColon 

V/ V^x* XT X XXXXVvVJXV^XX 


1 £\JZ7 


J.7AQR1 

*fr /H761 


1 700 fil 9 »-r/l3 996060 


F 


M00056306 A-firl 0 

1VXV/ v/VI J U J ^Ux\.. VJ X \J 


T TC2-PrimCo1on 

V V^Z> X X XXXXV^V7XV/XX 


1 71 0 

1 / X \J 


790950 


1700 1 19 ar43 996974 

L /7v.L>1Z.^*tJ Z*£*\J-s /■*+ 


F 


1 M00056401AF09 


UC2-PrimColon 

V_^^— • X X XXXXx^V/XV/XX 


171 1 


640756 

U < tV/ /JO 


1 790 >J1 9 <T743 996976 


F 


M00056404AG03 


UC2-PrimColon 

X X XXXX X«/ V/XV/X1 


1717 

X / 1Z 


440998 


1790O13&7-43 996993 

x 1 yVJ.KJ 1 J.gi6*rJ £-.Z*\jZ7zs J 


X 


M00056405C-H04 


UC2-PriniColon 

X X XLltV./ \JX\JXX 


x / I.) 


4.9^490 


\ 1 700 P 1 ^ <*tA^ 99^004 


F 


M00056406P-F1 9 

lVXVv/UJv/ 1 +V/UV/.X' XJU 


I TC9 -PrimColon 

V7 V>Z» XT X XXIXVAJXV/XX 


1714 

1 / X*T 


7771 50 
/Z / 1JU 


1 700 P 1 4 cyy43 996007 
i / y\J.\s It-.^Z't J z,z.\jyy / 


F 


M00056391 r-Hl 2 


UC2-PrimColon 

wV^Z> X X UXlVswXUU 


171^ 
1 / 1 j 


50^173 
jUj X / J 


1 700 F 1 4 ot43 996090 


F 
x^ 


M00056394TVC1 1 

XVxV/V/V/ JV/ J J^tXJ.VxX X 


T JC2-PrimCo1on 

y~j v^z^ x x xxxxv-/V7xvjxx 


1716 


188309 


1790.N14.gz43 227008 


F 


M00056404B:B06 


UC2-PrimColon 


1717 


448758 


1790J15.gz43_227020 


F 


M00056399B:G09 


UC2-PrimColon 


1718 


735234 


1790.L15.gz43_227022 


F 


M00056401B:B01 


UC2«-PrimColon 


1719 


483147 


1790.P15.gz43_227026 


F 


M00056407A:D10 


UC2-PrimColon 


1720 


642985 


1790.G16.gz43_227033 


F 


M00056396B:E10 


UC2-PriinColon 


1721 


725451 


1790.K16.gz43_227037 


F 


M00056400A:G10 


UC2-PrimColon 


1722 


729809 


! 1790.P16.gz43_227042 


F 


M00056407A:G03 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


179"? 


495712 


1790.A17.gz43 227043 


F 


M00056389D:F01 


UC2-PrimColon 


1794 




1790 017 ez43 227057 


F 


M00056405D:E05 


UC2-PrimColon 


1795 


457508 


1790 C18 az43 227061 


F 1 


M00056391D:E06 


UC2-PrimColon 


1796 


417549 


1790 Ml 8 ez43 227071 


F 


M00056403A:E09 


UC2-PrimColon 


1797 

X / £» 1 


795454 


1790 E19 &z43 227079 


F 


M00056394B:G05 


UC2-PrimColon 


1798 


448817 


1790 C20 ez43 227093 


F 


M00056392A:F06 


UC2-PrimColon 


1790 


79641 1 


1790O20ez43 227105 


F 


M00056406A.C02 


UC2-PrimColon 


1710 

X / D\J 




1790D21ez43 227110 


F 


M00056393D:A09 


UC2-PrimColon 


1711 

X / D 1 




1790P21ez43 227122 


F 


M00056409A:C03 


UC2-PrimColon 


1719 


711641 


1790N22ffz43 227136 

X / fciMi *^ C i~ * ir.r.riir # -L — ' vy 


F 


M00056404CC12 


UC2-PrimColon 


1711 


523364 


1790F23sz43 227144 


F 


M00056395C:H03 


UC2-PrimColon 


1714 


468467 


1790 M23 sz43 227151 


F 


M00056403B:D09 


UC2-PrimColon 


1715 


402799 


1790 023 ez43 227153 


F 


M00056406A:E01 


UC2-PrimColon 


1716 


470469 


1790 P23 ez43 227154 


F 


M00056409A:D03 


UC2-PrimColon 


1717 


1 18514 

X X O*/ X*T 


1790 D24 ez43 227158 


F 


M00056393D:C12 


UC2-PrimColon 


1718 

X / DO 


719821 


1790 F24 ez43 227160 


F 


M00056395D:B01 


UC2-PrimColon 


1710 

X IDs 


447999 


1790 K24 sz43 227165 


F 


M00056400C:B11 


UC2-PrxmColon 


1740 


790770 


1790 M24 ez43 227167 


F 


M00056403B:G12 


UC2-PriinColon 


1741 


797611 

1 jL 1 \JDD 


1790P24ez43 227170 


F 


M00056409A:D06 


UC2-PrimColon 


1749 


475707 


1791 F01 sz43 227176 


F 


M00056415A:D03 


UC2-P3rimColon 


1741 


648 1 70 


1791 G01 ez43 227177 


F 


M00056416B:B11 


UC2-PrimColon 


1744 
x / 


479684 


1791 101 ez43 227179 


F 


M00056418A:G09 


UC2-PrimColon 


1745 


477718 
*+ / / / 10 


1791 O02 ffz43 227201 

x / J 7 x . Vy 7 vy ^» fitr / a»v/ x 


F 


M00056423D:H07 


UC2-PrimColon 


1746 
X / 4 tO 


790784 


1701 A01 <*741 927203 


F 


M00056409B:C12 


UC2-PrimColon 


1747 


409471 


1791 F01 az41 227208 


F 


M00056415B:B10 


UC2-PrimColon 


174ft 


451769 


1791 K03 ffz43 ?27213 


F 


M00056419D:G04 


UC2-PrimColon 


1740 


5 11 1 40 

J X X l*t? 


1791 M03 ez43 227215 

X / 7 X . IVXVy^y ,ff t rtT»s ,r tf • <a>1*/ 


F 


M00056422B:A08 


UC2-PrimColon 


17^0 
1 /DV 


797000 


1701 (t04 07-41 227225 


F 


M00056416B:H01 


UC2-PiimColoii 


17^1 


^600^7 


1701 K 04^41 227229 


F 


M00056420AC0 1 


UC2-PrimColon 


1759 


797845 


1791 M04 ez43 227231 


F 


M00056422B:C12 


UC2-PrimColon 


1 7^1 


649 1 70 


1791 H05 ^743 227242 

1 / y 1 .XXV J .gf&rrD / » 


F 


M00056417BD05 


UC2-PrimColon 


17^4 


198740 


1791 K05 ffz43 227245 


F 


M00056420A:C12 


UC2-PrimColon 


1 7^ 


i 716619 
/ JDOJ2> 


1701 P05 <r741 227250 

X / 37 X . X \y-y .gii*tJ # /V/ 


F 


M00056424DC08 


UC2-PrimColon 


17S6 
1 / JO 


549057 


1791 107^743 227276 


F 


M00056419A:F08 


UC2-PrimColon 


17^7 
X ID 1 


711400 


1701 O08 or41 227297 


F 


M00056424A:F05 


UC2-PrimColon 


1758 


649722 


1791.O09.gz43_227313 


F 


M00056424A:F12 


UC2-PrimColon 


1759 


562137 


1791.Lll.gz43 227342 


F 


M00056421C:E06 


UC2-PrimColon 


1760 


729199 


179LG12.gz43_227353 


F 


M00056416D:C08 


UC2-PrimColon 


1761 


730178 


1791.L12.gz43 227358 


F 


M00056421C:E12 


UC2-PrimColon 


1762 


435349 


1791.G13.gz43 227369 


F 


M00056416D:D12 


UC2-PrimColon 


1763 


723963 


1791.P13.gz43 227378 


F 


M00056425B-.B02 


UC2-PrimColon 


1764 


725408 


1791.H14.gz43 227386 


F 


M00056417C:G10 


UC2-PrimColon 
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SEQ 
ID NO 


CLUSTER 


SEQ NAME 


0RIEN 
T 


CLONE ID 


LIBRARY 


1765 

A i \J*J 


500253 


1791 L14 ez43 227390 


F 


M00056421CH01 


I JC2-PrimCn1nn 


1766 

A / VJVJ 


644723 


1701 N14 ez43 227392 


F 


M0005 6423 C-G05 


T TP9-Prim Pnlnn 


1767 


474065 

*T / *T7 VJ_J 


1701 M15 ot43 927407 

1 / 7i ,1V1 X .gZIO ZZ / tv/ / 


X 


M00056422r , R1 0 


T Tf , 9-PrimP^1nn 

\J V^Z A 1 ALllV^wJLvJll 


x i \jO 


794094 


1701 >J17 o^43 927440 

1/71 .IN x i .gZ'rO AA / *t*tU 




M00056423TV1306 


Vj wZ A 1 ill iV_xUlUil 


1760 

1 /VJ7 


9708^ 


1701 F>1 8 oy43 997446 

X / 7 X .XJ X O ,gZ*fO ZZ / t*tO 


P 


M0005641 3R-Tj07 


\J \_yZ A 1 Jill VyUlVJll 


1770 

1 / IK) 


790800 


1701P18o-743 997447 

1/71 - H X O .gZHO ZZ / / 


P 


A/T000564 1 4C 1 PT08 
ivxv/V/v/ j ohi tv.nu o 


T TP9 -Prim OcA rtn 


1771 
i / / 1 


798086 

/Z070U 


1701fr18o743 997440 

X / 7l.\J10.giCi*TJ AA /*t*r7 


p 

x^ 


M0005641 6D-H1 0 

IVlUUvJ *J*T A \JX\J .A A A \J 


VJ VwZ A 1 AllJLv^VVl.V/11 


177? 
i / / z 


640744 


1701 A10O743 927450 

1/71 .A17.g^*tJ ZZ / tJ 7 


p 


M0005641 0 A* A04 


VJ\_/^ JT 1 AlllV^vPlLrAl 


1773 


703078 


1701 R10 o-743 997460 

1/71 ,X> X ~ .gZ.tJ ZZ / *tUV/ 


P 
X 


M0005641 IFiDOl 


T TP9-Pritnf*nlr»n 

VJ V/Zi A ± ALliV^v/lAJAl 


1774 


79S408 


1701 010 07-43 997473 

1 /7l.V7l7.gi'rJ ZZ / *t / J 


P 
1 


M00056494r , R1 0 


T TP 9 -Pri m Pnl on 

U V>Z JT 1 AlllV^Ulv7AJ. 


1775 


795407 


1701 1-T90 0743 997482 

X / 71 .xxA\J .g£rr~> ZZ 1 tOZ. 


P 


lVf 0005641 71^-000 


T TP9-PrimPn1nn 

KJ 1 1 AIA1 KsKJXSJlX 


1776 


736S07 


1701 1C91 0743 997501 

1 /7l.AkZX.g^ < TJ Z.Z,/ Jul 


P 

A ! 


M00056490r-F08 


T 1(^9^11111^01011 
VJ V-//« A J. ALIIV^UIVJU 


1777 
x / / / 


JJJJJ7 


1701 D91 ot43 997505 


P i 
A 


1^/100056424^^09 




1 778 
x / /o 


79477^ 
/ A*r 1 / j 


1701 P9? o-74^ 997S00 

1 / 7X «.\^ZZ.gZTO ZZ / JV77 


P 


M000S641 913 'R08 

1VXU Vy V/ J t»*T 1 Z Jj . A7> \J O 


T TP 9 -Prim Pr»1 nn 


1 77Q 

X / /7 


794990 

/ A^tAAV 


17Q1 P99 074^ 997S99 

1 / 71 AA .jSyA'+D A A 1 DAA 


p 


M000 < S649^Tj*D08 

IVlUvvJUT'ZJl/.l/l/O 


T TP 9 -Prim P^ilnn 

VJ V-/Z A 1 llIlv^UlVjll 


1 780 
X / oU 


641 ^8** 


1701 A9^ o^4^ 997S9^ 

l/7l .AZj -gz*f O _ZZ 1 DAD 


p 


M0005641 OR- A00 

lYAVJ V/l/«-' 0*T 1 \J A> ./^.V/7 


T TP9-PrimPn1on 


1781 
1 /o X 


^69^81 
jozjoi 


1701 TT9^ ot43 99753^ 

X / 7l .XS^ZrO .gZH-^ AA I jj j 


P 

X? i 


M000 5 649 OTj • A 00 


T TP9-PrimPi , ^rm 


1789 
X /oZ 


796768 
/ZO /Oo 


1701 P9** ot43 997^^8 
X / z? X .x A3 .gZH^ ZZ 1 DDq 


p 


M0005649SDR01 

lVXV/ v/U J U*r/.J XJ .nw A 


T TP9-PrimPn1nn 

VJ V_/Z A 1 11 1 J.\_s VJ1 VJ Al 


1 78^ 


790^70 

/Z7J #7 


17Q1 TT94 o-7-4^ 997*546 

1 / 7l .nZ'r .giCr^ AA I J*tO 


p 


M0005641 8 A -001 

1t1UwJU*t A 0/V.VJv X 


T TP9-PrimPn1nn 

\J wZ A 1 11 1 1 V-/ vJlVJll 


1784 
X / of 


408 1 Q4 


1701T94ot43 997550 


p 


M00056499 A-F07 


T TP9 -PrimP rklnn 

UViA 1 1 Ull\w>v/lv/ll 


178^ 
X JoD 


9881^4 


1701 1SJ94 ot43 997S59 


P 


M00056493TTF1 0 


T TP9-PrimPnlnn 

kJ X 1 llUV^V/lV/ll 


1 786 


5S3083 


1709 P01 ot43 997582 


p 


M00056486BCt02 


T TP9 -Prim Pol nn 

VJV/ii X lllllV^V/lV/11 


1787 
x / o / 


794400 


1 709 F03 ot43 997603 


P 

X7 


M00056433R- A00 

XVA v V/vr «^ v^T«7 -7 A/.r\v 7 


T TP9-PrimPnlon 

KJ V/Z> X lllilV^V/lvUl 


1 788 


64S476 


1709 T05 o^43 997640 

X / 7Z.J vJ-J .^Z*T«7 ZZ / Ut"v 


P 
A 


M00056470C-F04 


T TP 9 -Prim Pol nn 

vJ V^Z X 1 HllV^xAlv711 


178Q 

1 / 07 


5S61 1 5 


1 709 FI06 074^ 227650 

X / 7Z. .l/UU.gL"TJ ZZ / VJJU 


P 
A 


M00056432A* AO 1 


T TP 9 -Prim Pnl nn 

\J X 1 AlllV^WlV^AL 


1700 

X /7U 


7^4987 


1 709 "F06 074^ 997659 


P 


M00056434C T?0 1 


T TP9-PrimPnlnn 

KJ V/Z X 1 1111V_xV71U11 


1701 
1 /71 


795S84 


17Q9 T07 oa4^ 997671 

X /7Z.1U / .gZHO ZZ/D / X 


P 
X^ 


A/f00056478f 1 -PI01 


T TP9 -PrimPnlnn 

KJ V/Z—X 1 lill\->\JlVJll 


1 7Q9 

X /7Z 


79^006 


1709 P07 o^4^ 997678 
X / 7Z.rU / ,gZ*rj ZZ / D / o 


p 

1? 


A/100056486n- A 1 9 

1VXU UUJ Ut O wJ-/ .xTl X z 


T TP9-PrimPr»1nn 

KJKsA X 1 AlllV^VJlVJll 


1 70^ 

x /to 


7^1 1 ^8 
/3X Ijo 


17Q9 AHO otA^ 99760^ 
X /7Z»/VU7.gZ*tJ ZZ / 07^> 


F 


A400056496T^rr1 9 


T TP 9 -Prim Pnl nn 


17Q4 


<Q9 1 ©6 


1709 R0Q o^43 997606 

X / 7Z.J3V/7.gZ*IO ZZ / vJ7vJ 


P 


M0005 649 8 A *F 1 0 

1V1Uv/WJVJT'Z,0/V.1 av/ 


T JP2-PriTTiPoion 

KJ V^Z A 1 Alll\^/V7lV/ll 


17Q^ 

X /7J 


4^961 8 


1 7Q9 TV/fOO o74^ 997707 
X / 7Z .MU".gZ*t j ZZ / / U / 


P 
r 


M00056483AF10 


T TP9 -PrimPnlnn 

VJ v/Z A 1 JlilV^UllJll 


1706 

X /tu 


7959^8 
i AD Ado 


1 709 f^l O 074^ 997717 

X / 7Z.VJX U.gZT-O ZZ / / 1 / 


p 


lV/T00056476rF05 


T TP9 -Prim Pnl nn 


1 7Q7 

x /y / 


797Q48 
/Z / 7*fo 


1709 PIO rr^4^ 997796 
X / Tz.r Xtl.gZHJ ZZ / /ZO 


A 1 


IV/IOOO ^ 6486r>*P04 
iVXUv/U J vTH oDX-r . r v/*r 


T TP9-PrimP/^1nn 
KJ KsA—ITx 1H1V-/VJU .'11 


1 7oa 

X /70 


4^9081 
HOZ70 1 


1709 All Cr'rAI Willi 
X /7Z.A1 1 ,gZ*fO ^ZZ/ /Z/ 


17 


AifOOO S 649 6P -frO S 


T TP 9 -Prim Pnl in 
KJ \sA~x lllllv^i*!^ 711 


x /77 


710^69 


1709 "R1 1 o-7>4^ 997798 
X /7Z.15X X.gZHO ZZ / /ZO 


17 

A* 


A^OOOS 649 8 R-P10 1 

lYXv/Uv/«J Q*rZOA>.A>l/ X 


T TP9-PrimPn1rjn 


1800 


480005 


1792 Cll ez43 227729 


F 


M00056429D:E02 


UC2-PrimColon 


1801 


471931 


1792.Pll.gz43_227742 


F 


M00056487A:C10 


UC2-PrimColon 


1802 


725454 


1792.H12.gz43_227750 


F 


M00056477C:G09 


UC2-PiimColon 


1803 


729584 


1792.112.gz43_227751 


F 


M00056478D:G02 


UC2-PrimColon 


1804 


725210 


1792.N12.gz43_227756 


F 


M00056484B:E11 


UC2-PrimColon 


1805 


462687 


1792.P12.gz43_227758 


F 


M00056487A:D01 


UC2-PrimColon 


1806 


724183 


1792.Ml3.gz43_227771 


F 


M00056483B:E04 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


OWEN 
T 


CLONE ID 


LIBRARY 


i ftn7 
loU/ 




1709M14o74'3 7777X7 


JL 


M00056483B-F04 


UC2-PrimColon 


1 ftftft 
loUo 


04/3 / J 


1709 fr16 0^43 9978T3 


F 


M00056476DH1 1 


UC2-PrimColOxi 


18U9 


7Q6751 
/DO ID 1 


1707 fr17 997890 


F 


M00056477A*B1 1 


UC2-PrimColon 


i o i n 
181U 


/Z.};545 


1 707 P 1 7 0-74 3 997838 


F 

M. 


M00056487AF05 


UC2-PrimColon 


1 Q 1 1 

loll 


/iQi <ro/t 
4513^4 


1707 T 18 or-74.3 797850 


F 


M00056482BB11 


UC2-PrimColon 


lolZ 


/£7^9QQ 


1709 "NT1 R oWil 997852 


F 


M00056484DDO 1 


UC2-PrimColon 


1813 


734646 
1 J4040 


1 709 1 0 o^n 977865 


F 


M00056481AF02 


UC2-PrimColon 


1 ft 1 A 

1814 


433 / lO 


1709 T 90 cr74^ 777887 


F 


M00056482BC09 


UC2-PrimColon 


1 ft 1 ^ 

1513 


79/11 83 


1709M90O743 277883 


F 


M00056483C:D06 


UC2-PrimColon 


lolD 




1709M94a74^ 997047 


F 


M0005 6483D F06 


UC2-PrimColon 


ion 
1<S 1 / 


733006 


1801 "H01 074^ 777054 

IOU 1 ,XJ\J L .gZrT-? / 7J*T 


F 


M00056491BB09 


UC2-PrimColon 


1818 


7331 5 1 


1801 T01 074^ 997050 


F 


M00056496DB12 


UC2-PrimColon 


lOin 

loiy 


73/1961 
/34Z01 


IftOI TOI av&X 997060 

loul .JU1 .gZ**0 ZZ /70U 


F 


M00056497CD05 


UC2-PrimColon 


182U 


CHI /IO 

D 14 142 


IftOI IfOI o^43 997061 

loUl .JvU 1 .gZHO__ZZ / !70 1 


F 


M00056498DC01 


UC2-PrimColon 


1821 


/I •71737 

4/1232 


IftOI A./TH 1 rr-r43 997063 
1 Oil 1 .1VLU 1 .gZ*r3 ZZ / fOO 


F 

JL 


M00056501BB09 


UC2-PrimColon 


1822 


44/ /oj 


i am Tin 9 0^43 997070 

loUl .L/UZ.gZTO ZZ/^/W 


F 


M00056491BE02 


UC2-PrimColon 


1823 


523/53 


1 ftm Tfl7 rr«r43 997076 
loUl J UZ.gZ4 3__ZZ /¥ JO 


F 


M00056497CE01 


UC2-PrimColon 


1824 


451383 


ioni A /in 9 n~743 997070 
loUl.lVIUZ.gZ £ f3_ZZ /¥ l¥ 


F 


M00056501BC07 


UC2-PrimColon 


1 004 

1825 


o35951 


Iftfll ITli3 rr-74.3 997Q03 


F 

JT 


M00056499AA04 


UC2-PrimColon 


1 Ol^ 

1826 


/3182U 


1 SHI AH /I fY-r43 997Q00 

l oui .AU4.gZ4.j_zz /yyy 


F 


M00056487CE03 


UC2-PrimColon 


182/ 


44o92o 


1801 T70A crvAl 99800^ 


F 


M00056492B 4 D06 


UC2-PrimColon 


1828 


T'JO^Oft 


1»m P0A rr«7ZL^ 99 80 14. 


F 

X 


M00056504BB01 


UC2-PrimColon 


i oon 
1829 


/25434 


18H1 "R05 fT-zA^ 998016 
loUl .gZ'f-j ZZoUlO 


F 

X 


M00056488BG10 


UC2-PrimColon 


1830 


4oUoyu 


18H1 nO^ rr^J.^ 998091 
1 oU 1 .VJU3 .gZ4 J ZZOUZ 1 


F 

X 


M00056495AC02 


UC2-PrimColon 


1831 


£11 AHA 
Ol 10U4 


IROI TO 5 err J. Q 998094 


F 

X 


M00056497DC 1 1 


UC2-PrimColon 


1832 


r^co<; i 
03DiO 1 


1 SOI T 05 a-jAJX 998096 


F 

X 


M00056500AG12 


UC2-PrimColon 


1833 


49G1 Ql 
4ZV iy 1 


1 801 "R06 9780^7 


F 

X 


M00056488BH04 


UC2-PrimColon 


1834 


04U1 lO 


I 1801 006 ot4^ 998045 


F 

X 


M00056503BG1 1 


UC2-PrimColon 


1 ftQ< 

183D 


73/1043 


181^M05ot'4^ 990558 
1513 XVJO .gZ*T-> ZtJLryDu O 


F 


M00056576A A04 


UC2-PrimColon 


183o 


3U5Z1U 


1 o 1 o TH6 ot4^ 990575 


F 


M00056577AF10 


UC2-PrimColon 


183 / 


1 1639 
1 IOjZ 


ion H07 ot-4^ 990507 


F 

X 


M00056585CC08 


UC2-PrimColon 


1838 


/17^757 
4/3 /3 / 


1 ft 1 M08 <r-74^ 990606 
l o 1 -5 .jnuo ,gz*T-? z.z,7vjuo 


F 


M00056576AE01 


UC2-PrimColon 


1839 


TOGO ft 1 

/292o 1 


1«n Tin otA^ 9906J.0 
lo 13. J lU.gZnO ZZ^vrfU 


F 


M00056578D A02 


UC2-PrimColon 


1840 


/339/U 


1 ft 1 1 *M 1 fl €rv&.1 99064.4 
1 o 1 3 .IN 1 U ,gZ43_ZZ70*t i f 


F 


M00056584CA06 


UC2-PrimColon 


1 O/f 1 

1841 


727410 


ion a iq cxtA.'X 99Q670 
1813 .A 1 3 .gZ43_ZZi/0 / f 


F 
r 


M00056567A-H06 


UC2-PrimColon 


1842 


465446 


1813.L13.gz43 229690 


F 


M00056581D.A08 


UC2-PrimColon 


1843 


511351 


1813.M13.gz43j229691 


F 


M00056583CA05 


UC2-PrimColon 


1844 


1 733664 


1813.120.gz43_229799 


F 


M00056577D:F08 


UC2-PrimColon 


1845 


548893 


1813.C22.gz43 229825 


F 


M00056570A:D09 


UC2-PrimColon 


1846 


513540 


1813.H22.gz43_229830 


F 


M00056576C:G01 


UC2-PnmColon 


1847 


725759 


1813.I22.gz43 229831 


F 


M00056578AB05 


! UC2-PrimColon 


1848 


726104 


1816.C19.gz43 230929 


F 


M00056633D:E05 


UC2-PrimColon 



116 



1NSDOCID: <WO 0214500A2_I_> 



WO 02/14500 



PCT/US01/25840 



Table 2 



SEQ 
ID NO 


CLUSTER 


SEONAME 

ATX VLT III/ 


ORIEN 

T ' 


CLONE ID 


LIBRARY 

XjXJJXVTVXV. X 


lo**y 


*foZoOo 


1 ft 1 A T7 1 O rt^A 3 77A071 

ioio.iiiy.gz43_Z3uy3i 


r 


A >T A A A < /C 3 O .XJA < 

MUUUjoo3oL..rlUD 


UCz-rnmColon 


1 ojU 


777737 


1 ft 1 A T1 O r*-rA3 73AQ7< I 

is io.ny.gZH*3_z3uy33 


Jr 


JV1UUUD oo4 1 CijrUy 


U Cz-JrrimColoii 


1 ft< 1 
Ioj I 


777A7ft 
/Z /U/o 


1 ft 1 IV 1 O rrrrA 3 73AA37 

i o lojviy .gz43_Z3uy3 / 


r 


IV X A A A ^ /I /! . "D A 3 

MUUU3oo44C:r> , U3 


U Cz-rrimColon 


1 ft^7 
153Z 


7771 Aft 
/Z / lUo 


1 ft 1 A T 1 O Q 73AQ3ft 

lo io.j-,iy.gz43_Z3uy3o 


r 


A/f AAA^/C/C/1 <T\.t7A/C 


U Lz-F nmColon 


1 ft^7 


73/£77^ 
/303Z3 


lo Jo.Jt>ZU.gZ43_Z3Uy44 


r 


X>TAAA'C/C/C37/^.17A3 

M.UUUD oo3 Zv> . JtiU3 


U L/z-l'iimColon 


1 ft<JA 


77/Cft -I o 
/ZOolo 


IQi /rr T7A rr-yA3 77AG^7 

loioj zu ,gZHo__z j Uy 3Z 


-p 


A R A A A ^ d ti A 7 T> . A ^ 

JMUUUD O04Z JJ . "CUD 


u cz-Jr rimuoion 


lOJJ 


70AAft7 

/ZhUoj 


lo 10.1VxZU.gZ*0 Z3uy33 


r 


JVlUUU3O04ul-l.xiU3 


ucz-Fnitiuoioii 


1 ft^A 
IojO 


77A^a^ 

/Z*003 


i o i o . in zu . gzn- 3__z 3 uy 3 O 


r 


A/rnnA^/^^Aftn-r^Aft 

JVlUUUD004olJ. , oUo 


U UZ-Jr nmuoion 


i ft^7 


7A7^7A 
ZU /33U 


1 ft 1 A nOH twAI 77AQ^7 

lo io.uzu.gz*f3__Z3uy3 / 


17 


A/fAAA^^^^A A *I7A1 
JVLU U U3 O OD U A . JtiU 1 


u ^z-Fniiicoioii 


lo Jo 


736778 
/ jD/ / O 


1816 r*91 cr<r43 93HQ61 

io lo.VxZi.gz^o zouyoi 




1V1U U U D O O D *f A. ^ U O 


TTP7 DrimPnlrkn 






1816T771 crrrAI 73AQ67 
lO 10.JCfZl.gZ , 4-J_Z3Uy03 


17 


A/f AA A ^ A fS\ AFI • T7A7 
IV1UU U3 O 03 OU.r Wj 


u uz-x^nnii^oioii 


loOU 


77AAQA 
/ zo**y*i- 


1816 1377 crrrAI 77A076 
1 0 1 O.Jt5ZZ.gZ*t 3_z3uy / O 


X* 


Avf A A A ^ A <7 7 P Una 
1VIUUU3 003Z , CrlUo 


u ■uz-Fnin'uoion 


1 ft£1 
loo 1 


A! 071 1 

*tiy /i i 


1ft16#T-77 rr-rA3 77AGft1 

i o l o.vjzz.gz*f 3_Z3uyo i 


1* 


A>TAAA<IAA30 A «171 1 
JVlUUU3o , o3yA.r 1 1 


u L^z-FninuoiOxi 


1 ft£7 
loOZ 


377Q3* 

3 / /y33 


1ft16A/T77 rrrr/13 03AQft7 

lolO.JYLZZ.gZ43_Z3Uyo / 


X* 


A/TA A A< A .17A7 

iviuu U3oo4oJJ.ru / 


U uz - J'rini'Coion 


1 ft£7 
1503 


77 £ A Aft 
/ZO*f*fo 


1 ft 1 6 P77 crnrAl 77A007 

i o i o^zo .gz <c f^_z3uyy3 


r 


A/TAAA^AA7A A*f2-1 1 
lVLUUU3O03' £ fA.vJl 1 


TJP7 l>««i/'VJ*vr» 

U v-/Z-r*nmL/01011 


1 ftAA 
lo04 


71 ^7 AO 

zi3Z4y 


1 ft 1 lT-77 rr»rA7 77AQ07 

i o i o . vjz d .gz/\o_A5 uy y / 


17 


A>TAAA^AA30 A U1 7 
JVlUUU3 , oo3yA.xllz 


U Uz-JrrimUoion 


1 ft£< 
lo03 


77<i777 i 


Ift1/CXT77 rrrrAI T31AA/1 

lolO.JNZo ,gZ4o_Z31UU4 


17 

r 


A/TAAA</C/C/I 0 13.17 AO 

MUUU3 oo4ol>:liUy 


UCz-Friinuolon 


1 ft££ 
loOO 


7771 7^ 
/Z /l /3 


iQi/T r\77 r***A1 77 1 AA< 

1 o 1 0.UZ d .gZ*¥3_2,3 1UU3 


r 


JVlUUU3oo3Uo. , Ul 1 


U Uz-r nmColon 


1 ft£7 

loo/ 


73 1 Cl£LC 

/3iyoo 


1 o 1 o.rz 5 ,gz43_z3 1 UUo 


17 

r 


A/TAAAC/C/IC 1 "D.T7 1 7 

M0UU3OO3 lx>.lilz 


UCz-FrimUolon 


1 OAS 

loOo 


777A/I /I 

/Z/y44 


1 Q 1 ^ 157/1 /—/la 731AAQ 

lolo.15Z4.gZ4o Z^lUUo 


17 

r 


A>TAAA^/C/C37Tfc.l7A7 

MUUU3oo3ZxJ:liUZ 


TTi^'7 T>«»«/^^1^.« 

UCz-rrimColon 


1 ft£Q 

looy 


73A<tax 
/3U3yo 


1ft1/CTVyf7/l rr^/lQ 731A1Q 

1 o 1 0 . 1VIZ4 .gz43_Zo 1 U 1 y 


17 

r 


A>f AAA-C/C/C/f 7 A -T7A/C 

MU , UU3oo4 /AlrUo 


U Cz-FnmColon 


1 ft7A 

lo/U 


77<j/r/\7 
/Z3oU/ 


loZD.JN LD.gZ/\5_Z3 IZZo 


17 

r 


ATAAA^/CX/C7 A • A 1 7 

JV10UU3OOO /A.A1Z 


U Cz-r nmColon 


lo /I 




1 07C T>13 sv~/13 73 173 A 

Ioz3.1 j 13.gz43_z31z30 


17 

r 


A/TAAA^*C/C7A A ./Tl/lO 

M0003oo70A:Uvz 


U Cz-FrimColoii 


1 C77 
lo /Z 


4o4yo / 


lozD.r 14.gz43_z31Z3o 


r 


A/TAAA<C^C/C<0 A .CI 1 

MUUU3oo3oA:xil 1 


UCz-rnmColon 


lo/3 


4ol3yz 


1Q7C "NvTI A ~~A 1 07 17/13 

1 oz3 .JVl 14.gz43_z3 lz43 


17 

r 


A VTA AA £ £L£L£L£L A »C^t\A 

MUUU3ooooA:Cu4 


UCz-FnmColon 


lo/4 


73</ia^ 
/33oU3 


iooc /"\1/l rtrT /10 7317/1^ 

loz3.U14.gz43_z3 1245 


17 

r 


A IC€\ r\C\eZ£L£L£ QTV17A3 

M0003 ooo oU :li03 


UCz-rnmColon 


1 87< 

lo /D 


/341S4 


1 oZD .ir 14.gZ43_Z3 lZ4o 


17 

r 


A/TAAA^i'C/C7A'I3. A 1 7 

MUUU300 /Uo JV1Z 


u Cz-FnmiJoioxi 


lo/o 


A 7771 O 

4/ / /lo 


lOOC Tji/: o 73 1 7/C7 

1 oZ3 .Jb lo.gZ43_z3 lzo / 


17 

r 


AiTAAA</C/C<7 A .17A/C 

MuUu3oo3 / AlrUo 


UCz-rnmUolon 


1 577 

lo// 


33zyz / 


1 oZj .r 1 0.gZ43_Z3 lZOo 


17 

r 


A>TAAA^/iii^ft , D-/^A3 

iVlUUU3 003 ol3 . CU3 


ULyZ-Fnmuoion 


1 ft7ft 
lo/o 


71 Q/C7n 
/ lyoZU 


1ft7^ T1A 771777 
loZD.J !o.gZ43_Z31Z/Z 


17 
r 


A>TAAA^ fiddl A •"DA7 
JVIUUU3 0003 A.UUZ 


U ■UZ-Fnitl'oOlOll 


1 ft7Q 

lo /y 


77Aft7 

Z /Uoj 


1 ft7<* A/fl A /vrrA7 73177^ 
1 oZj .iVL10.gZ43_Z3 1Z #D 


17 

r 


A/rAAA^AAAA a nno 
JVlUUU3O00OA.JLIUy 


UL/Z-FrxlXl-UOlOIl 


i ftftA 

looU 


A7ftA^ft 


197^ D17 rtT/lQ 0717QA 
1 oZj .ir 1 / .gZ43_Z3 1 Zy4 


r 


Ayf AAA^AA7ATJ«riA< 
iYlUUU300 /UU.VJU3 


T TOO DnmrAlAn 


i as i 

lool 


AAft 1 AA 
*f*folU*f 


lft7^ T 1Q *ytA3 731377 
loZ3.JLiiy.gZ43_Z3 13ZZ 


17 


A>TA A A *i & 6. £L ATVtfA A 

1YIUUU3 ooo4JL/.rlUo 


u L/Z-r nni'Uoioii 


1 ftft7 

looz 


770^7^ 

/zyo/3 


1ft7^ T7A rr»yA3 93.137£ 

1 oZj . J ZU ,gZ*f3_Z3 1 3 3 O 


1* 


A/f AAA ^ A^£L3 A UnQ 
JVlUUU30003A.rlUy 


u , oZ-Fnni , ooioii 


1 ftft3 
lOOJ 


77A^7A 
/3U3 /U 


1C7^ D71 sv**/13 7713/1/1 

1 ozj .JtJZ 1 .gZ43_Z3 1344 


17 

r 


A if A A A < /Z/Z < 3 T\ •!? A7 

JVlUuU3oo33JJ.rU/ 


ucz-Friixiuoion 


x oot 


7771 OR 


1 O^* J*MJ4* X .g£rtO ^jIjt'U 






UvvLTl IXIIV^UXLMX 


1885 


447074 


1825.121.gz43_231351 


F 


M00056661D:D06 


UC2-PrimColon 


1886 


463741 


1 825.F22.gz43_23 1364 


F 


M00056658C:B12 


UC2-PrimColon 


1887 


732351 


1 825.N22.gz43_23 1372 


F 


M00056667D:E05 


UC2-PrimColon 


1888 


727343 


1825.L23.gz43_231386 


F 


M00056665A:G08 


UC2-PrimColon 


1889 


736415 


1825.B24.gz43_231392 


F 


M00056654A:A07 


UC2-PrimColon 


1890 


554597 


1 825.024.gz43_23 1405 


F 


M00056669B:E10 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


1891 


723914 


1826.A13.gz43_231599 


F 


M00056672A-.E05 


UC2-PrimColon 


1892 


734622 


1826.C14.gz43_23 1617 


F 


M00056674D:E09 


UC2-PrimColon 


1893 


645925 


1826.114.gz43_231623 


F 


M00056683B:H04 


UC2-PrimColon 


1894 


641777 


1826.D16.gz43_23 1650 


F 


M00056676B:G12 


UC2-PrimColon 


1895 


734828 


1826.E17.gz43_231667 


F 


M00056678B:E11 


UC2-PrimColon 


1896 


381623 


1826.117.gz43_23 1671 


F 


M00056683C.H10 


UC2-PrimColon 


1897 

107 / 


517274 


1826.J17.gz43 231672 


F 


M00056685A:H11 


UC2-PrimColon 


1898 


446575 


1826.C18.gz43_23 168 1 


F 


M00056675A:C02 


UC2-PrimColon 


1899 


725899 


1 826.D 1 8.gz43_23 1 682 


F 


M00056676C:B04 


UC2-PrimColon 


1900 


492094 


1826.M19.gz43_23 1707 


F 


M00056689B:F03 


UC2-PrimColon 


1901 


736246 


1 826.E20.gz43_23 1715 


F 


M00056678B:H02 


UC2-PrimColon 


1902 


473238 


1 826. J20.gz43_23 1720 


F 


M00056685B:GO4 


UC2-PrimColon 


1903 


730829 


1826.K20.gz43_231721 


F 


M00056686D-.E04 


UC2-PrimColon 


1904 


726722 


1 826 . A2 1 .gz43_23 1 727 


F 


M00056672D:B08 


UC2-PrimColon 


1905 


492629 


1826.021.gz43_231741 


F 


M00056692AA05 


UC2-PrimColon 


1906 


730664 


1826.E22.gz43_23 1747 


F 


M00056678CA12 


UC2-PrimColon 


1907 


729387 


1826.G22.gz43_23 1749 


F 


M00056681AE06 


UC2-PrimColon 


1908 


546121 


1826.A23.gz43_231759 


F 


M00056672D-.E04 


UC2-PrimColon 


1909 


513156 


1826.C23.gz43_231761 


F 


M00056675A:GO2 


UC2-PrimColon 


1910 


728791 


1826.G24.gz43_231781 


F 


M00056681BA11 


UC2-PrimColon 


1911 


559610 


1826.124.gz43_231783 


F 


M00056684B:B12 


UC2-PrimColon 


1912 


729205 


1 826. J24.gz43__23 1784 


F 


M00056685C.G07 


UC2-PrimColon 


1913 


561301 


1827.C01.gz43_23 1793 


F 


M00056695C:C04 


UC2-PrimColon 


1914 


728556 


1 827.E0 1 .gz43_23 1795 


F 


M00056697C:H10 


UC2-PrimColon 


1915 

X ZS X ~> 


640458 


1 827.10 1 .gz43_23 1 799 


F 


M00056701C:G09 


UC2-PrimColon 


1916 


650136 


1 827. K0 1 .gz43_23 1801 


F 


M00056703C:F04 


UC2-PrimColon 


1917 

X S X / 


288801 


1827.L01.gz43__23 1802 


F 


M00056704D:H10 


UC2-PrimColon 


1918 


648777 


1 827.M0 1 .gz43_23 1 803 


F 


M00056706A:B05 


UC2-PrimColon 


1919 


728196 


1827.G02.gz43_231813 


F 


M00056699C:F09 


UC2-PrimColon 


1920 


644354 


1827.102.gz43_23 18 15 


F 


M00056701D:C03 


UC2-PrimColon 


1921 

X X 


419465 


1 827.M02.gz43_23 1819 


F 


M00056706A:E09 


UC2-PrimColon 


1977 


558317 


1827 P02 ez43 231822 


F 


M00056708C:F11 


UC2-PrimColon 




693869 


1827.B03.gz43 231824 


F 


M00056694AG03 


UC2-PrimColon 


1974 


553805 


1827.L03.gz43 231834 


F 


M00056705A-.B12 


UC2-PrimColon 


1975 


626791 


1827.N03.gz43 231836 


F 


M00056707AG11 


UC2-PrimColon 


1926 


580601 


1 827. A04.gz43_23 1839 


F 


M00056693B:D11 


UC2-PrimColon 


1927 


493830 


1827.B04.gz43_231840 


F 


M00056694AG11 


UC2-PrimColon 


1928 


724714 


1827.E04.gz43 231843 


F 


M00056697D:B09 


UC2-PrimColon 


1929 


737109 


1827.F04.gz43 231844 


F 


M00056698D.E11 


UC2-PrimColon 


1930 


395604 


1827.G04.gz43 231845 


F 


j M00056699C:G11 


UC2-PrimColon 


1931 


735834 


1827.J04.gz43 231848 


F 


M00056702D:G03 


UC2-PrimColon 


1932 


726852 


1827.L04.gz43 231850 


F 


M00056705AD02 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


1933 


728892 


1 827.P05 .gz43_23 1 870 


F 


M00056708DB03 


UC2-PrimCo1on 

V-r X X XllA V-r \Jn\JLX 


1934 


540618 


1827.C06.gz43_231873 


F 


M00056695CH08 


UC2-PrimCol on 


1935 


501030 


1827.E06.gz43 231875 


F 


M00056697DC09 


UC2-PriiTir i o1oii 


1936 


477366 


1827.F06.gz43 231876 


F 


M00056698D-G09 


I JC2-Pri mColnii 


1937 


727948 


1827.I06.gz43 231879 


F ! 


M00056701DD04 


UC2-PrimC , o1on 


1938 


649927 


1827.K06.gz43 231881 


F 


M00056703DF08 


VrV/Zi X X XXiiV^UXUXX 


1939 


449752 


1827.M06.gz43 231883 


F 


M00056706B-C02 


VVrir X X X11XV>\JXUXX 


1940 


649106 


1827.N06.gz43 231884 


F 


M00056707BE02 


UC2-PriiiiCol on 


1941 


730463 


1827.P06.gz43 231886 


F 


M00056708DD10 


UC2-PrimColon 

vJVri< X X UUVa' V7XWXX 


1942 


462687 


1827.E07.gz43 231891 


F 


M00056697D-C12 


UC2-PrimColon 

UV/il> X XXXXXV^v/Xv/XX 


1943 


553140 


1 827.107 .gz43 231895 


F 


M00056701DG03 

XTX v V / V/ XX^ . VJ Vffc/ 


UC2-PrimCo1 on 

\-r v_^x^ X X Uli^/v7X V/XX 


1944 


736100 ; 


1827.M07.gz43 231899 


F 


M00056706BC07 


TJC^-PrimColon 

UV/A X 1 XXlLV_>v/lv7lX 


1945 


730375 


1827 B08 sz43 231904 


F 


M00056694BH10 


T TP?-PriiYiP.ri1nn 
vj vri. r x xXftiv^v/xvJxi 


1946 


403671 


1827K08ez43 231913 


F 


M00056704A-B05 


V-/^» X X XXI 1 \_r v/X VJXX 


1947 


595181 


1827.M08.gz43 231915 


F 


M00056706B D04 


TJC2-Prinir , o1on 

U V/ii X X XXXXV^WFXv/XX 


1948 


573733 


1827A09ez43 231919 


F 


M00056693B-H03 

X TiU \J \J \J \J 7 »J XJ * X XV/ -/ 


T TC*9 -Prim Pol nn 

VJ V_r JL X X XI CXUII 


1949 


550454 


1827 109 ez43 231927 


F 


M00056701DH12 


I ir*2-Primr , olon 

vJV^Z- X X XXllV_rvXXUXX 


1950 


651050 


1827 J09 sz43 231928 


F 


M00056703 AGO 1 


T TC 0 -Prim Pnl on 

VJ V-'X. X X XXXXV^r VXXV/XX 


1951 


728768 


1827.L09.gz43 231930 


F 


M00056705B*A12 


I JC2-PrimCol on 

U Va< X X XXIX V/Ul V/XX 


1952 


736276 


1827N09sz43 231932 


F 


M00056707BF06 

ITXWVw' W / V f X^.X v/v/ 


IJC2-PrimColon 

V7 <b X X XXIX V^r \J XV/XX 


1953 


472837 


1827 CIO ez43 231937 


F 


M00056695DE05 


UC2-PrimColon 

VJ Y_rZ- X X XXllV_r V7XV/XX 


1954 


732770 


1 827 H 1 0 ez43 23 1 942 

x u«# / *x x x vy » c^A-t r w' - T * x -y rx*r 


F 


M00056700D-H07 

iTlvVw^'v / V/X/X^aXXV/ » 


T TP?-Primr , olon 

vV/Zi I X XXIIV^VJXV/X1 - 


1955 


83388 


1827L10ez43 231946 


F 


M00056705B T) 1 0 


T JC^-PrimColon 

V/V/A« X XXXllV^VXLvXXX 


1956 


584693 


1827 M10 ez43 231947 


F 


M00056706CA07 


T 1C 2-Prim Col on 

UV/^r X X XXllV_rVyXVIXX 


1957 


588059 


1827Dllez43 231954 


F 


M00056697AD02 


T IP 2-Prim Col on 

VJV-rXr X X X111V>V/X\JXX 


1958 


452662 


1827 Fll gz43 231956 


F 


M00056699AD08 

XTJLw W*-' </XX»Xr VU 


UC2-PrimColon 

V/V*4» X X XXXX V-r V7XV/XX 


1959 


728898 


1827 Mil sz43 231963 

X ^§ 9 m X WJL X X a * J *y A* X Vy *y 


F 


M00056706C-B12 


UC2-PrimColon 

VV/Zt X X XXII v^vsxv/xx 


1960 


672032 


1827.F12.gz43 231972 


F 


M00056699A-E08 

lTXvvU*/ V-rV7-/ X L«X/vU 


UC2-PrimColon 

VV/ti X X XXIX Va/V/X V/XX 


1961 


496084 


1827H12ez43 231974 

X WX« # *X X X X* «fj< < 1 X* *y x -y # » 


F 


M0005670 1 A* A02 


UC2-PrimColon 

V-r X X XXXXV^VJ XV/XX 


1962 


736579 


1827 012 ez43 231981 

x \j r « x x* i&u i«/ x^ *y X -y v X 


F 


M00056708BB09 


I IC? -Prim Colon 

V/V-/-6 X X XXX1V^V/XV/XX 


1963 


727968 


1827A13sz43 231983 


F 


M00056693C-D12 


T TC?.-PrimCo1on 


1964 


447624 


1827 J13 ez43 231992 


F 


M00056703B-A04 


T TC9-PrimCo1on 

KJ XT X XHlV^ViXV/XX 


1965 


728552 


1827K13ez43 231993 


F 


M00056704A-H08 


T TC 9 -PrimCnl nn 

V/VyA- X X X1HV_> V7XVJXX 


1966 


733064 


1827M13ez43 231995 


F 


M00056706C-D05 


I JC2-PrimCol on 

UvrAi X X XXIX VsVXXV/XX 


1967 


607202 


1827 013 ez43 231997 


F 


IVlVvV/JU I VOU.irVJ 


T TC ? -Prim C <M on 

KJ V^Z. X X XXLlV^V/xwxl 


1968 


473433 


1827.P13.gz43_231998 


F 


M00056708D:H06 


UC2-PrimColon 


1969 


617813 


1827.A14.gz43_231999 


F 


M00056693C:G06 


UC2-PrimColon 


1970 


728300 


1827.G14.gz43_232005 


F 


M00056700A:F12 


UC2-PriiiiColon 


1971 


727760 


1827.K14.gz43_232009 


F 


M00056704B:C02 


UC2-PrimColon 


1972 


454087 


1827.P14.gz43_J232014 


F 


M00056709AA05 


UC2-PrimColon 


1973 


425396 


| 1827.A15.gz43^232015 


F 


M00056693D:B02 


UC2-PrimColon 


1974 


552430 


1827.N15.gz43_232028 


F 


M00056707C:E01 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


1975 


650542 


1827.C16.gz43_232033 


F 


M00056696A:G12 


UC2-PrimCoIon 


1976 


725084 


1827.D16.gz43_232034 


F 


M00056697A:H10 


UC2-PrimColon 


1977 


446607 


1827.G16.gz43_232037 


F 


M00056700A:G05 


UC2-PrimColon 


1978 


4584 


1827.H16.gz43_232038 


F 


M00056701AG09 


UC2-PrimColon 


1979 


560678 


1827J16.gz43_232040 [ 


F 


M00056703B:D07 


UC2-PrimColon 


1980 


730639 


1827.N16.gz43_232044 


F 


M00056707C.E03 


UC2-PrimColon 


1981 


727506 


1827.C17.gz43_232049 


F 


M00056696B:B03 


UC2-PrimColon 


1982 


540618 


1827.E17.gz43_232051 


F 


MO0056698B:ElO 


UC2-PrimColon 


1983 


735676 


1827.117.gz43_232055 


F 


M000567O2CA1O 


UC2-PrimColon 


1984 


728768 


1827.N17.gz43_232060 


F 


M00056707C:F10 


UC2-PrimColon 


1985 


475942 


1827.A18.gz43_232063 


F 


M00056693D:E12 


UC2-PrimColon 


1986 


631056 


1 827 .HI 8.gz43_232070 


F 


M00056701B:D02 


UC2-PrimColon 


1987 


259218 


1 827.P 1 8 .gz43_232078 


F 


M00056709A:H11 


UC2-PrimColon 


1988 


724489 


1 827.H19.gz43_232086 


F 


M00056701B:D06 


UC2-PrimColon 


1989 


727845 ; 


1827.D21.gz43_232114 


F 


M00056697C:D11 I 


UC2-PrimColon 


1990 


641072 


1 827.F2 1 .gz43_232 116 


F 


M00056699B:G04 


UC2-PrimColon 


1991 


553483 


1827.A22.gz43_232127 


F 


M00056694AC05 


UC2-PrimColon 


1992 


551096 


1827.C24.gz43 232161 


F 


M00056696B:H01 


UC2-PrimColon 


1993 


727649 


1 827 .F24.gz43_232 1 64 


F 


M00056699C:C02 


UC2-PrimColon 


1994 


728196 


1 827.H24.gz43_232 166 


F 


M00056701C:F08 


UC2-PrimColon 


1995 

X -J 


729428 


1 828 .AO 1 .gz43_232 1 75 


F 


M00056709C:F06 


UC2-PrimColon 


1996 


724050 


1828 .F0 1 .gz43_232 180 


F 


M00056714B:C12 


UC2-PrimColon 


1997 


732351 


1828.L01.gz43_232186 


F 


M00056721C:E05 


UC2-PrimColon 


1998 


730627 


1 828 .MO 1 .gz43_23 2 1 87 


F 


M00056722C:C09 


UC2-PrimColon 


1999 


728182 


1 828 .H02 .gz43_232 1 98 


F 


M00056716B:F12 


UC2-PrimColon 


2000 


727878 


1828.M02.gz43_232203 


F 


M00056722C:D11 


UC2-PrimColon 


2001 


462687 


1 828.N02.gz43_232204 


F 


M00056723C:C09 


UC2-PrimColon 


2002 


555763 


1 828 .L03 .gz43_2322 1 8 


F 


M00056721C:H01 


UC2-PrimColon 


2003 


728627 


1 828.P03 .gz43_232222 


F 


M00056725C:H06 


UC2-PrimColon 


2004 


630516 


1 828.H04.gz43_232230 


F 


M00056716C:B06 


UC2-PrimColon 


2005 


584179 


1 828 . AOS .gz43_23223 9 


F 


M00056709D:D05 


UC2-PrimColon 


2006 


558544 


1 828.G05.gz43_232245 


F 


M00056715C:B07 


UC2-PrimColon 


2007 


553294 


1 828 .N05 .gz43_232252 


F 


M00056723C:E01 


UC2-PrimColon 


2008 


726576 


1828 O05 ez43 232253 


F 


M00056724C:H11 


UC2-PrimColon 


2009 


734622 


1828 L07 ez43 232282 


F 


M00056721D-.D01 


UC2-PrimColon 


2010 


735362 


1828.P07.gz43_232286 


F 


M00056726A:C12 


UC2-PrimColon 


2011 


455297 


1828.A08.gz43_232287 


F 


M00056709D:E12 


UC2-PrimColon 


2012 


734990 


1828.N08.gz43_232300 


F 


M00056723C:G03 


UC2-PrimColon 


2013 


707609 


1828.B10.gz43_232320 


F 


M00056710D:C05 


UC2-PrimColon 


2014 


728178 


1828.L10.gz43_232330 


F 


M00056721D:F12 


UC2-PrimColon 


2015 


447634 


1828.P10.gz43_232334 


F 


M00056726BH06 


UC2-PrimColon 


2016 


| 729813 


1828.Fll.gz43_232340 


F 


M00056714DA11 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


2017 


728528 

/ mm* U — ' *-* \J 


1828 111 ez43 232343 

X Ufa U.i-l i. '£3* >M-J Am* mJ jLlmJ^T ~J 


X 


M000567 1 8 A-H05 


T TP7 -Prim Pol nn 
\J v-/fa x i nil v^uivJii 


2018 


132110 


1828 Jll ez43 232344 

X OfaO • «# X X « r^r tTmJ fa> *J fa *J i r 


F 

JL 


M0 005 67 1 9B * G04 


T TP ? -Prim Pn Inn 


2019 


505858 


1828 F12 ez43 232356 

XUfaU.X X-fa t* -J m^-*mJ AmtmJ mJ \J 


F 

X 


M0005 67 1 4D *E08 


T TP7 -PrimPol nn 

W V_yfa~X 1 XlllV^V/lV/Xl 


2020 


735047 


1828 E13 ez43 232371 


F j 


M00056714A-B02 

1T1WUV JU / X X« mVmW \J mm* 


I JP2-PriiTi Pnlnn 

V^fa X X Xill VvV^XVJlX 


2021 


727749 


1828 K13 ez43 232377 


F 

X 


M0005672 1 A* C07 


T TP9-PrimPn1nn 

w V>fa X X XillV^UXUlX 


2022 


598746 


1828 LI 3 2z43 232378 

lUAU.JJlJtgtrTJ Am**/ fa—' / L) 


F 

X 


M00056722AE 1 0 


UC2-PrimPftlon 

w Vvfa X X XXllAw'WXvJAl 


2023 


728061 


1828 013 ez43 232381 


F 


M00056725A-E02 


V-/ V_/fa X X XXHV_>UXVJXX 


2024 


448217 


1828 F14 ez43 232388 

XUfaVFcX X ■ -flymmtT mJ Am*mJ 4 * ■ . f \J \j 


F 


M00056714DH07 

lTX\/\/Vo' v / X FX/ «X XV/ # 


T TP? -Prim Polon 

V-/ \_xfa X X XXLXV_y\_IXV^XX 


2025 


485899 


1828 L14 cz43 232394 

1 U^U X^tgLiT>/ ******* mmtm r -X T 


F 


M00056722A-F08 

lTlUUVa/v * fafaXX.X \J KJ 


UC2-PrimColon 

V>fa X X XXllV^V/XV^XX 


2026 


641287 | 


1828 B15 ez43 232400 

X UAlU » -M-^ A mfZ^tilmJ Am*-/ Am* ¥^ VJ \J 


F | 


M0005671 1AA09 


UC2-PrimColon 

KJ Vvfa X X XXIXV^rVJXv/XX 


2027 


48619 


1828 D15 sz43 232402 


F 


M000567 13 A*F05 


UC2-PrimPolon 

vyv^fa X X XXXXV^sV/XwAX 


2028 


728929 


1828 E15 cz43 232403 


F 


M000567 14A*B09 

iTJiX/V/V/JW f X T^XV . S\mt \J ** 


UC2-PrimPolon 

Vv \_/fa X X XXil V/UIUiX 


2029 


451972 


1828 115 sz43 232407 

lU^VlAlt/lgU'a/ A****/ Am* T \J 9 


F " 


M00056718BC02 


UC2-PrimColon 

V V^fa X X XX 11. V-zV/XVyXL 


2030 


486363 


1828 J18 ez43 232456 

XUfalT.t/ IV/^gttlT*; A-*mJ Ami \ mj \J 


F 


M000567 1 9C*B 1 0 


UC2-PrimPolon 

V_/fa X X XUlVvlwll 


2031 


728897 


1828 C19 ez43 232465 


F 


M00056712BB11 


UC2-PrimPnlon 

W V^fa X X IX 11 V_y VJ1 wll 


2032 


448217 


1828 E19 ez43 232467 

JL >XJ X *** *t~A-lTmJ mZmfmS mm* » \J # 


F 


M00056714AE12 

lrlUUv/^/U / X txx.X- /Xfa 


UC2-Priii.Polon 

V_/ Vyfa X X XXXXVvwXVXX 


2033 


646309 


1828 119 ez43 232471 

iU^UaJliy «g£lTk> mm* mJ fa T I X 


F 


M000567 1 8C B0 1 


UC2-Priin Polon 
w v-/fa x x xui v/Uimi 


2034 


728303 


1828 J19 ez43 232472 


F 


M0005 67 1 9C F06 


UC2-PrimPolon 

KJ V-^fa X X XXXJL V^vlXV/XX 


2035 


728002 


1828K19ez43 232473 

10lfO,l\i7,g£|'TJ mm*mJmmw~T I -J 


F 

X 


M00056721BD03 

IVIUUU J V/ / fa X XmJ mlm*F\J -J 


T TP2-P ri m Polon 

VJ V^fa X X XXI J. VvWXV/lX 


2036 


473640 


1 82ft "M20 o-M^ 7^249? 

lOiO.nAv.gL'TJ fa^/fa^-^fa 


F 

X 


M00056724B-F 1 1 ' 

±\X\J\J\J~J\J 1 fa^XJ .XJ# Jl X 


I TP9-PrimPolon 

\J \m*t--* XT X XlllV>\JXV/XX 




777480 
# fa- /tou 


1878 071 <r74^ 7^7509 


F 


M00056775r i -Ani 


T TP7-PriTnPolon 
vJ V^fa~x x 1£U V^UlvUl 


2038 

faVs«J o 


509678 


1828P21ffz43 232510 

X OfaO .X fa X - ' *£*,mj AmfmJ XV/ 


F 

X 


M00056726D-O08 


IJP2-Prin,Polon 

VJ v^fa X X XliXV/vU SJUl 


2039 

faV/«J Zs 




1828E22a74 4 3 732515 


F 


M00056714A-H06 

L\ ±.\J \J\J .J \J i X*TxTk..XX\y\J 


T TP7-PriinPolon 

VJ V^fa X 1 XlllV>V/lV/XX 


7040 


447150 


1 878 1M2? <T74^ 732574 

J. OfaO.l^lfafa. £^fa*T J fa-»7fa— 'fa*T 


F 


M0005 6774B Ciftl 


1 TP?-Pri tnPolon 

KJ V-'fa X X IXllv^VJXVJix 


2041 


728273 

/ faOfa 1 mJ 


187X A7^ **74^ 7^2527 

X OfaO.XVfa./ .yfTJ A-.mJ /ZmmJ fa / 


F 

X 




T TP7-PriTnPolon 
vj v^fa x x xxixv> v/iwxx 


2042 


734453 


1828 C23 ez43 232529 

X OfaO . V.yfa^y .jrf)» tT mJ fa^Jfaw/faJ' 


F 


M000567 1 2B *F02 


IJP2-PrimPolon 

VJV>fa X X XXXI V-' vxxv/xx 


2043 


732006 


1828 D23 sz43 232530 

X OfaO »\\-*--WmJ . f^AmlTmJ fa mmtmJ mj \J 


F 

X 


M00056713C-A10 


UC2-PrimColon 

VJVyfa X X XXXX V/U1V/U 


2044 


449585 


1828 123 2z43 232535 

X OfaO . 1 fa«J * *-)* ' « — ^ AmlmJ Zm-mJ mj mJ 


F 

X 


M000567 1 8D*D 1 2 


I TP? -Prim Pol on 

KJ\msZm X X XXLXV^wXWXX 


2045 


728966 


1828 E24 ez43 232547 

X OfaO . 1, iifajT t &tfTJ fa J £—>J ~ 1 


F 

X 


M00056714B-B06 

IV Xvi \J \J mJ VJ 1 X^TXJ *XJ\J \J 


I TP2-PrfmPolon 

y~J Vvfa x xxxii\_ajx\jxx 


2046 

fa. V^TVf 


529356 


1818 COl cr^43 7^7945 


F 

JT 


M00056747D-A03 


1 TP ? -Pri m Pnl nn 

VJ V^fa X X XlllVvVJXVJXX 


2047 

faV#*T / 


551693 

w' ^ X \J Zs -J 


1838 P01 tr743 232958 

iOJO.X vl.gii^J fa.Jfa-^'^' o 


F 

X 


M00056764C-A02 


TJP2-PrimPolon 

vj \_/fa x x xinv^uxvjxx 


2048 


438663 


1838B03ffz43 232976 


F 


M00056746D-D06 


T TP?-PrimPolon 

VJ Vvfa X X XXIXV^VJXVJXX 


2049 


735464 


1838 E03 ffz43 232979 


F 


M00056750R-H03 


T TP2-PrimPolon 

VJVA. X X XXllV-^VJXVJXX 


2050 


736035 


1838 J03 ez43 232984 


F 

X 


M00056756C-D06 


T TP?-PrimPolon 

VJV^fa X X XlXlVw/VSXV/XX 


2051 

~-*\J ~J X 


736210 

/ .j VJ fa X V/ 


1838L03ez43 232986 


F 


M00056759r*C04 


T JP2-PrimPolon 

VJ V-sfa X XXLllV/V^XVyXX 


2052 


728464 


1838.105 .gz43_233015 


F 


M00056755B:E07 


UC2-PrimColon 


2053 


730178 


1838.K05.gz43_233017 


F 


M00056758C:B08 


UC2-PrimColon 


2054 


481614 


1838.N05.gz43__233020 


F 


M00056762B:D06 


UC2-PrimColon 


2055 


402799 


1839.A01.gz43_233327 


F 


M00056766D:G09 


UC2-PrimColon 


2056 


675768 


1839.N01 .gz43_233340 


F 


M00056784D:C06 


UC2-PrimColon 


2057 


577305 


1839.P01.gz43_233342 


F 


M00056787B:C07 


UC2-PrimColon 


2058 


730505 


1839.K02.gz43_233353 


F 


M00056781A:E06 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORJDEN 
T 


CLONE ID 


LIBRARY 


2059 


349977 


1839.B03.gz43_233360 


F 


M00056768DA09 


UC2-PrimColon 


2060 


726344 


1839.N03.gz43_233372 


F 


M00056784D:G10 


UC2-PrimColon 


2061 


724183 


1839.J04.gz43_233384 


F 


M00056780B:E06 


UC2-PrimColon 


2062 


730352 


1839.M04.gz43_233387 


F 


M00056783C:E03 


UC2-PrimColon 


2063 


510596 


1839.E05.gz43_233395 


F 


M00056773C:C09 


UC2-PrimColon 


2064 


128773 


1839.F05.gz43_233396 


F 


M00056775A:A05 


UC2-PrimColon 


2065 


728464 


1839.G06.gz43_233413 


F 


M00056776A:G04 


UC2-PrimColon 


2066 


449010 


1839.L06.gz43_233418 


F 


M00056782B:G05 


UC2-PrimColon 


2067 


460023 


1 839.M06.gz43_2334 1 9 


F 


M00056783DA02 


UC2-PrimColon 


2068 


733744 


1839.K07.gz43_233433 


F 


M00056781B:C03 


UC2-PrimColon 


2069 


2180 


1 839.L07.gz43_233434 


F 


M00056782B:H11 


UC2-PrimColon 


2070 


556458 


1839.M07.gz43_233435 


F 


M00056783D:B07 


UC2-PrimColon 


2071 


711325 


1839.A08.gz43_233439 


F 


M00056767A:F12 


UC2-PrimColon 


2072 


135593 


1839.E08.gz43_233443 


F 


M00056773C:F12 


UC2-PrimColon 


2073 


639178 


1839.K08.gz43_233449 


F 


M00056781B:C05 


UC2-PrimColon 


2074 


504513 


1839.B09.gz43_233456 


F 


M00056769C:C03 


UC2-PrimColon 


2075 


493193 i 


1 839. J09.gz43_233464 


F 


M00056780B:H04 


UC2-PrimColon 


2076 


728133 


1839.K09.gz43_233465 


F 


M00056781B:E01 


UC2-PrimColon 


2077 


727737 


1839.A1 l.gz43_233487 


F 


M00056767B:C01 ! 


UC2-PrimColon 


2078 


477053 


1839.F12.gz43_233508 


F 


M00056775B:H07 


UC2-PrimColon 


2079 


726384 


1839.P12.gz43_233518 


F 


M00056787C:G01 


UC2-PrimColon 


2080 


736686 


1839.A13.gz43_233519 


F 


M00056767B:G08 


UC2-PrimColon 


2081 


483549 


1839.L13.gz43_233530 


F 


M00056782D:F02 


UC2-PrimColon 


2082 


549801 


1839.A14.gz43_233535 


F 


M00056767CA09 


UC2-PrimColon 


2083 


730484 


1839.H14.gz43_233542 


F 


M00056777D:D04 


UC2-PrimColon 


2084 


647991 


1839.J14.gz43_233544 


I F 


M00056780C:H12 


UC2-PrimColon 


2085 


490393 


1839.L14.gz43 _233546 


F 


M00056782D:F08 


UC2-PrimColon 


2086 


643968 


1839.N14.gz43_233548 


F 


M00056785C:B09 


UC2-PrimColon 


2087 


641968 


1839.B15.gz43_233552 


F 


M00056770CA07 


UC2-PrimColon 


2088 


726261 


1839.L15.gz43_233562 


F 


M00056782D:F10 


UC2-PrimColon 


2089 


733891 


1 839.M 15 .gz43_233563 


F 


M00056784BA01 


UC2-PrimColon 


2090 


730022 


1839.015.gz43_233565 


F 


M00056786DA03 


UC2-PrimColon 


2091 


730296 


1839.M16.gz43_233579 


F 


M00056784B:C02 


UC2-PrimColon 


2092 


733149 


1839.M17.gz43_233595 


F 


M00056784B:D06 


UC2-PrimColan 


2093 


728487 


1839.A18.gz43_233599 


F 


M00056768AC10 


UC2-PrimColon 


2094 


726229 


1839.C18.gz43J233601 


F 


M00056771D:D10 


UC2-PrimColon 


2095 


523868 


1839.G18.gz43 233605 


F 


M00056776D:B02 


UC2-PrimColon 


2096 


577305 


1839.L18.gz43_233610 


F 


M00056783A:C08 


UC2-PrimColon 


2097 


727216 


1839.J19.gz43_233624 


F 


M00056780D:G05 


UC2-PrimColon 


2098 


47461 


1839.L19.gz43 233626 


F 


M00056783B:B01 


UC2-PrimColon 


2099 


550780 


1839.N19.gz43_233628 


F 


M00056785D:C09 


UC2-PrimColon 


2100 


606076 


| 1839.P21.gz43 233662 


F 


M00056788A:D06 


UC2-PrimColon 
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SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


2101 


691229 


1839.C22.gz43_233665 


F 


M00056772A-.C08 


UC2-PrimColon 

V-> -w X X UUW1V/1X 


2102 


727366 ; 


1839.D22.gz43 233666 


F 


M00056773A:G10 


UC2-PrimColon 


2103 


724722 


1 839. J22.gz43_233672 


F 


M00056780D:H10 


UC2-PrimColon 


2104 


725784 


1839 M22 ez43 233675 


F 


M00056784CC06 


UC2-PrimColon 

W X X 1XX1V/VXV/XX 


2105 


732345 


1839B23ez43 233680 


F 


M00056771AF03 


UC2-PrimColon 

V-/ X x xixx wxuxi 


2106 


726699 


1839 C23 ez43 233681 


F 


M00056772A-D03 


UC2-PrimColon 


2107 


513156 


1839 J23 sz43 233688 


F 


M00056781AC05 


UC2-PrimColon 

\J V^X< X X UllVWlVll 


2108 


631038 


1839 E24 ez43 233699 


F 


M00056774DB02 


UC2-PrimColon 


2109 


609459 


1839 H24 ez43 233702 


F 


M00056778CG08 


UC2-PrimColon 


2110 


482788 


1839 J24 ez43 233704 


F 


M0005678 1 AD02 

IWv W«*V* * *J XX >..X^V/^ 


UC2-PrimColon 


2111 


730592 


1839N24ez43 233708 


F 


M0005 6785D *G07 


UC2-PrimColon 

V-/ V/X* X X XXXXV/V/XV/XX 


2112 


735801 


1852 A13 ez43 235742 


F 


M0005687 1 CD05 


UC2-PrimColon 

V/V^A* X X XlllVvlVXl 


2113 


472704 


1852 HI 3 ez43 235749 

x ip«sx« .x 11*/ ' 1 *✓ # ■ ^ 


F 


M00056879DD0 1 


UC2-PrimColon 

v/ V^/^rf X X XX1XV/V/XV/XX 


2114 

x# X X ~ 


649299 


1852 113 ez43 235750 


F 


M00056881A A10 


UC2-PrimColon 

\~r-£* X 1 XXXA V/lvix 


2115 


650944 


1852N13ez43 235755 


F 


M00056886BB10 


UC2-PrimColon 

V/ Xm'X* X X XXXlV/V/XUXX 


2116 


481641 


1852 E14 ez43 235762 


F 


M00056876CB02 


UC2-PrimColon 

x_y V^-^-- X X XX X 1V_/V/1V/11 


2117 


488349 


1852 F14 ffz43 235763 


F 


M00056877BH09 

1TXV V/ \J -.J \J %J / / X-# *X XV/ 


UC2-PrimColon 

V>/ V^/XV X X XXXXV-S vyxv/xx 


2118 


735096 


1852 114 ffz43 235766 


F 


M0005688 1 A-C02 


UC2-PrimColon 

\s V/Xf X X UU\y v/XV/XX 


71 19 


594434 


1852 J14 ffz43 235767 


F 

X 


M00056882BE12 

i»lV/ W- ' W IJ OI(XJ. X_/ XX* 


I JC2-PrimColon 

\J\^t£* X X XXXXV^WXWXJL 


2120 


550673 


1852 L14 ez43 235769 


F 


M00056884CH08 

1*1V V \-/ V/V/ v i v *X XV V 


UC2-PriixiColon 

V/ V_> X x xxxiw/xv/xx 


2121 


735672 


1852 N14 cz43 235771 


F 


M00056886B:C05 


UC2-PrimColon 


2122 


732712 


1852 K15 ez43 235784 


F 


M00056883D:A07 


UC2-PrimColon 

V/A» X JL XXil WXV/11 


2123 


472307 


1852 L15 sz43 235785 


F 


M00056884CH1 1 


UC2-PrimColon 


2124 


729981 


1852 D16 ez43 235793 


F 


M00056875D:C04 


UC2-PrimColon 


2125 


639950 


1852 116 ez43 235798 


F 


M0005688 1 AH02 

XT A V w U XX X*X XV4* 


UC2-PrimColon 


2126 


724517 


1852 L16 ez43 235801 


F 


M00056884DC07 


UC2-PrimColon 

V_/x^-» A X A*XX^,*/VA*V^.»X 


2127 


735412 


1852 B17 2z43 235807 


F 


M00056873AH06 

lfXvVv>/VU / tj Ji l.llVU 


UC2-PrimColon 

V-/ x x xixiv^xAivrxx 


2128 

Xr XZ*\J 


724223 


1852 N17 ez43 235819 


F 


M00056886BH02 

ifxv vy v »y v/ u v> \jmj *x xv >w 


UC2-PrimColon 

V«/ Vri/Xrf X X XXIXW/XV/XX 


717Q 




18S7ni8ff743 235825 


F 

X 


M00056875DE09 


UC2-PrimColon 

\J \^r£* X X XXllV>WXwXX 


7130 


561396 


1852 URffz43 235833 


F 

X 


M00056884DD06 


UC2-PrimColon 

\J X X XlilV/UlvXX 


2131 


726408 


1852 B19 ffz43 235839 


F 


M0005 6873B • C09 


UC2-PrimColon 

V-/ Vx« X X IXUV'VJiwXl 


2132 


730059 

/ JUV/J7 


1852 C19 ez43 235840 


F 


M00056874CD05 


UC2-PrimColon 

\»>xw X X XXXX W/XV/XX 


2133 


6^S327 


1852 F19 az43 235843 


F 

X 


! M00056877C-G12 


UC2-PrimColon 

V/V^Xtf X XlXllV-'UXV/XX 




737877 


1857 Ml 9 p-743 235850 

IOJZi.IVII 17 .gZ/T.^ Jj-J ~J \J^f\J 


F 

X 


M00056886A-A09 

X TXvf UUJUOO UA ,l\V/ -7 


UC2-PrimColon 

V/ V^X« X X XIXXV/V/Xv/XX 


71^S 


47^S17 


18S7 TM10 <r743 735851 


F 

X 


M00056886r-D02 


UC2-PrimColon 


2136 


730441 


1852.F20.gz43_235859 


F 


M00056877C:H03 


UC2-PrimColon 


2137 


451184 


1852.N20.gz43_235867 


F 


M00056886C:D11 


UC2-PrimColon 


2138 


733040 


1852.D21.gz43_235873 


F 


M00056875D:H12 


UC2-PrimColon 


2139 


1 732872 


1852.K2Lgz43_235880 


F 


M00056883D:F07 


UC2-PrimColon 


2140 


152 


1852.M2Lgz43_235882 


F 


M00056886A:C11 


UC2-PrimColon 


2141 


730301 


1852J22.gz43_235894 


! F 


M00056881B:G04 


UC2-PrimColon 


2142 


471931 


1852J22.gz43_235895 


F 


M00056882D:A06 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


OMEN 
T 


CLONED) 


LIBRARY 


9143 


730106 


1852 L23 ez43 235913 


F 


M00056885A:D12 


UC2-PrimColon 




640030 

Ut7V/J V7 


1852 024 ffz43 235932 


F 


M00056888A:H04 


UC2-PrimColon 


914-5 


S6934S 


1861 104 &z43 235990 


F 


M00056900C:B12 


UC2-PrimColon 


914A 


479996 


1X61 P04 P743 235997 


F 


M00056910BF01 


UC2-PrimColon 


9147 


404393 


1861 E07 ez43 236034 


F 


M00056895C:E11 | 


UC2-PrimColon 


9148 


730303 


1861 B08 az43 236047 


F 


M00056891C:H08 


UC2-PrimColon 


9T40 


67699 1 


1861 G10 ez43 236084 


F 


M00056898C:B06 


UC2-PrimColon 


91 SO 


648774 

C/*tO / /*T 


1861 Oil ^z43 236108 


F 


M00056909B:E11 


UC2-PrimColon 


91 S 1 


448368 


1861 H12 ffz43 236117 

J. OVJ 1 .X A Jul 1 / 


F 


M00056899D:B06 


UC2-PrimColon 


91 S9 

Z1->Z 


480001 


1861 K12 &z43 236120 


F 


M00056902D:H09 


UC2-PrimColon 


91 ^3 




1861 J14t?z43 236151 


F 


M00056902A:G12 


UC2-PrimColon 


91 S4 


794773 


1861 C16ez43 236176 


F 


M00056893B:G12 


UC2-PrimColoii 




730608 


1861 T16ez43 236182 


F 


M00056901B:C03 


UC2-PrimColon 


91 ^6 


649807 


1861 K16pz43 236184 


F 


M00056903A:C08 


UC2-PrimColon 


91 ^7 
Z 13 / 


^4897^ 


1861N16c*743 236187 


F 


M00056908B:D02 


UC2-PrimColon 


9 1 ^£ 


7951640 


1861 ri7o^43 236192 


F 


M00056893C:A02 


UC2-PrimColon 


9 1 


470579 


1861 T*18 a743 236907 


F 


M00056892B:C09 


UC2-PrimColon 


9 1 AH 

zlou 




1861F?18a743 936910 


X 


M00056896AC01 


UC2-PrimColon 


ZlOl 


7^495 


1861 K18 a743 236216 

lOOl .xV 1 O .gZ/T-J Z*J>\J£* x V 


F 


M00056903A:F03 


UC2-PrimColon 


9 1 £9 
ZlOZ 




1861 P10ot43 936237 

10Ul.ri7.giTJ ^JUZ»J / 


F 


M00056911B:A01 ' 


UC2-PrimColon 


9 1 

ZIOO 


79Q060 


1861 090^743 236244 


F 


M00056898D:H09 


UC2-PrimColon 


91 A4 


A^A667 


1861 K91 cr743 236261 


F 


M00056900B:D01 


UC2-PrimColon 


9 1 


739676 


1861 099^743 236276 


F 


M00056899A:A11 


UC2-PrimColon 


91 

Z lOO 


730906 


1X61 H24 P743 236309 


F 


M00056900B:F07 


UC2-PrimColon 


91 fJl 
ZIO / 


4^3001 


1 1861 K94 ^743 236312 


F 


M00056903B:D09 


UC2-PrimColon 


91 Aft 
ZlOo 


79S4S1 


1861N94ov43 236315 


F 


M00056908D:A11 


UC2-PrimColon 


91 AO 

zioy 


730851 
/jUoj 1 


1863 P01 oz43 236704 


F 


M00056936C:E04 


UC2-PrimColon 


917H 
Zl /U 


797506 


1863 H01 P743 236709 

X OUJ .XXV/ X .f£yC*r~J ^.~f\J i V/-X 


F 


M00056941D:D07 


UC2-PrimColon 


9191 
Zl / 1 


736903 


1863 T 01 1T743 236713 


F 


M00056946A:F07 


UC2-PrimColon 


9179 
Z 1 /Z 


733010 


1863 NO 1 ez43 236715 


F 


M00056948C:F03 


UC2-PrimColon 


9 1 73 


77737 


1 863 C02 ffz43 236720 


F 


M00056936C:F11 


UC2-PrimColon 


9174 
Z 1 /^t 


638083 

DJO70J 


1 863 T 09 r*743 236729 


F 


M00056946A:G06 


UC2-PrimColon 


917^ 
Zl / J 


736001 


1863 M09 «T743 236730 


F 


M00056947C:B04 


UC2-PrimColon 


917A 
Zl /O 


731 106 
f D 1 If O 


1 863 P09 tT743 236733 


F 

X 


M00056950DF12 


UC2-PrimColon 


9 1 99 
Zl / / 




1 8« A03 ot43 936734 

1 OUJ .1WJD .gZ*t-> f JU r Jt 


F 

X 


M00056934A-E07 


UC2-PiimColon 


2178 


734151 


1863.E03.gz43_236738 


F 


M00056939A:C01 


UC2-PrimColon 


2179 


| 456183 


1863 .H03 .gz43_23 674 1 


F 


M00056941D:G05 


UC2-PrimColon 


2180 


734629 


1863.103.gz43_236742 


F 


M00056943A:B04 


UC2-PrimColon 


2181 


733623 


1863J03.gz43 236743 


F 


M00056944A:D09 


UC2-PrimColon 


2182 


733856 


1 863 .N03 .gz43_236747 


F 


M00056948D:A11 


UC2-PrimColon 


2183 


727480 


1863.P03.gz43 236749 


F 


M00056950D:H09 


UC2-PrimColon 


2184 


448233 


! 1863.J04.gz43 236759 


F 


1 M00056944B:C02 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


2185 


729691 


1863 105 ez43 236774 


F 


1V1000S604^ A-F01 


UV/-6"X IIlIlV^VJlVjXl 


2186 


500239 


1863N05ez43 236779 


X 


M000S 6040 A • A 04 


U v^Z-x^nxIXv^OIOll 


2187 


646314 


1863 K06 ffz43 236792 


F 

X 




T 1C* 9 -P ri m P nl r\r> 

o v^z. x iirri^oioii 


2188 


492893 


1863 A08 ffz43 236814 


F 

X 


M000S60^4P-P04 


T Tr , 9-PrimPn1rtri 
Vj x LUlJX^KjLVil 


2189 


451624 


1863 D08 ffz43 236817 


X 


TV/f 000 S 60^ R A -MOO 


vj v^/Z.~x x niiv^uivjii 


2190 


552416 


1863 L08 ffz43 236825 


F 

X 


M000S6Q46r-J*08 


KJ \sZ*-X X XlllvwiJivJXl 


2191 


555478 


1863A10ez43 236846 


F 

X 


M000560^4P*fiO0 


T TP 9 -Prim Pnl r»n 

V-zl^Z. X^l XllIV^VjlVJXl 


2192 


616541 


1863 G10 sz43 236852 


F 


M000 S6Q4 1 R-R07 

XV1VIV/VI JU-'*t 1 JJ.X/V/4 


VJ V_yZ» X X illl\_y VJlVJll 


2193 


410487 


1863 O10sz43 236860 


F 


M0005 695 OA H03 


VJ V_s^ls JT X JXlLVv VJ1VJ11 


2194 


726825 


1863.B11 ez43 236863 


F 


M0005 6935CG08 


I JC2-PrimCo1oii 

X X XLllV^V/XV/XX 


2195 


733144 


1863 D12 ez43 236881 


F | 


M00056938BC09 


I TP7 -Prim Colon 

VJ v//< X 1 1111V>VJ1VJXL 


2196 


642528 


1863 E12 £z43 236882 


F 


M000S 6Q^9RF0S 


1 TP 9 -Prim Pol r»n 

VJ VxZ. X X Xlll VjlVjll 


2197 


735994 


1863 J12 ez43 236887 


F 


M000^ 6944C -R03 

1 tX V/ V/ V/*7 V/ ^*T*T • XJ V7 J 


1 TP9 -Prim Pol on 

VJ V^-^ X X XLXiV^VJlVJXX 


2198 


724401 


1863 K12 ffz43 236888 


F 


M000S694SC-A1 1 

lVXVyV/V/»7VJ^*T^/\_/.X\.X X 


1 TP9-PrimPo1on 

VJ V^^ X X XLllV^VJlV/lX 


2199 


736129 


1863 M12 2z43 236890 


F 

X 


M000^ 6948 A -D09 


T TP 9 -Prim P ol on 

UVvi> X 1XXX1V>VJXVJXX 


2200 


448046 


1863 P13 ffz43 236909 


F 

X 


M 000S6OS 1 RF00 


T TP9 -Prim Pol on 
vj r l JJ.llv-'VJivJxi 


2201 


731216 


1863 DH **743 936990 


F 

X 


M000S60^8RTT08 

±VX\J\J\J*J\JZsJ OD .XXV/ o 


T TP 9 -Pri m Pol on 

VJ V_^Z. XT I JXllV^iJlVjJJL 


2202 


449061 


1863 MIS ff743 236938 


F 

X 


KT000S6048RR03 


T TP 9 -Prim Pol on 


2203 


730815 


1863F17&743 236963 


F 

X 


1^/1000^6040(^^0^ 


T TP 9 -Prim Pol on 

VJ X X Xlll \s yjL XJll 


2204 

\J*T 


732756 

/ 1 «J VJ 


1863 HI 7 o-74^? 93606S 


F 

X 


M000^604?r*Fl 1 

1VXV/VJWJCJ5'*tZ.V^.X^ X 1 


T TP9 -Prim Pol on 

VJv^Zf JT 1 1X11 V^VJXVJXI 


2205 


468015 


1863 117 tr74^ 236Q66 

luw.ll f .ggtJ ^£>.JVJ7V)V? 


F 

X 


M000S604^P-A01 


T TP9 -PrimPol on 
VJ V/Zi x 1 llllv^UlvJxl 


2206 


733081 

/ JJV/O X 


1863 T17 <*^43 936067 


F 


M000^6044r>- A 06 


T TP9 -Prim Pol on 
Uui x x illlv^UlUIl 


2207 


555484 


1863 T,17 *»743 736969 


F 

X 


M000 S 6046D frOQ 

1VX V7 V/ V/ J VJ Z7*+\JXJ . VJ v/^ 


T TP9 -Prim Pol on 
VJ V-'Z, x x xlliv^vllvJxl 


2208 


731355 

/ -J X »J«J«J 


1863 A18a743 736Q74 


F 


M000S60^SA-r09 

1VXVJV/VJ.J VJj7 J V^VJZ, 


T Tj^O -Prim Pol on 

VJ V^Zr'X lUIlv^UlvJll 


2209 


730375 


1 863 G1 8 <* 7 43 236980 


F 

X 


M000S6041 TVfrOK 

lVXV/VyV/«7VJ3'*T ID .VJv/O 


T TP9 -PrimPol on 

VJ V^Z. X X XlllV^VJlVJXl 


2210 

XV/ 


467803 


1863M18«>-743 9^6086 

X OU J .1V1 1 0 .K£it J ^JVJ^OVJ 


F 


M000 S 6048R • frO S 

1VXVJ VJ VJ »J VJ OX3 . \J\J J 


T TP9-PrimPolon 

VJ V^^ X IMllV^VjlVJll 




73*5306 


1863F)20a74^ 9^7000 


F 


M000S60^8P-F1 0 

iVlVJVJVJ«JVJ^OOV^.Jr 11/ 


T TP9-PrimPo1on 
vj V^Z, x 1 UIlv^Ulvxll 


2212 


729316 


1863 120 ffz43 237014 


F 

X 


MOOOS6043r>*R07 

X¥1UvvJU7^J1/.X/UAi 


T TP9 -Prim Pol on 

Vj wZ» X 1 UllV^UlvJXX 


2213 


551441 


1863L20&743 237017 


F 

X 


M000S6047A-r05 

IVXVJVJU +)\JZ7'-T 1 rt..VvUJ 


T TP9-PrimPo1on 

VJ \^Z,"X X XiHV^ VJIVllX 


2214 


735484 


1863 A21 ez43 237022 


F 

X 


M0005603S A -F04 

XVXV/V/\/»7 VJ^«?^/^\,.X7 V/*T 


T TP9-PrimPolon 

Uv*» 1 X IIIIV^VJXVIXX 


2215 


171511 


1863 E21 m43 237026 


F 

X 


M000^69391>R07 

iyx\j\j\j*j \jzs ~jzs xj „xj\j 


T TP9 -Prim Pol on 

VJ V-/Z< X 1 XlilV^VJlVJll 


2216 


728076 


1863H21 ^743 237079 


F 

X 


M000^6049r>r>07 


T TP9-PrimPolon 

VJ V^Z. X X XI I lwVJXVJIl 


2217 


511351 


1863 H22 ez43 237045 


F 

X 


M000^6Q42DD1 1 

ItIWUJ VJ^*TX»X-^ -XV X X 


T TP 9 -Pri m Pol on 

VJ X X X111V^\JXV/XX 


2218 


731125 


1863 122 £z43 237046 

lUl/J.I^.giTJ # V/~V7 


F 

X 


M00056943DH08 

XtX \J \J \JJ VJ ^*TJ XJ .X XV/ o 


T TP9-PrimPolon 

VJ V/^> XT X XlllV^VJXVJXX 


2219 


524706 


! 1863 B24 ffz43 237071 


F 


M00056936C-C06 


T TP 9 -Prim Pol on 

VJ v^Z. X XXIXIV^VJIVJXX 


2220 


485880 


1863.C24.gz43_237072 


F 


M00056937C:H08 


UC2-PrimColon 


2221 


736197 


1864.103.gz43_237126 


F 


M00056961C:C07 


UC2-PrimColon 


2222 


731089 


1864.M04.gz43_237146 


F 


M00056968C:C06 


UC2-PrimColon 


2223 


642781 


1864.G12.gz43_237268 


F 


M00056959C:B10 


UC2-PrimColon 


2224 


731947 


1864.M14.gz43_237306 


F 


M00056969A:B07 


UC2-PrimColon 


2225 


448712 


1864.M16.gz43_237338 


F 


M00056969A:C07 


UC2-PrimColon 


2226 


736289 


1864.117.gz43_237350 


F 


M00056962D:F09 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 

T : 


CLONE ID 


LIBRARY 




735123 


1873.E02.gz43 237490 


F 


M00056978D:B07 


UC2-PrimColon 


9998 


724411 


1873H02ez43 237493 


F 


M00056982D:B12 


UC2-PrimColon 


7990 


662617 ! 


1873 P02 cz43 237501 


F 


M00056993D:C05 


UC2-PrimColon 


9930 


639934 


1873 M03 ez43 237514 


F I 


M00056989C:H09 


UC2-PrimColon 


9931 


733563 


1873 C04 ez43 237520 


F | 


M00056977A:C02 


UC2-PrimColon 


9939 


903793 


1873 K04 ez43 237528 


F 


M00056987C:D08 


UC2-PrimColon 


9933 


703978 


1873 L04 sz43 237529 


F 


M00056988C.D05 


UC2-PrimColoii 


9934 


7321 14 

/ X X ~T 


1873 P05 ez43 237549 


F 


M00056993D:F05 


UC2-PrimColon 


9935 


732213 


1873 J06 ez43 237559 


F 


M00056986A:E09 


UC2-PrimColon 


9936 


725895 


1873 N06 cz43 237563 


F 


M00056990D:C11 


UC2-PrimColon 


9937 


734936 i 


1873. DOS. gz43 237585 


F 


M00056978A:A03 


UC2-PrimColon 


9938 


731607 


1873.E08.gz43 237586 


F 


M00056979B:D03 


UC2-PrimColon 


2239 


731302 


1873.K10.gz43 237624 


F 


M00056987D:A09 


UC2-PrimColon 


9940 


739 113 


1873 L10 az43 237625 


F 


M00056988D:F05 | 


UC2-PrimColon 


9941 


728013 


1873.M10.gz43 237626 


F 


M00056989D:F07 


UC2-PrimColon 


9949 


730089 

/ «J W W U -X 


1873 Dll ez43 237633 


F | 


M00056978A:H04 


UC2-PrimColon 


9943 


648034 


1873 E12 ez43 237650 


F 1 


M00056979C:D11 


UC2-PrimColon 


9944 


729623 


1873.I12.gz43 237654 


F 


M00056985A:D06 


UC2-PrimColon 


9945 


731317 


1873 M12 az43 237658 


F 


M00056989D:H11 


UC2-PrimColon 


9946 


798249 


1873 A13 ez43 237662 


F 


M00056975A:H11 


UC2-PrimColon 


9947 


613029 


1873 C13 ez43 237664 


F 


M00056977B:B06 


UC2-PrimColon 


9948 


73 1 542 


1873 013 £z43 237676 


F 


M00056993A:B08 


UC2-PrimColon 


9940 


732535 


1873.G14.gz43 237684 


F 


M00056981D:H02 


UC2-PrimColon 


9950 


735440 


1873 A16 ez43 237710 


F 


j M00056975B:E04 


UC2-PrimColon 


9951 


641658 


1873 D16 ez43 237713 


F 


M00056978B:G02 


UC2-PrimColon 


9959 


40693 1 


1873 N18 ez43 237755 


F 


M00056991C:H11 


UC2-PrimColon 


9953 


736401 


1873 P18ez43 231151 


F 


M00056994B:H05 


UC2-PrimColon 


9954 


734561 


1873 B20 ez43 2X1115 


F 


S M00056976C.F05 


UC2-PrimColon 


9955 


7971 51 

/ X«> X 


1873 F20 ffz43 237779 


F 


M00056980D:E07 


UC2-PrimColoii 


9956 


731453 


1873 120 ez43 237782 


F 


M00056985C:C06 


UC2-PrimColon 


9957 


649649 


1873 K20 ez43 237784 


F 


M00056988A:F06 


UC2-PritnColon 


9958 


649988 


1873 M20 ez43 237786 


F 


M00056990B:H07 


UC2-PrimColon 


995Q 


579496 


1873 P20 ez43 237789 


F 


M00056994C:B04 


UC2-PrimColon 


9960 


731358 


1873 K22ez43 237816 


F 


M00056988B:A06 


UC2-PrimColoii 


9961 


471883 


1 873 N22 ez43 237819 


F 


M00056992A:E01 


UC2-PrimColon 


2262 


733778 


1873.D24.gz43 237841 


F 


M00056978D:A01 


UC2-PrimColon 


2263 


731592 


1873J24.gz43_237847 


F 


M00056987A:C02 


UC2-PrimColon 


2264 


456236 


1874.D01.gz43 237857 


F 


M00056998C:B10 


UC2-PrimColon 


2265 


732792 


1874.N01.gz43 237867 


F 


M00057009C:B02 


UC2-PrimColon 


2266 


733932 


1874.E03.gz43_237890 


F 


M00057000A:A05 


UC2-PrimColon 


2267 


724810 


1874.P07.gz43 237965 


F 


M00057011C:H03 


UC2-PrimColon 


2268 


731317 


1874.B08.gz43 237967 


F 


M00056996D:A02 


UC2-PrimColon 
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SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


2269 


726408 


1874.D08.gz43 237969 


F 


M00056998DH08 


VJVyZ, X 1X111V_/I_/1VJXL 


2270 


734708 


1874.D12.gz43 238033 


F 


M00056999AG12 


UC2-Prim Col on 

V/ VyX» X X lUlVyVUUll 


2271 


631251 


1874.A15.gz43 238078 


F 


M00056995CH06 


TJC2-Prim Pol on 

*J\yii X 1 XlllVyVyllyXX 


2272 


726696 


1874D16gz43 238097 


F 


M00056999BD07 


1 JC2-PriTnr , o1 on 


2273 


448202 


1874F16ez43 238099 


F 


M0005700 1DF02 


T TP 7 -Prim Pol on 


2274 


734629 


1874H16ez43 238101 


F 


M00057003DF02 


T JP2-PrimPo1on 

X 1 XlllVyV/lVPll 


2275 


561993 


1874K16sz43 238104 


F 


M00057007BG02 


UP2-PrimPo1on 

Vy V/^i X X XlllVy v/1 UJX 


2276 


407723 


1874J17gz43 238119 


F 


M00057006AG10 


UC2-PrimPo1on 

y~J V/^i X X 111 IVy VJUJXX 


2277 


736413 


1874.K17.gz43 238120 


F 


M00057007CA06 


UC2-PrimPolon 

v/VX> X 1 XlllVy V/XVfXX 


2278 


556637 


1874P17sz43 238125 


F 


M000570 12AD 12 


UC2-Prim Pol on 

X 1 XlllVy vyXVUX 


2279 


731686 


1874A19gz43 238142 


F 


M00056995DC1 1 


TJC2-PrimPo1on 

VVA X X XlllVy VJXVJXX 


2280 


559053 


1874B20cz43 238159 


F 


M00056997BC11 


UC2-PrimPolon 

V/\-/^» X X Xlll Vy VJ1\JX± 


2281 


733625 


1875 J01 ez43 238247 


F 


M00057024BF07 


T JP2-P rimPolon 

V V/Af X X XXllVyV/XVyXX 


2282 


461486 


1875 O01 ez43 238252 


F 


M000 5703 0C B03 


TJC2 -Prim Pol on 

UV'Ai X X XlllVv U1U11 


2283 


556019 


1875 B02 ez43 238255 


F 


M0005 70 14B ■ A02 


T JP2-PrimPo1on 

VJ Vy ^ X X XXXI VvvJXV/Xl 


2284 


732119 


1875 F02 sz43 238259 


F 


M00057018C-F02 


T TP9-PrimPo1on 

V/ Vy^. XT X XXXI Vy V/XV/XX 


2285 


411128 


1875 K02 ez43 238264 


F 


M00057025CA08 


I JP2-PrimPolon 

VJ Vy^ X X XXIX Vy Ul V/XX 


2286 


732441 


1875A03 sz43 238270 


F 


M00057012DG03 


UP2-PrimPolon 

V/ VZr X X XXXI Vy U1UU 


2287 


478448 


1875B03ez43 238271 


F 


M00057014BB01 


UC2-PrimPolon 

Vy \y-£. X 1 XXXI Vy VrXV/XX 


2288 


733723 


1875 D03 ez43 238273 

X VJ / «y <Xr V/«y ■&CrT «y X^.y f — ' 


F 


M000570 16B- A09 


UC2-PrimPolon 

Cv/Z. X X XlllVy VAXVMX 


2289 


726173 


1875 103 2z43 238278 


F 


M00057023AH09 


T JP2-PrimPo1on 

U*yA. X X XlllVy VXV/XX 


2290 


451351 


1875 B04 ez43 238287 


F 


M00057014BB06 


UC2-PrimColon 

v/V/*< X XXlllVyVXVXl 


2291 


570573 


1875E04ez43 238290 


F 


M000570 1 7AF 1 1 


UC2-PrimColon 

VX V/^i X X 1X11 Vy U1V/I1 


2292 


655312 


1875K04ez43 238296 


F 


M00057025CD1 1 


T JP2-PrimPo1 on 

U\y^ X X XXXI Vy V/XV/IX 


2293 


732047 


1875 C05 ez43 238304 


F 


M0005 70 1 5 AE02 


T JP2-PrimPo1on 

Vy Vy^> X XXllXVyV/XV/XX 


2294 


734091 


1875 F05 ez43 238307 


F 


M00057018DB10 

xtiuv/v*' / vy x iii/ . X-/ x v/ 


T JP7-PrimPo1on 

Uv/Ai X X XlllVy V/XVUX 


2295 


538582 


1875 E06 ez43 238322 


F 


M000570 17AG04 

iTiuv/u-y / \/ x t /x»\j v/~ 


I IP2-Prim Colon 

VJ VyXf X X XXIX Vy VUV/XX 


2296 


487183 


1875 C07 sz43 238336 


F 


M00057015AG06 

XVXVrVrV/*/ / V X •/XktU V/Vr 


T JC2-PrirnCo1 on 

Vy \y^> X X XXllVyWX vll 


2297 


483919 


1875 G07 ez43 238340 


F 


M00057020DA05 

XVXV/V/v/*/ / vLUl/./W/J 


TJC2-PrimColon 

VyVy^tf X X XXllVy 


2298 


651049 


1875 L07 ez43 238345 


F 


M 00057027B-B 1 1 

IVXV/V/V/— / / » XJ .XJ X X 


UC2-PrimColon 

Vy \y^ X X XlllVy VS1VSXX 


2299 


734392 


1875D08ez43 238353 


F 


M00057016B*E06 


UC2-PrimColon 

\y \y^ X X JXI.l.\_/V/XVyXX 


2300 


557896 

«y«y / u .y v/ 


1875 F08 ez43 238355 


F 


M000570 1 8DE05 

xVXV/v/V/«y /VI Ul/.XyV/u' 


T JC2-PrimColon 

Vy Vy^ X X XlllVy VyXV/XX 


2301 


89082 


1 875 1,08 oz43 238^61 




M00057027B-F04 


T IC2-PrimCo1on 

U\y^> X X XlllVy V/XVUX 


2302 


512863 


1875 P08 ffz43 238365 


F 


M00057032AC01 

lVXV/\/V/«/ / vJ^rZX.VyVl 


T TC?-PrimColon 

Vy Vy^< X XXllXVyUXV/XX 


2303 


474009 


i 1 875 D09 ffz43 238369 


F 


M00057016BH08 

IVXV/V/V/^ / v/ A yJXJ ,X XV/ o 


T TC7-PrimCo1on 

Uv^ 1 X XlllVy V7XV/XX 


2304 


729731 


1875.H09.gz43_238373 


F 


M00057022B:A04 


UC2-PrimColon 


2305 


481581 


1875.A10.gz43_238382 


F 


M00057013B:H07 


UC2-PrimColon 


2306 


732550 


1875.C10.gz43_238384 


F 


M00057015A:H12 


UC2-PrimColon 


2307 


7353,96 


1875J10.gz43_238391 


F 


M00057024D:D12 


UC2-PrimColon 


2308 


642693 


1875.P10.gz43_238397 


F 


M00057032A:F12 


UC2-PrimColon 


2309 


733970 


1875.Dll.gz43_238401 


F 


M00057016C:B05 


UC2-PrimColon 


2310 


639629 


1875.HlLgz43_238405 


F 


M00057022B:F03 


UC2-PrimColon 
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Table 2 



SEQ 
ED NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


2311 


731577 


1 875 .LI 1 .gz43_238409 


p 


M00057027B:F06 


UC2-PrimColon 


2312 


733354 


1875.B12.gz43_238415 


F 


M00057014C:E01 


UC2-PrimColon 


2313 


626791 


1875.F12.gz43_238419 


F 


M00057019A:G04 


UC2-PrimColon 


2314 


559699 


1875.H12.gz43_238421 


F 


M00057022B:H04 


UC2-PrimColon 


2315 


726208 


1 8 75 .A 1 3 .gz43_23 843 0 


F 


M00057013C:E09 


UC2-PrimColon 


2316 


730828 


1 875 J3 1 3 .gz43_23 843 1 


F 


M00057014C:E03 


UC2-PrimColon 


2317 


390968 


1875.113.gz43_238438 


F 


M00057023D:D05 


UC2-PrimColon 


2318 


471522 


1875.013 .gz43_23 8444 


F 


M00057031A:H02 


UC2-PrimColon 


2319 


731838 


1 875 .D14.gz43_23 8449 


F 


M00057016C:E04 


UC2-PrimColon 


2320 


726476 


1 875 .HI 4.gz43_23 845 3 


F 


M00057022C:D12 • 


UC2-PrimColon 


2321 


466920 


1875.114.gz43_238454 


F 


M00057023D:D08 


UC2-PrimColon 


2322 


732712 


1875.N14.gz43_238459 


F 


M00057029D:A06 


UC2-PrimColon 


2323 


561338 


1875.O14.gz43_238460 


F 


M00057031B:A01 


UC2-PrimColon 


2324 


733756 


1 875 . Al 5 .gz43_23 8462 


F 


M00057013C:H01 


UC2-PrimColon 


2325 


484023 


1875.C15.gz43 238464 


F 


M00057015B:E01 


UC2-PrimColon 


2326 


557262 


1875.D15.gz43 238465 


F 


M00057016C:E11 


UC2-PrimColon 


2327 


559096 


1875.E15.gz43 238466 


F 


M00057017C:E03 


UC2-PrimColon 


2328 


455148 


1875.F15.gz43 238467 


F 


M00057019C:C08 


UC2-PrimColon 


2329 


616985 


1875.I15.gz43 238470 


F 


M00057023D:E11 


UC2-PrimColon 


2330 


733479 


1875.N15.gz43 238475 


F 


M00057029D.F01 


UC2-PrimColon 


2331 


558134 


1875.F16.gz43 238483 


F 


M00057019C:E03 


UC2-PrimColon 


2332 


703298 


1875.D17.gz43 238497 


F 


M00057016D:B07 


UC2-PrimColon 


2333 


477797 


1875.G18.gz43 238516 


F 


M00057021B:B07 


UC2-PrimColon 


2334 


432159 


1875.L18.gz43 238521 


F 


M00057027D:A12 


UC2-PrimColon 


2335 


736385 


1875.P18.gz43 238525 


F 


M00057032D:A04 


UC2-PrimColon 


2336 


619265 


1 875 .B 1 9 .gz43_23 8527 


F 


M00057014D:E05 


UC2-PrimColon 


2337 


4255 


1875.F19.gz43_238531 


F 


M00057019D:C02 


UC2-PrimColon 


2338 


554080 


1875.M19.gz43_238538 


F 


M00057028D.D09 


UC2-PrimColon 


2339 


734074 


1 875 .C2 1 .gz43_23 8560 


F 


M00057016A:B04 


UC2-PrimColon 


2340 


734177 


1 875 .E2 1 .gz43_23 8562 


F 


M00057018A:C05 


UC2-PrimColon 


2341 


727761 


1875.I21.gz43 238566 


F 


M00057024A:D08 


UC2-PrimColon 


2342 


733209 


1875.L21.gz43 238569 


F 


M00057027D:D07 


UC2-PrimColou 


2343 


512377 


1875.A22.gz43 238574 


F 


M00057014A:B11 


UC2-PrimColon 


2344 


733664 


1875B22gz43 238575 


F 


M00057014D:G10 


UC2-PrimColon 


2345 


640904 


1875 D22 ez43 238577 


F 


M00057016D:H03 


UC2-PrimColon 


2346 


425203 


1875.N22.gz43_238587 


F 


M00057030B:B03 


UC2-PrimColon 


2347 


557164 


1875.B23.gz43_238591 


F 


M00057014D:H01 


UC2-PrimColon 


2348 


494450 


1875.C23.gz43_238592 


F 


M00057016A:G01 


UC2-PrimColon 


2349 


639427 


1875.F23.gz43_238595 


F 


M00057020A:F09 


UC2-PrimColon 


2350 


554693 


1875.H23.gz43_238597 


F 


M00057022D:E10 


UC2-PrimCoIon 


2351 


732598 


1875.K23.gz43_238600 


F 


M00057026D:A05 


UC2-PrimColon 


2352 


495241 


1875,M23.gz43_238602 


F 


M00057029A:C08 


UC2-PrimColon 
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SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


2353 


735989 


1 875 .N23 .gz43_23 8603 


F 


M00057030B-F01 


T TP 9 -Prim Pnl nn 


2354 


726044 


1 875 .I24.gz43_23 86 14 


F 


M00057024BA11 


T TP9 -Prim Pnlnn 


2355 


728791 


1875.L24.gz43 238617 


F 


M00057027DG03 


T TP9-PrimPn1ori 


2356 


562769 


1875.N24.gz43 238619 


F 


M00057030C-A05 


T TP7-PrimPn1nn 


2357 


479208 


1 875 .P24.gz43_23862 1 


F 


M00057033AD08 


T TP?-PrimPr»lnn 


2358 


4244 


1876.C03.gz43 238656 


F 


M00057035PE10 


T TP^-PrimPoTnn 


2359 


732159 


1876.K03.gz43_238664 


F 


M00057046AF02 


UC2-Priin Colon 


2360 


473026 


1876.E04.gz43_238674 


F 


MOO 05 703 8 A:H07 


T JC2-PH m Col on 

W V>x« XT 1 iXlxV^UilJiX 


2361 


736385 


1 876 .B05 .gz43_23 8687 


F 


M00057034CG12 


T JCl-PrimColon 

VJVv^« X J.Xl&XV^V/XvlXl 


2362 


734466 


1 876. J05 .gz43_238695 


F 


M00057045AC04 


UC2-PrimColon 

VJ V_^x* X 1 JUU V^VUVUX 


2363 


668731 


1876.K05.gz43_238696 


F 


M00057046AG05 


UC2-PrimColon 


2364 


736881 


1876.H06.gz43_238709 


F 


M00057042DE06 


UC2-PrimColon 


2365 


466920 


1876.O06.gz43 238716 


F 


M00057051BE09 


T Jr , 2-PrimColon 

\J wx« X lllllv/VJlvll 


2366 


728353 


1876.E07.gz43 238722 


F 


M00057038C:B06 


UC2-PrimColon 

VJ\wx« X 1 XXXI V/U1VJ11 


2367 


732246 


1876.N07.gz43_23873 1 


F 


M0005 7049D-F 12 


I Jr*2-Primr i olon 

\-/ V^x* 1 lllll\»*VJlUil 


2368 


468296 


1876.A08.gz43 238734 


F 


M00057033CB03 


I Jr*2-PriTnCo1on 


2369 


454819 


1876.E09.gz43 238754 


F 


M00057038C-G08 


T TP9-PrimPolr»n 


2370 


481293 


1876.M10.gz43 238778 


F 


M00057048C-E04 


1 IP9 -Prim Pol on 

W V-yX# XT IxlUV^UXVJXX 


2371 


735815 


1876.Ell.gz43 238786 


F 


M0005 703 8D • A 1 2 


TJP2-Prin.Polon 


2372 


732315 


1876H12sz43 238805 


F 


M00057043A-G07 


T TP9-PrimPr»lon 


2373 


558573 


1876M12ez43 238810 


F 




T TP9 -Prim Pol on 


2374 


492627 


1876 G13 sz43 238820 


F 


M00057041D-r08 


T TP9 -P ri m Pol on 


2375 


732042 


1876 K14 sz43 238840 


F 


M000S7046C-F05 


T TP9 -Prim Pol on 


2376 


737087 j 


1876 N14 ez43 238843 


F 


M000570S0BF06 


T TP9 -Pri m Pol on 


2377 


456469 


1876 A16 sz43 238862 


F 


M000S70^^DF1 0 

IVlWV/v/J / V/J JJJ .X lv 


T TP 9 -Prim Pol on 


2378 


735871 


1876 116 ez43 238870 


F 

X 


1^1000^7044^-1^0^ 

iVXV/vw / U*t*tV/.iJv/J 


T TP 9 -Prim Pol on 


2379 


736855 


1876 F18 ez43 238899 


F 

X 


M00057040HH04 


T TP9 -Prim Pol on 


2380 


732562 ! 


1876M20ez43 238938 


F 


M00057048DH10 


T TP9 -Prim Pol on 


2381 


727321 


1876G22ez43 238964 


F 


M00057042BA10 


T TP9 -Prim Pol on 

VJ V^x- X 1 XillV^ VJ1U1X 


2382 


638857 


1876H22ez43 238965 


F 


M00057043CH1 1 


T TP 9 -Prim P ol on 

VJ V-/X- X 1 IXIXVv VF1V/X1 


2383 


735972 


1876 A23 ez43 238974 


F 


M00057034BB01 


T TP9 -Prim Polon 

KJ V_/x* X 1 xillVvUlLUx 


2384 


735054 


1876 L23 ez43 238985 


F 


M00057047DH04 


T TP9-PrimPolon 

VJ V_vX- X 1 XXIXV*' V/XVUX 


2385 


554854 


1885 M01 ez43 239018 


F 


M00057068DF04 


T TP9-PrimPolon 

KJ V^xV X X illiV^VJlVJH 


2386 


731262 


1885 O01 ez43 239020 


F 


M0005 707 1 A • A 1 0 


T TP9 -Prim Polon 

KJ V/x^ X 1JX11Vv\JXVU1 


2387 


729981 


1885 C02 ez43 239024 


F 


M0005 70^ ^ r ■ F0 1 


T TP9 -Prim Pol on 
Uv/xj n xxixvvtJivui 


2388 


735292 


1885.F02.gz43_239027 


F 


M00057059D:H09 


UC2-PrimColon 


2389 


726699 


1885.L02.gz43_239033 


F 


M00057067D:H06 


UC2-PrimColon 


2390 


624133 


1885.O02.gz43 239036 


F 


M00057071A:B04 


UC2-PrimColon 


2391 


645262 


1885.A03.gz43_239038 


F 


M00057053A:A02 


UC2-PrimColon 


2392 


558549 


1885.K03.gz43_239048 


F 


M00057066D:B03 


UC2-PrimColon 


2393 


736093 


1885.L03.gz43 239049 


F 


M00057068A:C10 


UC2-PrimColon 


2394 


733573 


1885.G04.gz43j239060 


F 


M00057061B:H02 


UC2-PrimColon 
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SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORBEN 
T 


CLONE ID 


LIBRARY 


2395 


481366 


1885.104.gz43_239062 


F 


M00057064B:H10 


UC2-PrimColon 


2396 


556385 


1 8 85 .N04 .gz43_23 9067 


F 


M00057070A:B07 


UC2-PrimColon 


2397 


559656 


1 885 .L05 .gz43_23908 1 


F I 


M00057068A:F05 


UC2-PrimColon 


2398 


446616 


1885 . A06.gz43_23 9086 


F 


M00057053A:D11 


UC2-PrimColon 


2399 


728445 


1885.C06.gz43_239088 


F 


M00057055C:H07 


UC2-PrimColon 


2400 


732223 


1885.D06.gz43_239089 


F 


M00057056D:F11 


UC2-PrimColon 


2401 


595506 


1885.G06.gz43_239092 


F 


M00057061C:D04 


UC2-PrimColon 


2402 


627515 


1885.K06.gz43_239096 


F 


M00057066D:E01 


UC2-PrimColon 


2403 


730528 


1885.L06.gz43_239097 


F 


M00057068A:F07 


UC2-PrimColon 


2404 


451233 


1885.E07.gz43_239106 


F 


M00057058D:G08 


UC2-PrimColon 


2405 


728884 


1885 .H07 .gz43_23 9109 


F 


M00057063A:C08 


UC2-PrimColon 


2406 


541793 


1885.107.gz43_2391 10 


F 


M00057064C:F11 


UC2-PrimColon 


2407 


736415 


1885.L08.gz43 239129 


F 


M00057068A:G05 


UC2-PrimColon 


2408 


552641 


1885.F09.gz43_239139 


F 


M00057060B:D07 


UC2-PrimColon 


2409 


736156 


1885.G09.gz43_239140 


F 


M00057061D:D03 


UC2-PrimColon 


2410 


560581 


1 8 85 .M09.gz43_239 1 46 


F | 


M00057069A:F09 


UC2-PrimColon 


2411 


654723 


1885.D10.gz43_239153 


F 


M00057057A:G01 


UC2-PrimColon 


2412 


731449 


1885.E10.gz43_239154 


F 


M00057059A:B04 


UC2-PrimColon 


2413 


452623 


1 885 .G10.gz43_23 9 1 56 


F 


M00057061D:F05 


UC2-PrimColon 


2414 


732072 


1885.M10.gz43_239162 


F 


M00057069A:H08 


UC2-PrimColon 


2415 


452224 


1885.P10.gz43_239165 


F 


M00057072C:A03 


UC2-PrimColon 


2416 


550637 


1885 .Kl 1 .gz43_239 1 76 


F 


M00057067B:C11 ! 


UC2-PrimColon 


2417 


| 453508 


1885.Mll.gz43 239178 


F 


M00057069B:A08 


UC2-PrimColon 


2418 


732872 


1885.A12.gz43 239182 


F 


M00057053C.B10 


UC2-PrimColon 


2419 


735283 


1885.P12.gz43_239197 


F 


M00057072C:A09 


UC2-PrimColon 


2420 


422590 


1 8 85 .D 1 3 .gz43_23 920 1 


F 


M00057057B:E06 


UC2-PrimColon 


2421 


422242 


1885J13.gz43__239207 


F 


M00057066A:A09 


UC2-PrimColon 


2422 


471982 


1 885 .D 1 4.gz43_2392 1 7 


F 


M00057057B:E07 


UC2-PrimColon 


2423 


736285 


1885.H14.gz43_239221 


F 


M00057063B:F06 


UC2-PrimColon 


2424 


734787 


1885.P14.gz43 239229 


F 


M00057072C:C02 


UC2-PrimColon 


2425 


733623 


1885.A15.gz43 239230 


F 


M00057053C:G04 


UC2-PrimColon 


2426 


736318 


1885.D15.gz43 239233 


F 


M00057057B:G03 


UC2-PrimColon 


2427 


677769 


1885 A16 ez43 239246 


F 


M00057053D:D01 


UC2-PrimColon 


2428 


726380 


1885.B16.gz43 239247 


F 


M00057055B:D07 


UC2-PrimColon 


2429 


134501 


1885 C16 ez43 239248 


F 


M00057056B:C06 


UC2-PrimColon 


2430 


731844 


1885.F16.gz43J239251 


F 


M00057060C:D05 


UC2-PrimColon 


2431 


730472 


1885.B17.gz43_239263 


F 


M00057055B:E10 


UC2-PrimColon 


2432 


455884 


1885.C17.gz43_239264 


F 


M00057056B:D05 


UC2-PrimColon 


2433 


669159 


1885.H17.gz43_239269 


F 


M00057063C:C04 


UC2-PrimColon 


2434 


736146 


1885.K17.gz43_239272 


F 


M00057067C:D04 


UC2-PrimColon 


2435 


729502 


1885.A18.gz43_239278 


F 


M00057054A:B12 


UC2-PrimColon 


2436 


731830 


1885.C18.gz43_239280 


F 


M00057056B:D11 


UC2-PrimColon 
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ID NO 
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SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


2437 


706245 


1885.N18.gz43 239291 


F 


M00057070C-H10 

iVJLvvv*/ 1 \J 1 uv^.l XXV/ 


T Tr"?-PrimfY>lnn 


2438 


631526 


1885.P18.gz43 239293 


F 


M00057072C-H01 


UC2-PrimColon 

WVil/ A XXIXl\_yt/i.WU. 


2439 


736113 


1885A19sz43 239294 


F 


M00057054AD0 1 

ITIVW*/ / V/ T\*X » X-^ V/ X 


T TP9-Prim Pnlnn 


2440 


567005 


1885 B19 sz43 239295 


F 


M00057055BF05 


T TP9 -PtimCnlriTi 


2441 


730128 


1885 Kl 9 ez43 239304 


F 


M00057067C-H09 


UC2-PrimCo1oii 


2442 


620462 


1885 E20 ez43 239314 


F 


M00057059CH02 


UC2-PrimColon 


2443 


450551 


1885 F20 gz43 239315 


F 


M00057060DC09 


IJC2-PrimColon 


2444 


724773 


1885 L20 ez43 239321 


F 


M00057068DB03 


IIC2-PritnCo1oTi 

V/V^^r X X XlllV^UJAJXI 


2445 


735028 


1885 N20 ez43 239323 


F 


M00057070DB08 




2446 


476947 


1885 E21 ez43 239330 


F 


M00057059DA09 




2447 


420958 


1885 121 ez43 239334 


F 


M00057065C-B07 


T TP9-pTimr , ri1nn 

VV/i X XXlll\_/WHJXl 


2448 


447758 


1885 021 ez43 239340 


F 


M00057071DE01 ' 

IVXVfVfV/^ 1 \J t XXV.XJ/v/X 


11(^9^14111^01011 

UV/i 1 1 XlliV^VJlWU 


2449 


447075 


1885 G22 ffz43 239348 


F 


M00057062BH04 

XVlUuUJ / V/VAt»XJ.XXV/*T 


T TP9 -Prim Pol nn 


2450 


455248 


1885 1\29 {*t43 9393^3 


X 


M000S7068Dr0Q 

IVXuvvJ /uv01/,V/\/7 


T TP? -PrimPoloTi 


2451 


712351 


1885 02? 0743 939356 


F 

X 


M0005707 1 DGO 1 

IVXV/V/V/^/ 1 \J 1 XXV. VJ V/ X 


T TP9-PrimPo1on 

\J V^Z, X 1 1111 V-/ \J1 \J11 


2452 


620159 


1885 P22 ez43 239357 


F 


M00057073 AB1 2 


T IP9-PrimPolon 

' Vx^£> X X XlllV^V/lWll 


2453 


472704 


1885 K23 ez43 239368 


F 

X 


M00057067T)F03 

lVXV/V/V/%/ 1 \J\J I XV. X v/J 


T TP9 -Prim Pol on 


2454 


649349 


1885 N23 ffr43 239371 


F 

X 


M00057070DG03 


T TP9-PrimPo1oTi 

\J \**t£* X7 X lxllV^VJxLUl 


2455 


735216 


1885 F24 ffz43 239379 


F 

X 


M00057061 AF09 


I TP? -Prim Pol on 


2456 


572273 


1885 T24 cr 7 41 219183 


F 

X 


M000 ^ 7066P -B09 


T TP9-PrimPolon 

VJ V/^> x 1 llxl w wxVlxl 


2457 


736210 


1885 L24 erz43 239385 


F 

X 


M00057068DE05 


UC2-PrimPolon 

KJ V-/^< X X Xillv^UJLUXl 


2458 


731358 


1886 M02 ez43 239418 


F 


M00057088BE02 


TJP2-PrimPo1on 


2459 


639629 


1886 D03 ez43 239425 


F 


M00057077BB06 


UC2-PrimPolon 


2460 


733910 


1886 L03 ffz43 239433 


F 

X 


M00057087B-A07 


UC2-PrtmColoTi 


2461 


473588 


1886 N04 £z43 239451 


F 


M00057090A:B02 


tJP2-PrimPn1on 

V_/X- X 1 XIX1\_/U1\_/X1 


2462 


727407 


1886 N05 ez43 239467 


F 


M00057090AC03 


UC2-PrimColon 


2463 


368965 


1886 H07 £z43 239493 


F 


M00057082CG03 


UC2-PrimColon 


2464 


734059 


1886 L13 ffz43 239593 


F 


M00057087DB04 


UC2-PrimColon 

»J V-'j- X X UIXV/VPXV/XK 


2465 


419465 


1886 P15 ez43 239629 


F 

X 


M00057092C A05 


TTP9-PrimPo1on 


2466 


732487 


1886 A18 ez43 239662 


F 

X 


M00057073D*H05 


I TP9 -PrimPolon 


2467 


538582 


1886 M20 ffz43 239706 


F 

X 


M00057089BD01 


T TP9-PrimPolon 


2468 


732131 


! 1 886 022 ffz43 239740 


F 

X 


M0005709 1 D-Fl 1 


T TP 9 -Prim Pol on 

V^^» X X Xlll V> VJ1VJX1 


2469 


556212 


1886 E23 £z43 239746 


F 

X 


M00057079AF05 


T TP9 -Prim Polon 


2470 


731966 


1 886 M94 az43 239770 


F 


M0nn57fl80r>-F03 


T TP9 -PrimPolon 

U V^Z, X 1 11I1V-/ vJl wli 


9471 




1 RR7 P09 CT743 93Q80Q 




Mnno57nQ*5R*fTOi 

IVxV/UV/J / V^Jx^.vJV/^ 


T TP9 -PrimPolon 

U v^Z, x IxlllV^L/Iwll 


2472 


732550 


1887.G02,gz43_239813 


F 


M00057099D:D11 


UC2-PrimColon 


2473 


731477 


1887.D04.gz43_239842 


F 


M00057096C:B08 


UC2-PrimColon 


2474 


485441 


1887.104.gz43__239847 


F 


M00057102A:F12 


UC2-PrimColon 


2475 


533689 


| 1887.B05.gz43J239856 


F 


M00057094B:D07 


UC2-PrimColon 


2476 


735515 


1887.H05.gz43_239862 


F 


M00057100D:E09 


UC2-PrimColon 


2477 


678846 


1887.M06.gz43_239883 


F 


M00057106A:H04 


UC2-PrimColon 


2478 


638908 


1887.P06.gz43__239886 


F 


M00057110CA04 


UC2-PrimColon 
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ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


2479 

Z*T 1 Zs 


516799 


1887.L07.gz43_239898 


F 


M00057105C:C06 


UC2-PrimColon 


2480 


553898 


1887A08.gz43J239903 


F 


M00057093C:A02 


UC2-PrimColon 


2481 


644354 


1 8 87.F09 .gz43_239924 


F 


M00057099A:C04 


UC2-PrimColon 


2482 


731697 


1887.K09.gz43_239929 


F 


M00057104D:A02 


UC2-PrimColon 


2483 


645139 


1 8 87 .F 1 0 .gz43_23 9940 


F 


M00057099A:H12 


UC2-PrimColon 


2484 


732885 


1887.O10.gz43_239949 


F ! 


M00057109A:F11 


UC2-PrimColon 


2485 


736634 


1887.A1 1 .gz43_23995 1 


F 


M00057093C:E01 


UC2-PrimColon 


2486 


457842 


1887.D1 l.gz43_239954 


F 


M00057096D:H05 


UC2-PrimColon 


2487 


654475 


1887.FlLgz43_239956 


F 


M00057099B:A06 


UC2-PrimColon 


2488 


543772 


1887.112.gz43_239975 


F 


M00057102B:E12 


UC2-PrimColon 


2489 


734344 


1887.L12.gz43_239978 


F 


M00057105C:G02 


UC2-PrimColon 


2490 


693869 


1887.N12.gz43_239980 


F 


M00057108B:A12 


UC2-PrimColon 


2491 


733*348 


1 887.0 12.gz43_23998 1 


F 


M00057109A:H09 


UC2-PrimColon 


7492 


733209 


1887L14gz43 240010 


F 


M00057105D:C10 


UC2-PrimColon 


9493 


401368 


1887 F 15 ez43 240020 


F 


M00057099B:D02 


UC2-PrimColon 


2494 




1887 117 ez43 240055 


F 


M00057102D:C11 


UC2-PrimColon 


2495 


736220 


1887.P17.gz43_240062 


F 


M00057110D:E12 


UC2-PrimColon 


7496 


735584 


1887.G18.gz43 240069 


F 


M00057100C:F02 


UC2-PrimColon 


9497 


448325 


1887 J19 ez43 240088 


F 


M00057104A:H06 


UC2-PrimColon 


949R 


739319 

/ JZJ az» 


1887 019 cz43 240093 


F 


M00057109C:D10 


UC2-PrimColon 


2499 


477387 


1887 C20 ez43 240097 


F 


M00057096B:C05 


UC2-PrimColon 


9500 


729222 


1887 D20 ez43 240098 


F 


M00057097B:E02 


UC2-PrimColon 


9501 




1887.F20.gz43 240100 


F 


M00057099C:A06 


UC2-PrimColon 


9509 


7351 13 


1887B21ez43 240112 


F 


M00057095A:F11 


UC2-PrimColon 


9503 


650920 


1887 121 az43 240119 


F 


M00057103A:F11 


UC2-PrimColon 


2504 


729560 


1887 024az43 240173 


! F 


M00057110A:A03 


UC2-PrimColon 


2505 


415825 


1 888 AO 1 .gz43_240 1 75 


F 


M00057112A:C12 


UC2-PrimColon 


2506 


733225 


1 888 .DO 1 .gz43_240 178 


F 


M00057115D:C10 


UC2-PrimColon 


2507 


732113 


1 8 8 8 .D 02 .gz43_240 1 94 


F 


M00057115D:D06 


UC2-PrimColon 


2508 


568632 


1 8 88 .L03 .gz43_2402 1 8 


F 


M00057126D:A04 


UC2-PrimColon 


2509 


736539 


1888J04gz43 240232 


F 


M00057124A:G08 


UC2-PrimColon 


2510 


602673 


1 888 .D05 ,gz43 240242 


F 


M00057115D:F06 


UC2-PrimColon 


951 1 


553264 


1888.F06.gz43 240260 


F 


M00057118B:E10 


UC2-PrimColon 


9519 


555655 


1888.G06.gz43 240261 


F 


| M00057120A:D01 


UC2-PrimColon 


9513 


736534 


1888K06ez43 240265 


F 


M00057125C:B11 


UC2-PrimColon 


2514 


451764 


1888.O06.gz43_240269 


F 


M00057129D:F03 


UC2-PrimColon 


2515 


732896 


1888.D07.gz43 240274 


F 


M00057116A:B06 


UC2-PrimColon 


2516 


733365 


1888.I07.gz43 240279 


F 


M00057122D:A01 


UC2-PrimColon 


2517 


736548 


1888. 008 .gz43_24030 1 


F 


M00057130A:A02 


UC2-PrimColon 


2518 


553338 


1888.H09.gz43 240310 


F 


M00057121B:H10 


UC2-PrimColon 


2519 


105056 


1888.K10.gz43 240329 


F 


M00057125C:E02 


UC2-PrimColon 


2520 


288535 


1888.O10.gz43 240333 


F 


M00057130A:A11 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SE0NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


9^9 1 


4jj / OD 


lft.R£ PM 0 cttA'* 9AfT3^7 
1000.\JlZ.gZ'fO_Z*fLOD / 


r 


IVIUUUD / lZUl^.DUy 


U ^Z-l^ilml^OlOll 


9^99 
ZdZZ 


^97ftQ 


lOQO TT1 0 rr-yA'* 9AAQ^ft 

1 ooo.xxlZ.gZ4j Z4Ujjo 


Jr 


JVIUUU3 / lZl^/.lilJo 


U UZ-JrnilH^OlOIl 


ZdZd 


I jZzfD 1 


1QQO Tin rrrrAI OAfM^Q 

1 0 o o . 1 1 Z .gZ4D__Z4U D D V 


r 


1VAAAA^71 9*5 A *PAO 
JV1UUU3 / 1Z jA.r Uy 


u uz-irriinL/Oion 


9^9A 
Z3Z4 


^a9aoa 

D4Z4yU 


1 ftftft TiTl 9 rrrr/lQ 9AA*2£1 

looo.xVlZ.gZ4o_Z4UJOl 


r 


A4"AAA^71 9^T^»'QA9 
JV1UUUD / 1ZDU.UUZ 


ui-/Z-i'ritncoion 


ZD ZD 


Aft7 
DD JUo / 


1 QQQ T 19 rr**A1 9AAQ£9 

1 o o o .Lj 1 Z .gZ4J_Z4 U j OZ 


r 


AAAAA^7 1 97 A »T? 1 A 
JVIUUUD / 1Z /A.JilU 


u L/Z-Jr nnii^oion. 


9^9£ 
Z3ZO 


^aqa^q 
ouynoy 


1 ftftft fil A crvAl 9AA^ftQ 
looo.vjl*f.gZ4<3 Z4Uooy 


■p 


1VA AAA^71 9AP«PAft 
IVIUUUj / IZUl^.JrUo 


u v^z-x riiiiv^oion 


9^97 
ZDZ / 


/JOolU 


looo.VJ14.gZ4.3_Z4U.3y / 


r 


AAnnn^7 1 3fi a -Ti i n 

1V1UUUD / 1jUA.JLI1U 


UwZ-Jrrini^OlOIl 


9^9ft 
ZDZo 




1 ftftO T rrrrAI 9 AAA1 0 

iooo .JL* 1 D .gZ*f 3_Z^f UH- 1 U 


p 

Jr 


TVAnnn^7197 A «T-T1 fi 
1V1UUU3 / 1Z /A.xxlU 


u L/Z-Jr nrnv_/Oion 


ZDzy 


jOUoOo 


i ftftft. wi 7 cvrAi 9AAA^ft 

looo.xxi / .gZ4:>_Z4U*f jo 


p 


lV>f HAA^7 1 99 A * A A7 


u i-/Z -i^iiin v^oion 


9^n 

ZDDU 


79£1 7£ 


1 ftftft PI 7 cxtAI 1A(\AAfk 


p 






9^Q 1 
ZDD 1 


79AG^9 


looo .A 1 o .gZ*f j_Z*f U4*f / 


P 

X* 


TV/fflAA^71 1 9Ti-r^Aft 
1V1UUUD / 1 IZJLJ.UUo 




9^9 

ZDDZ 


i a^h 


1 ooo.lxlo.gZHD ZHuH-DH- 


P 


TV/f AAA^7 1 99 A -f* AA 
1V1UUUD / IZZxY.VyU 4 !- 


%J LZ"r IIIIIV^OIOII 




*fD4yoi 


1 Tift ct^AI OACiA^ ft 


p 

X* j 


1V1UUUD / 1Z / X5.U1U 


T TP 9 _"D-rim Pnl rm 
U l^Z-l^ITlTlV_/010n 


ZDD4 


AA^ ^ 1 7 
04.3D1 / 


1 o o o . 1 ¥ .gZ4«5_Z4U403 


p 
r 


iv>fnnn<\7i 1 ^rpi 1 

IVIUUUD / 1 IjL.DI 1 


T TP9_Pfn*nr*n1nTi 

u \^z _ x iiiiiv^oion 


9^^ 


ijD\J\JO 


1 Rftft T-T10 cttAI 9AOA7H 
looo .1x1 z? .gZH- 3 _ZHU4 / U 


p 
r 


\>innn^7 1 99 a -r* 1 1 

1V1UUUD / lZZA.L^l 1 


T TOO «."D*i m C*r\\ rvn 

U VyZ^x IllTl\_/OIOIx 


9^£ 
ZD DO 


/JJ 1/1 


1 ftftft A9H 9AOA7Q 
looo./YZU.gZH- j_Z*fU** /y 


p 
r 


A/ffinA^7 1 1 1 A • A Aft 


U V^Z -x IllTl\^010Il 


ijj / 




1 ftftft T59H rrrrA^ 9AHAftH 
1 ooo.r>ZU.gZ4D__Z4U4oU 




A>innn^7i izir , -PA7 


TTP9 Pi-rrviPAI^vri 


ZDDO 


79ftA7Q 


1 ftftft K91 rvy/l^ 9AfiAOA 
1 ooo .r5Z 1 .gZ4o_Z4U4yo 


p 
r 


AAAAA571 1Ar*«PAft 
JVIUUUD / 1 14t^.rUo 


u \sZ -i^iTni^oion 


ZD^y 


79/^7^ 
/ZOD ID 


1 ftftft T91 rrrrAI OAfi^fiA 

1 o o 5 . JZ 1 .gZ4D_Z4U3 U4 


P 


iv>rrtnn^7 1 9 ^ a • a cyj 


U l^Z-l^IxmvAxlOll 


ZD4U 


11A{i(\£. 
/34oUO 


1 ftftft T99 r*r*A 1 OAA^IO 
looo.lZZ.gZ4o_Z4UD iy 


p 
r 


AAAAA^71 9^T^»PA7 


uv^z-Jrniiiv^/Oioii 


Z54I 


73Z4/0 


iooo .UZ3 .gz4 J_Z4UD 3 U 


T7 

r 


AAAAA^71 1 A^-TJTAO 


u i^z-r*nitii^oioii 


ZD4Z 


ODOZOD 


1 ftftft XJO'2 (v-zll OArt^QA 

looo .r\Z5 .gZ43_Z4UD34 


p 
r 


AAAAA^71 99Tl'PAA 
JVIUUUD / !ZZl3.rU4 


U L/Z-r nmi^oion 


ZD43 


ooZol / 


1 ooo.jSZj .gZ43_Z4UD3 / 


p 
r 


AAAAA^71 9/i/ w *«/^A^ 


u i^z-i*nincoioii 


2544 


A1A 1 AO 

47410© 


looo .UZ4 .gz4 J_Z40 54o 


r 


A>!AAAC71 1£n<D1 A 
JV1UUUD / 1 1 Oil .0 1 U 


u cz-i*nnicoion 


ZD4D 


4DDDol 


looo.JZ4.gZ43_Z4UDDZ 


P 
r 


AAAAA^7 1 9 ^ A -PA7 
jMuUUD / iZDA.l*U / 


T TPO Drirvt Prxl i-wi 

u i^z-i^niiiv^oioii 


ZD4o 


44/oyz 


1 o o o . JLZ4 ,gZ4 o__Z4U j D4 


p 


A>f AAA<71 97r n «PA'2 


T TP 9 "Dfi-mPrfVl rtlt 

u i^z-ir nmcoioii 


9<A7 
ZD4/ 


7*5^^ 1 A 
/ 3DD 14 


1 ftftft lVvf9A rrvAl 9Af\<\^^ 
looo. 1V1Z4 . gZ4 O Z4 U3D J 


p 
r 


A AS A A A ^ 7 1 9 ft C 1 0HTfi9 
1VIUUUD / 1ZoV^.x1UZ 


u \^z-i^niiiv-/Oion 


ZD4o 


AAA9ftA 
44UZ54 


1 ftftft XT9A n*rA1 9Art^</i 
1 5oo.rsZ4.gZ4j_Z4UDOO 


p 


AAAAA<71 90TVTW7 

jyiuuud / izyL/.uu / 


T TP9 D-r-f-rvi f^r^lrvty 

u v^z-r nm^/Oion 


Z34y 


ODyo44 


1 ftQ7 AH9 nc^AI 9AA^7<I 
loy / .iVUZ.gZ4D_Z4UD ID 


p 


A/TAAA^71 ^9Tl«tfA^ 


T TP 9 Drttn P/-\1 r\n 


ZDDU 


DOOU3 1 


1 ftQ7 CU\1 er^AI 9AA^ft1 
1 ©y / .VJUZ.gZ4j_Z4UDo 1 


p 
r 


AvTAAA^71 QQA «r*Aft 


TTP9 PrimPrtlnn 

u \sZ -Jr nniLX)ioii 


ZDD 1 




1 ftQ7 T H9 cttAI 9Art<ft^ 
1 07 / .JLUZ.gZ4j_Z4U3oO 


Jr 


AA"AAA^7 1 AATi'T^A^ 


TTP9 DrimPAlnn 

u v^z-i^nini^oioii 


9^9 

Z33Z 


A£9770 
4uZ / / y 


1 ftQ7 A>f A9 cttAI 9AfKft7 
1 07 / .1V1UZ .gZ4 j__Zh\Jd o / 


P 


A>TA AA^7 1 A^Ti'T7A 1 


TTP9 "D*n*r»Pr*1r\ri 
U l^Z-x^xmLXJlOIl 


ZDDD 


4/ZUOo 


1 ftQ7 C^Cil cttAI 1Af\*Q1 
1 oy / . vAo .gZ43__Z4U D^D 


p 
r 


AAAAA^71 ^ART^nA 
JVIUUUD / 1D4O.1-IU0 


TTP9 "D«mP/%1/M^ 

u i^z-i^niTi\-.oion 


ZDD4 


/Zo41:> 


1 0<V7 TVT^Q r*r*A1 *}A(\CE\A 

1 oy / . jNUo .gZ4 j__Z4UoU4 


p 
r 


AiffAAA^T 1 A 7 A «I2 A7 

IVIUUUD / 14 /A.JOU / 


T TP9 'Dvi-m P/^1 rtn 

u i^z-i^nini^oion 


9<^^ 
ZDDD 


c 1 <y>7ft 


1 ftQ7 \>(YX rrrrAI 9AH^AA 

1 oy / .r\J D .gZ4 j_Z4UOU0 


p 
r 


AAAAA^71 AO*R'"RA7 
JV1UUU0 / 14yo.JDU / 


TTP9 T5t-imPi-»1riri 


2556 


733951 


1897 A04 ez43 240607 


X 


M00057 1 32CC04 


UC2-PrimColon 


2557 


448212 


1897.P04.gz43_240622 


F 


M00057149B:B08 


UC2-PrimColon 


2558 


536225 


1897.B05.gz43_240624 


F 


M00057133C:B02 


UC2-PrimColon 


2559 


736288 


1897.F05.gz43_240628 


F 


M00057138A:F02 


UC2-PrimColon 


2560 


225960 


1897.E06.gz43_240643 


F 


M00057137A:H12 


UC2-PrimColon 


2561 


655327 


1897.H08.gz43_240678 


F 


M00057140B-.H01 


UC2-PrimColon 


2562 


735729 


1897.N08.gz43_240684 


F 


M00057147A:H07 


UC2-PrimColon 



133 



3NSDOCID: <WO 021 4500A2_I_> 



WO 02/14500 



PCT/US01/25840 



Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


9563 


642273 


1897F09ez43 240692 


F 


M00057138B:B02 


UC2-PrimColon 


9564 


540985 


1897O09sz43 240701 


F 


M00057148C:B02 


UC2-PrimColon 




735756 


1897 P1 1 <7z43 240734 

1 / .X X X ,^£*TJ J*T\S 1 -7*T 


F 

X 


M00057149CH01 


UC2-PrimColon 


9566 


640667 


1897 E12 ffz43 240739 


F 


M00057137B:C07 


UC2-PrimColon 


9567 


543772 

J*TJ tlx* 


1897 F12 ez43 240740 


F 


M00057138B:H02 


UC2-PrimColon 


9568 


735801 


1897M12az43 240747 


F 


M00057146A:H05 


UC2-PrimColon 


9560 


457846 

HJ / 0*TV7 


1897N12ffz43 240748 


F 


M00057147C:B01 


UC2-PrimColon 


9^70 
zj / yj 


73631 8 


1897 019 P743 240749 


F 


M00057148CC09 


UC2-PrimColon 


9571 


658971 


1897 K13 az43 240761 


F 


M00057144B:D04 


UC2-PrimColon 


9579 
Zj /z 


735983 

/ jJZOj 


1897M13ffz43 240763 


F 


M00057146BC06 


UC2-PrimColon 


9573 
Z.} / J 


Z147550 


1897 A15 az43 240783 


F 


M00057133A:B07 


UC2-PrimColon 


9574 

Z.J /*T 


500758 

Jul/ / JO 


1 897 HI 5 P743 240790 

107 / .X X J. *J X-f~ V/ / .7 V/ 


F 


M00057140D:B03 


UC2-PrimColon 


9575 


567005 


1897 HI 7 ffz43 240822 


F 


M00057140D:F02 


UC2-PrimColon 


9576 
z.} / o 


593171 

JZj 1/1 


1 897 K 1 7 pz43 240825 


F 


M00057144B:H10 


UC2-PrimColon 


9577 


30751 5 

J^ / J u 


1 897 Ml 7 ffz43 240827 


F 


M00057146B:H12 


UC2-PrimColon 


9578 
Zj / o 


7363 1 8 


1 897 F1 8 a743 240836 


F 


M00057138D:F03 


UC2-PrimColon 


9570 

Zj 


44Q035 


1 897 T1 8 a^43 240839 


F 1 


M00057142A:H07 


UC2-PrimColon 


958n 


486076 


1 897 K 1 8 ^743 240841 


F 


M00057144C:A02 


UC2-PrimColon 


9581 


790843 


1897 018(7743 240845 


F 


M00057 149A: A04 


UC2-PrimColon 


9589 

ZJOZ 


796440 


1897 F19 ffz43 240852 


F 


M00057138D.F10 


UC2-PrimColon 


9583 


731317 


1897 G19 ez43 240853 


F 


M00057139D:G07 


UC2-PrimColon 


9584 


550004 


1 897 122 ez43 240903 


F 


M00057142B:F06 


UC2-PrimColon 


9585 

ZJOJ 


448909 


1 897 L22 ez43 240906 


F 


M00057145C:H03 


UC2-PrimColon 


9586 

ZJOU 


71 8314 

/ 1 0 J If" 


1 897 N22 ffz43 240908 


F 


M00057147D.H09 


UC2-PrimColon 


9587 


735989 


1897 C23 ez43 240913 


F 


M00057135A:B02 


UC2-PrimColon 


9588 


489001 


1897 F23 ez43 240916 


F 


M00057139A:B10 


UC2-PrimColon 


9580 


467779 


1 897 H23 ffz43 240918 


F 


M00057141A:G06 


UC2-PrimColon 


9 son 


596809 


1898 C02 sz43 242213 


F 


M00057153B:G07 


UC2-PrimColon 


9501 


447003 


1 898 A03 az43 242227 


F 


M00057150D:C04 


UC2-PrimColon 


9509 


73066 7 

/ JUUU 1 


1 898 D04 az43 242246 

1070 .A-/ V7*T .^Z.*T*7 Z»^Z»<t.T^V7 


F 


M00057155A-E11 


UC2-PrimColon 


95Q3 


461316 

HOJ.J lO 


1 898 H04 ar43 242250 


F 


M00057160D:A11 


UC2-PrimColon 


95QA 


450794 

HJU /Z*t 


1 808 AOS a^43 949959 


F 

X 


M00057150DF08 


UC2-PrimColon 






1 808 P05 ot43 949960 

1 0^70 .Dwj .gZHJ ZtZZOu 


X 


M00057152BH02 


UC2-PrimColoii 


95Q6 


777608 
/Z /OUo 


1 808 F05 a743 242964 


F 

X 


M00057157DH08 


UC2-PrimCoIon 


9507 

zjy / 


0*f70JZ 


1 808 P05 ot43 742274 

lOfO.XT v/-7.g^'T«7 ztzz / *t 


F 

X 


M00057169AF05 


UC2-PrimColon 


2598 


650297 


1 898.A06.gz43_242275 


F 


| M00057151A:B04 


UC2-PrimColon 


2599 


491933 


1898.E06.gz43_242279 


F 


M00057156B:D10 


UC2-PrimColon 


2600 


733552 


1898.P06.gz43_242290 


F 


M00057169A:F06 


UC2-PrimColon 


2601 


730341 


1898.D07.gz43 242294 


F 


M00057155A:G11 


UC2-PrimColon 


2602 


640563 


1898.D08.gz43_2423 10 


F 


M00057155A:H07 


UC2-PrimColon 


2603 


640603 


1898.B10.gz43 242340 


F 


M00057152C:C10 


UC2-PrimColon 


2604 


552972 


1898.E10.gz43 242343 


F 


M00057156C:E08 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY ! 


2605 


736030 


1 898.F 10.gz43_242344 


F 


M00057158B-C02 


T TP9-PrimPri1nn 
vj x x xiiiv^tJiun 


2606 


736228 


1898.G1 l.gz43_242361 


F 


M00057159CE1 1 


T TP? -Prim r*n1rm 


2607 


648820 


1898.Mll.gz43 242367 


F 


M00057 165D-H05 


T TP 7 -Prim Pr»1 ah 
vJ V_^Z« ~x I 111 IV> OlOIl 


2608 


735993 


1898 D12 ez43 242374 


F 




T TP?-PrimPr»lriTi 


2609 


402516 


1898.H12.gz43_242378 


F 


M00057161BD02 


T TP7 -Prim Pol nn 


2610 


583625 


1898.012gz43 242385 


F 


M 0005 7 1 68R • D07 


T TP7-PrimPnlr»n 


2611 


726448 


1898.N13.gz43 242400 


F 


M00057167AD08 


T TP?-PrimPnlriTi 

VJ v>Z, X X XXllv^VJXlJlX 


2612 


735756 


1898H14ez43 242410 


F 


M00057161BE07 


T TP? -PrimPnl nn 
VJV-/Z- "x x xxxx vJiun 


2613 


551654 


1898.C15.gz43 242421 


F 


M00057153DH01 

IX WJ \J\J \J / X *J J LJ .X XV/ X 


T TP?-PrirnPr»1rin 

vJ \^/Z,-x X XlllV^VJXUJJl 


2614 


558477 


1898 D15 ez43 242422 


F 


M000571 55r-rr04 


T TP?-PrimPn1rrt-i 
VJ V^/Zr""X 1 IXIlV-xVJlvjIl 


2615 


728353 


1898 115 ez43 242427 


F 


M00057162BE1 0 


T TP? -PrimPnl nn 


2616 


736634 


1898.M15.gz43 242431 


F 


M00057 1 66BB07 


T TP?-PrimPn1nn 


2617 


737006 


1898 P15 ez43 242434 


F 


1VXV/WWJ / J.VJS'^/.XX.I.vl 


T TP? -PrimPnl n« 


2618 


736288 


1898 J16 2z43 242444 




M00057i63r , nni ! > 


T TP?-PrtmPnlnn 
VJ v^^-Jr xxlxlv-/010n 


2619 


734794 


1898 P16 ez43 242450 


F 

X 


M00057 1 9 


T TP7 -PrimPnl /wi 

vj Vsz, -Jrixniv^OxOIl 


2620 


734928 


1898 El 7 ez43 242455 


F 

X 


M000571 5fin-F0? 


T TP? -PrimPnl nn 
U V_/Z.~x XXIIll^VJlOlX 


2621 


727976 


1898F17ez43 242456 


F 

X 


M00057158rri0 


T TP? -PrimPnl nn 

vj v^z. -x xiiixv_aj10I1 


2622 


736790 


1898 F19 ez43 242488 


F 




T TP? -Pi-tmPnl nn 
VJ xxxTlV^OlOn 


2623 


735633 


1898 T21 ffz43 242523 


F 


ivrnnn57 1 ^r-rn i 


TTP9 Pt-itviP^x1/^*i 

vj v^z-x^nxni^oioii 


2624 


734484 


1898 B22 £z43 242532 


F 

X 


M000571 53RA04 


T TP?-PrimPn1nn 
U \^/Z.~x X IXXXV^OIOIX 


2625 


480142 


1898D22ctz43 242534 


F 


M000571 55n*F19 


T TP?-Pfi mPnlrfcn 
VJ \^Z-x^xxIxlv^OiOIl 


2626 


454073 


1898 G22 ffz43 242537 


17 

X 


M00fi571 ^OR-nOR 

/ lUU£>.%J\JO 


T TP? -Pri m P nl nn 

Uv^z-xruxiv^oxoix 


2627 


599714 


1898 L22 az43 242542 


X 


M00O571 fiSPFOR 


T TP 9 -Pri mPnl nn 


2628 


737042 


1898 M22 ez43 242543 


X 


IVxv/V/V/J / XV/OJj.Vj vO 


T TP9-Prim Pnlnn 
VJ V^Z,~x iXJnLXV^OxOIX 


2629 


405016 


1898 N22 ez43 242544 


F 

X 


M00057167r*A05 


T TP9 -Prim Pnlnn 
UV/Axl XX XXV^ (JXIJIX 


2630 


723915 \ 


1898 L23 ez43 242558 


F 


M00057 165CF05 

XVXV/vlV/J / 1VJJV/.1 V/»7 


T TP9-Prim Pnlnn 
VJ V_/Z x X XXllV^VyXUXX 


2631 


737006 


1898 023 sz43 242561 


F 

X 


M00057 1 68r-D03 


T TP? -Prim Pnlnn 

VJ V-/Z x X XXI XV_AJI<JXX 


2632 


734808 


1898 B24 ez43 242564 


F 

X 


M000571 53B-nO? 

1VXV7WV/-7 / i.JJD,l/vL 


T TP? -Prim Pnlnn 
VJv>Z. x I XXXXV-AJXlJXl 


2633 


471887 


1898 P24 ez43 242578 


F 

X 


M00057170AT)1 1 


T TP9-PrimPn1nn 
VJ ^Z xvXXXIIV»>rV/X<JXx 


2634 


733972 


1899 A01 ez43 242579 


F 

X 


XVXV/V/V/*/ J x t urV.i-/UU 


T TP? -Prim Pnlnn 
VJ Vw/Zr "X I XXXI V^UXvJXX 


2635 


554221 


1899 J01 ez43 242588 


F 

X 




T TP? -Prim Pnlnn 
VjV^Z"x IxlXXv^OlOxX 


2636 


735426 


1899N01ez43 242592 


X 


M 00057 1 0? A T) 1 7 


T TP? -Prim Pnlnn 

u^z~x ixni.\^oxuxx 


2637 


454812 


1899 P01 P743 247594 


F 


M0005710ilP'A03 


T TP 9 -PrimPnl n« 

vj i^z -x^nnxv-zOioix 


2638 


562453 


1899 D02 ff?43 247598 


F 

x^ 


1V/T00057 1 73 Ti-R 1 7 

IVxV/VJvJJ / 1 / Ji/,D1Z 


T TP? -PrimPnl nn 
U v-'Z - JT XXlIxVw'OlOIX 


2639 


554683 


1899 M02 pz43 247607 


F 

X 


A/f 000 5 7 1 8 OT1 -t^OS 

XVxV/v/V/J / X O "X-/.VJV/0 


T TP?-Prt mPnlnn 

vj v^z -xxiiriv^oxori 


2640 


474346 


1 899.N02.gz43_242608 


F 


M00057192A:F01 


UC2-PrimCplon 


2641 


729039 


1899.D03.gz43J242614 


F 


M00057173D:C07 


UC2-PrimColon 


2642 


560183 


1899.J03.gz43_242620 


F 


M00057181C:G07 


UC2-PrimColon 


2643 


422590 


1 899.D04.gz43_242630 


F 


M00057173D:E04 


UC2-PrimColon 


2644 


642936 


1899.K04.gz43__242637 


F 


M00057182C:C03 


UC2-PrimColon 


2645 


730106 


1 899.N04.gz43 J242640 


F 


M00057192A:H05 


UC2-PrimColon 


2646 


449548 


1899.H05.gz43_242650 


F 


M00057179C:B05 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORTEN 
T 


CLONE ID 


LIBRARY 


2647 


736861 


1899.P05.gz43 242658 


F 


M00057194C:F02 


UC2-PrimColon 


2648 


730220 
/ «j \j ±* \j 


1899.A06.gz43 242659 


F 


M00057170A:H04 


UC2-PrimColon 


7649 


736309 


1899.I06.gz43 242667 


F 


M00057180C:F09 


UC2-PrimColon 


7650 


373615 


1899K06ez43 242669 


F 


M00057182C:C11 


UC2-PriiiiColoii 


7651 


640635 


1899O06gz43 242673 


F 


M00057193C:F11 


UC2-PrimColon 


7657 


735764 


1899F07gz43 242680 


F 


M00057176A:H10 


UC2-PrimColon 


7653 


735801 


1899J07ez43 242684 


F 


M00057181D:C09 


UC2-PrimColon 


2654 


477098 


1899.K07.gz43 242685 


F 


M00057182D:A08 


UC2-PrimColon 


2655 


73403 1 


1899.M07.gz43 242687 


F 


M00057191A:H09 


UC2-PrimColon 


2656 


419711 


1899N07gz43 242688 


F 


M00057192B:D10 


UC2-PrimColon 


2657 


390968 


1899.P07.gz43_242690 


F 


M00057194C:F10 


UC2-PrimColon 


2658 


732625 


1 899.E08 .gz43_242695 


F 


M00057175B:G12 


UC2-PrimColon 


765Q 


462986 


1899 F08 ez43 242696 


F 


M00057176A:H12 


UC2-PrimColon 


2660 


482512 


1899.H08.gz43 242698 


F 


M00057179C:G10 


UC2-PrimColon 


7661 


473238 


1899C09ez43 242709 


F 


M00057172D:F08 


UC2-PrimColon 


7662 


735789 


1899 109 ez43 242715 


F 


M00057180D:C10 


UC2-PrimColon 


7663 


736595 


1899 K09 sz43 242717 


F 


M00057182D:B11 


UC2-PrimColon 


7664 


735514 


1899 M09 ez43 242719 


F 


M00057191B:C06 


UC2-PrimColon 


7665 


6305 1 6 


1899O09sz43 242721 


F 


M00057193D:F06 


UC2-PrimColon 


7666 


730899 


1899 N10 sz43 242736 


F 


M00057192C:B11 


UC2~PrimColon 


7667 

ZOO / 


557055 


1899P10ez43 242738 


F 


M00057194D:C12 


UC2-PrimColon 


7668 

Z.DUO 


578404 


1899 Cll ez43 242741 


F 


M00057173A:C07 


UC2-PrimColon 


ZOO" 


734209 


1899Dllez43 242742 


F 


M00057174B:C06 


UC2-PrimColon 


7670 


457243 


1899 Jll ffz43 242748 


F 


M00057181D:H07 


UC2-PrimColon 


2671 


734061 


1899 Oil sz43 242753 


F 


M00057194A:B07 


UC2-PrimColon 


7677 


675988 


1899G12ez43 242761 


F 


M00057177C:D07 


UC2-PrimColon 


7673 


561632 


1899D13ez43 242774 


F 


M00057174B:C11 


UC2-PrimColon 


7674 


731370 

/ J A-/ /V/ 


1899 E13 ez43 242775 


F 


M00057175C:D02 


UC2-PrimColon 


7675 


480142 


1899 G13 ez43 242777 


F 


M00057177C:G11 


UC2-PrimColon 


2676 


736049 


1899 013 ez43 242785 


F 


M00057194A:C06 


UC2-PrimColon 


JLX) 1 I 


775691 


1899 A 14 ez43 242787 


F 


M00057170C:C01 


UC2-PrimColon 


7678 


777523 


1899 M14 ez43 242799 


F 


M00057191C:D02 


UC2-PrimColon 


7670 


47 1 794 


1899D15ez43 242806 


F 


M00057174B:F11 


UC2-PrimColon 


7680 


737429 


1899 G15 ez43 242809 


F 


M00057177C:H04 


UC2-PrimColon 


7681 


530883 


1899H15sz43 242810 


F 


M00057180A:H11 


UC2-PrimColoii 


2682 


729173 


1899.P15.gz43_242818 


F 


M00057196A:E03 


UC2-PrimColon 


2683 


734744 


1899.D16.gz43 242822 


F 


M00057174B:G03 


UC2-PrimColon 


2684 


735605 


1899.F16.gz43_242824 


F 


M00057176B:F10 


UC2~PrimColon 


2685 


! 726786 


1899.N16.gz43_242832 


F 


M00057192D:G02 


UC2-PrimColon 


2686 


725089 


1899.016.gz43_242833 


F 


M00057194A:F01 


UC2-PrimColon 


2687 


448770 


1899.D17.gz43_242838 


F 


M00057174B:G12 


UC2-PrimColon 


2688 


532904 


1899.E17.gz43 242839 


F 


M00057175D:A12 


UC2-PrimColoii 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T ! 


CLONE ID 


LIBRARY 


2689 


560568 


1899.K17.gz43_242845 


F 


M00057186A:E12 


UC2-PrimColon 


2690 


736402 


1 899 .N 1 7 .gz43_242848 


F 


M00057192DG04 


UC2-PrimColon 


2691 


516729 


1899.P17.gz43__242850 


F 


M00057196BH10 


UC2-PrimColon 


2692 


718314 


1899.H18.gz43_242858 


F 


M00057180BC06 


UC2-PrimCrj1oTi 


2693 


737116 ! 


1 899. Jl 8.gz43_242860 


F 


M00057182AH07 

ITJIVVV*/ » -A. U^l X.X XW # 


UC2-PrimColon 

VV X X XlXl.V_sVJAVJXX 


2694 


723985 


1899.B19.gz43_242868 


F 


M00057172AB02 


UC2-PrimColon 


2695 


573733 


1899.F19.gz43 242872 


F 


M00057176C:H08 


UC2-PrimColon 

X_sx» X X J_LlA\_svJxVyAX 


2696 


475872 


1899.H19.gz43_242874 


F 


M00057180B:D01 


UC2-PrimColon 


2697 


625988 


1899.F20.gz43_242888 


F 


M00057176D:A03 


UC2-PrimColon 


2698 


736354 


1 899.L20.gz43„242894 


F 


M00057189C:G11 


UC2-PrimColon 


2699 


535866 


1 899. C2 1 .gz43_24290 1 


F 


M00057173C:C07 


UC2-PrimColon 


2700 


648221 


1 899.D2 1 .gz43_242902 


F 


M00057174C:G07 


UC2-PrimColon 


2701 


736280 


1 899 .H2 1 .gz43_242906 


F 


M00057180B:F05 


UC2-PrimColon 

V-/ V/m X JL 1111 V-/\-71\/xx 


2702 


156329 


1 899.N2 1 .gz43_2429 12 


F 


M00057193B:C11 


UC2-PrimColon 

aw JL X XA Jl A V/V/A\/J>JL 


2703 


721768 


1899.021.gz43 242913 


F 


M00057194B:E05 


UC2-PrimColon 


2704 


639507 


1 899 .B22.gz43_2429 1 6 


F 


M00057172A:H06 


UC2-PrimColon 


2705 


451383 


1899.E22.gz43_242919 


F 


M00057175D:E12 


UC2-PrimColon 


2706 


349977 


1899.J22.gz43 242924 


F 


M00057182BD09 


UC2-PrimColon 


2707 


734928 


1899M22sz43 242927 


F 


M00057191DG10 


UC2-PrimColon 

\J VrZ< X XXX&X\^V7XWXX 


2708 


480623 


1899.A23.gz43 242931 


F 


M00057170D:B08 


UC2-PrimColon 

w \>>J^ X A 1111 V> v/AV/ll 


2709 


674526 


1899.H23.gz43 242938 


F 


M00057180BG06 


UC2-PrimColon 


2710 


730805 


1899.M23.gz43 242943 


F 


M00057 1 92AB04 


UC2-PrimColon 


2711 


640350 


1900.G01.gz43 242969 


F 


M00057206A-C06 


UC2-PrimColon 

x^*^^ x x xjLxji v^^^xvyxx 


2712 


726449 


1900.K01.gz43 242973 


F 


M00057210AC12 


UC2-PrimColon 

\—/ V-^-C/ X X XLIXV^VSXVSXX 


2713 


552457 


1900.E02.gz43 242983 


F 


M00057203B:F08 


UC2-PrimColon 

•\ m0 F X-^X^ X X AAAA V/VA\/AA 


2714 


730308 


1900C04ez43 243013 


F 


M00057200DE03 


UC2-PrimColon 

V-/ X X XLXXV^V/XV/XX 


2715 


734993 


1900 E04 sz43 243015 


F 


M00057203CA09 


T JC2-PrimColon 

W V/Z< X X XXXXV^WXWXI 


2716 


562320 


1900H04gz43 243018 


F 


M00057207BF06 


UC2-PrimColon 

V-/ V/i-- X X XXXX^/V/XV/XX 


2717 


556656 


1900 O04 sz43 243025 


F 


M00057216DD05 

ATA V \J\J ~J / ^JLUi/.i/Vt/ 


UC2-PrimColon 

V^^# X X XXXXV./V/XV/XX 


2718 


446752 


1900.D06.gz43 243046 


F 


M00057202AD05 

1T1W v«/ # «rvJ#X JW»X^ 


UC2-PrimColon 

^^Aa* X X AlAAx^V/AV/AA 


2719 


736738 


1900.L06.gz43 243054 


F 


M00057211C # C06 


UC2-PrimColon 

V^/ V^^« X JL XA A A. V^A. VJ'AX. 


2720 


639395 


1900 O07 sz43 243073 


F 


M00057216DF10 

XfiVUv*/ / j^r X \J -X X V 


UC2-PrimColon 


2721 


516729 


1900 B08 az43 243076 


F 


M00057199BB06 


UC2-PrimColon 


2722 


557262 


1900 E08 sz43 243079 


F 


M00057203DA09 


UC2-PrimColon 

V>xV X X AAH V> V/l V/1A 


2723 


731531 


1900.I08.gz43 243083 


F 


M00057208B:H08 


UC2-PrimColon 


2724 


736778 


1900.K08.gz43_243085 


F 


M00057210C:D09 


UC2-PrimColon 


2725 


724638 


1900.O10.gz43_243121 


F 


M00057217A:A12 


UC2-PrimColon 


2726 


483061 


1900.B12.gz43_243140 


F 


M00057199C:H08 


UC2-PrimColon 


2727 


j 725905 


1900.C12.gz43_243141 


F 


M00057201A:H03 


UC2-PrimColon 


2728 


648757 


1900.M12.gz43_243151 


F 


M00057213C:D06 


UC2-PrimColon 


2729 


735797 


1900.A14.gz43_243171 


F 


M00057197D:H10 


UC2-PrimColon 


2730 


732771 


1900.114.gz43_243179 


F 


M00057208C:E12 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


07^1 
LI 3 1 


7^^77A 
/jOz/O 


i onn f i s a^d^ oj.^101 




M00OS7904AF1 1 


T TP?-Primr , rvlnn 




AAQn7Q 


ionn oio a74^ Oznofis 


r 


M00OS79 1 7R-F 1 9 

IVlUuuJ / £m X / XJ.X X 


Ur2-PrimColon 


LI DD 


1 A1 A£Q 
lO 14oV 


1QOO AOO ftvdl 9A^967 
l^UU./xfcV/.g/iT-j Z*tDZ\j I 


17 


M000S7 1 9RRT*09 




LI J** 


7*}A00£ 
IdHLLO 


1 Onn 1790 9 A^O 79 

iyuu..r zu. gz*K> lhdl i z 




M000S790SDO06 

lVlV/vv/3 / Z,\J~)U.\J\J\J 


T TP9 -Prim PaI ati 


LI dd 


/303UU 


i onn noo avAi oa^07^ 

iyUU.*JTZU.gZ*+3 Zs\DZID 


P 


M000S79O7A-AO7 


T TP9-PrimPAl An 


LI DO 


7'3'3070 
/DDL 1 L 


1 Qnn 1^00 crrAI 9A3977 
lyUU.IxZU.gZHO Z*+3Z/ 1 


P 

JT 


M000S79 11 A D0 1 


T Tr*9-Primr*Al An 

U V/Z> XT I IXI1\_/IXIUI1 


0707 

LI 5 1 


33ZyU4 


ionn t*oo cttA'X OA^nn 
i y uu .dzz .gz**3 z^ojuu 


P 


MOOOS7900RT>04 


T Tr , 9-PrimpAl An 
Uva r xxutv^uxuxi 


Ll Do 


045.3 /y 


ionn uo7 rr'yA^ OA^no 

iyUU.JVZZ.gZ*t 3 LIDDkIz? 


P 


M000S791 1R-A0R 


1 TP 9 -Prim Pol nn 


Z/3y 


/C/IO/tOA 

o4y4yu 


ionn lv/ioo cttA'X oa^i i 
xyuu.xvizz.gZH3_z*f33 1 1 


P 


^000^57914^1^ 19 

IV1UUUJ / Z. 1*tXJ.X> 1Z. 


T TP 9 -Prim PaI An 

yj v^z x i ill ivunjii 


2/4U 


loo3ZZ 


i onn xroo cr^A'x oa^io 

iyUU.INZZ.gZH- D^LHDD ll 


P 


M000S79 1 fif0 1 9 


T TP9 -Prim PaI An 


2/41 


3616Z6 


ionn poa cttAI oa^^ 

lyUU.HZ^T.gZT-J Z4jjjJ 


P 

JT 


M000S 79 GdC-CrCi6 


T TP9-PrimP a1 An 


07/tO 

2/42 


033olo 


\Qf\(\ \£*)A crvAI. OA^Al 

lyuu.ivz^f .gzHo zH'OO'ri 


P 

.r 


MO0OS791 1RP09 


T TP9-PrimPo1on 

\J \^J* X 1 XlilV^vIWAX 


2/43 


/34oZo 


lOnO/^ni rrrrA^ OA'*'*^ 

lyuy .\J\) 1 .gZ*4-3_Z43333 


P 


A^000S799Sr-FT07 


T TP 9 -Prim Pnl An 

\J K^Z IT 1 AlllV/^Jl*JAl 


2/44 


A071 1 5 
42 / 1 1 3 


lOnOMni rr-rA^ OA*33^A 

lyuy.xiu 1 .gz^t3_z*t3 jjh- 


p 
r 


M000S7996r*F0S 


1 TP9-PrimPAlAn 


2/43 


3olo / / 


1AAQ imi rrrrA'J OA55^7 

iyUy.rvUl.gZ43_Z4333 / 


p 


M000S79^1R* A01 

1V1UUU3 / Z.3 1 


T TP9-PrimpAlAn 

U Vi- I 1 XIIIVxVJlwlX 


2/46 


434363 


lOnOA/fni rrrrAI OA^^O 

1 yuy .JVIU 1 .gZ43_24333y 


P 
r 


A/T000^79^^R-O04 


T TP9 -Prim P a1 An 

U V^Z. x I Alii vUlUll 


2/4/ 


£0*£Q 1 A 


lOnOPm rrrrA'X OA^/^O 

1 y Uy. r Ul.gZ4 3 _Z43 3 OZ 


p 


M000S79 ^ -FTOO 


T TP9 -Prim PaI on 


274 o 


4/3562 


i nno t?ao /w/i'j OAQ'i/CS 
iyuy.rU2.gZ43_z433Do 


p 
r 


IV/T000S799ziP-R09 

1V1UUU3 / ZZ*rVjr.J3v/Z. 


T TP9 -Prim PaI An 

U V>-Z X 1 JXIlV^VJUJlX 


2749 


7"2 *£7/f A 

/33 /4y 


1 OnO T^nO rrrrA'X O AI'XTl, 

iyuy.Jvuz.gz4j_z43j id 


p 


A/f 000579^ 1 PR04 


T TP9 -Prim PaI An 

\J X 1 XIIIV^VJIUIX 


2/3U 


2oy32o 


1 OAO PA^ rr —A'3 oA^^sn 
iyuy.I3UJ.gZ43 Z4335U 


p 


MO0nS7919D-fr1 1 


T TP9-PrimPAl An 


oar -I 

2731 


4636y / 


i ono r^n^ mA'x oa^sii 

lyUy.vAO .gZ**3_Z z f33ol 


P 


M000S7990D-F06 


T TP9 -PrimP a1 An 

W V^Z» X X XI llVx UiUll 


2 /3Z 


70ZTOA1 

/zooyz 


1 OnO THYX cttA'X oa^^r^ 
1 yuy ,IiU3 .gZH-3_Z^t33 oj 


p 
r 


M000S7995B-O0 1 


T JP9-PrimPolon 

\J X X lillV/vAUU 


2/33 


71*20*^0 

/ 23 y3y 


lono crvAi OA^^RR 
iyuy.JU3 .gz*to_z*io joo 


p 

17 


M000S79^0r*C , 05 


T TP9-PrimPolon 

W V^Zr XT X UllV/vlUll 


2 /34 


/33o /4 


i ono xrn^ n^Ai oa^^oo 




M000S7234D*A12 


UP2-PrimColon 

\J Vs X X XXIXVsVSXWIX 


2/33 


ACA 1 OO 
43412y 


i y u y . AU4 . gz*f D_L e + j 3 y d 


p 


M000S791 8C-F10 


T TP9-PrimPolnn 


2 /3o 


/33oOo 


lOnO DHA rr^A^ OA^^Q/> 

iyuy.oU4.gz £ f3_Z'r33yo 


r 


M000S79 1 9DH04 

J.VXV/V/V/— ' / Z> X -7X-^.X XV/ "T 


UP2-PrimColon 

\J X X XXXLV^V/XV/Xl. 


2/3 / 


o42y4U 


i ono urn /i rr-yA^ OA'ZACf) 
I y Uy JrU4.gZ43_Z43*f uz 


p 


M000S7996P-F1 9 


T TP 9 -Prim Pnl on 


2 /jo 


3yoU61 


i ono nn^ crvAi 'JA'XA'yA 
iyuy.uu3 .gz*f3_ZH-3M"i*fr 


p 


M000S7999RA06 


T TP9-PrimPAlon 

V^^> X 1 XXXX\^\JXV/XX 


1T< A 

2/3y 


7*3 /I /CO A 

/346yU 


i ono rin<k rr^A^ oa^a 17 

iyuy.VJU3.gZ43_Z43 £ fl / 


p 
r 


M000S799SDE01 

iVAUv/U3 / ZZJ A-/.A-/W A 


T TP2-PrimPolon 


2/6U 


6U1U31 


i ono xrn^ mAi iaiaoa 

iyuy.INU3 .gZ43_Z z f3 £ fZ*f 


p 


i M000S79^4D*F04 


T TP9-PrimPn1nn 


2/61 


7070*C*C 

/2/233 


i ono T? n^ />tAQ oao^a^i 

iyUy.IlUO.gZ43_Z43H3 1 


p 

JT 


MO00S7993f**R01 

1V1UUU3 / ZZ. JV/.A7U A 


T TP9 -Prim Pol on 

\J V^Z< X X XXII wlvXX 


1*7/0 

2762 


632499 


1 OnO TA£ 2 OA'JAQ^ 

1 y Uy ,lUO.gZ43_Z43433 


p 


1 M00n^799QA-R0^ 


TTP9-PrimPnlAn 

\J X 1 IXllVvUIUXl 


1*7 /CO 

2763 


/3/01U 


iyUy.xvUo.gZ43_z4343 / 


p 


1V>T000^70^ 1 P-F06 

XV1UUU3 / ZD 1\-/.A_/V/U 


T TP9 -PrimP a1 An 

V/Zr XT 1 XX11V>\J1VXX 


1*7 Z! /I 

2764 


TOO 111 

728121 


1 AAA T A/C ~-,A 1. O 

iyUy.L»U6.gz43_24343o 


P 

r 


A>rnnn^7O70'R-rTO9 

1V1UUU3 1 LDZiDXJyjZ 


T TP9-PrimPn1 An 


2763 


A *71 Oil 

472511 


1 AAA DHO rt-yjO- O A1AC(\ 

1 yuy .i5Uo.gZ43_Z434oU 


p 
r 


A>rnnn^7oon A -pn^ 

iVlUUU3 /ZZUA.V-UO 


T TP9 -P ri m P r»1 nn 
U xlllv^VJlt/ix 




648 140 

VJ^O l*tv 


1 909 H08 az43 243466 


F 


M00057226C:H10 


UC2-PrimColon 


2767 


! 726251 


1909.A09.gz43_243475 


F 


M00057219A:D05 


UC2-PrimColon 


2768 


642631 


1909.B09.gz43_243476 


F 


M0O057220A:DO9 


UC2-PrimColon 


2769 


598087 


1909J09.gz43_243484 


F 


M00057230C:G12 


UC2-PrimColon 


2770 


398061 


1909.K09.gz43_243485 


F 


M00057231C:F12 


UC2-PrimColon 


2771 


463513 


1909.M09.gz43_243487 


F 


M00057233D:G12 


UC2-PrimColoii 


2772 


81 ' 


1909.F10.gz43_243496 


F 


M00057225A:C08 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


2773 


735050 


1909.H10.gz43 243498 


F 


M00057226DB03 


T TP9~Primf , n1nn 
iJ v_/Z- x x UIlV^UlOxl 


2774 


735140 


1909.K10gz43 243501 


F 


M0005 723 1 C -G04 


T TP9-PriinPn1nn 


2775 


220107 


1909.O10.gz43 243505 


F 


M00057236AF08 




2776 


735817 


1909.P10.gz43 243506 


F 


M00057237BD10 

IflvV/l/^ / 4~-J 1 Xjt , X-' X V/ 


UvA **x xxxiiv^UxUxx 


Till 


735514 


1909.Dll.gz43 243510 


F 


M0005 7222D C10 


u v>-^- X X XXlIV^L/lUXl 


2778 


602673 


1909.Fll.gz43 243512 


F 


M00057225AE03 


TTP9-PrimPn1on 

w V_>Z# X X XXllV^UXLUx 


2779 


647815 


1909.11 l.gz43 243515 


F 


M00057229BF1 1 


TTP9-PrimPn1nn 


2780 


402024 


1909A12gz43 243523 


F 


M00057219AE1 1 


T TP9-PrimPnlon 

VJ Kjt^ X X XXXXV_vVJXiJXX 


2781 


631111 


1909.C12.gz43_243525 


F 


M00057221BB01 


UC2-PrimPn1nn 


2782 


451888 


1909.112.gz43_243531 


F 


M00057229BG1 1 


IIP9-PriTnPnlr>n 

\J V/Zi X X XXXXV_/UXUXX 


2783 


456224 


1909J12.gz43_243532 


F 


M00057230DC05 


T TP 9 -Prim Pnl nn 
uvy/,"r xxxxiv_fUHJix 


2784 


555336 


1909.L12.gz43 243534 


F 


M00057232DB03 


T TP9 -Prim Pnl nn 
UVyL "*x I xlIlV^UlUIl 


2785 


736988 


1909 A13 ez43 243539 


F 


M0005771QA-H1 1 


T TP9-PrimPn1rvn 
vJ v^Z.~x I xxilv_/DxOIl 


2786 


613067 


1909.B13.gz43 243540 


F 




T TP 9 -Pri tyi Pr\1 rrn 


2787 


736738 


I909.E13.gz43 243543 


F 


M00057223D-H03 


T TP9-PrimPnlnn 
u r x xxii v^uivJxx 


2788 


646552 


1909 G13 sz43 243545 


F 


M00057226AB04 


T TP 9 -Prim Pr»1 nn 
U v>z. x I xxiXv^UxvJxx 


2789 


725120 


1909 013 2z43 243553 


F 


1VXV/V/V/J / Z.JUD .lv 1 1 


T TP9wPi-imPrk1r»n 


2790 


591979 


1909 A14 ez43 243555 


F 

X 


M00057? 1 QB-R 1 0 


T TP 9 -Prim Pnl nn 
U V-'xC -x ITIIIV^OIOII 


2791 


731467 


1909 N14 ez43 243568 


F 


M00057235B- A07 


T TP 9 -Prim Pnl nn 


2792 


474869 


1909 A15 ez43 243571 


F 


M000572 1 QB-POfi 


T TP 9 -Prim Pnl nn 
UV//6 xl xxxlv^UXUxx 


2793 


735306 


1909 C15 sz43 243573 


F 


IViVUvJ /Z-ZrlJJ.Dl 1 


T TP9-PrimPnlnn 
U \^LT3l xxxxll^OIOIl 


2794 


732736 


1909K15sz43 243581 


F 


M0005723 1 D • A OS 


TTP9-PrimPn1nn 


2795 


724417 


1909 L15 ez43 243582 


F 


M00057233A*r04 


T TP9-PrimPn1nn 


2796 


454355 


1909 E16 sz43 243591 


F 


M00057224AT>07 


T TP 9 -Prim Pnl nn 


2797 


732899 


1909.D17.gz43 243606 


F 


M00057223AF06 


T TP9-PrimPn1nn 

l-J V_^X- XT X XXXlV^VJXUXX 


2798 


733149 


1909J17gz43 243612 


F 


M0005723 1 A*D04 


T TP9-PrimPn1nn 

\J \_^~x X XX11V_^VJXL/XX 


2799 


735622 


1909B18sz43 243620 


F 


M00057220B*rr1 0 


T TP9 -Prim Pnl nn 


2800 


453132 


1909 019 ez43 243649 


F 


M00057236C-C07 


T TP9 -Prim Pnl nn 

U V-/Z. X X XXXlv_*L/XwXX 


2801 


724781 


! 1909 P19 sz43 243650 


F 


M00057237DC1 1 


T TP9-PrimPolon 

KJ \^*JZ* X X XXXlV^wXV/Xl 


2802 


556852 


1909 B20 ez43 243652 


F 




T TP9 -Prim Pnl nn 


2803 


732159 


1909 H20 ez43 243658 


F 


M00057227B-A05 

XVXV/V/V/*/ / x«xV # Xf.AUJ 


T TP9-PrimPnlnn 

VJ V-/^« X XXXllV_y\JXV/XX 


2804 


734609 


1909.D21.gz43 243670 


F 


M00057223BA07 


T TP9 -Prim Pnl nn 

VJ Ks-L* X X XXIiV_/WlVJXX 


2805 


616985 


1909 121 ez43 243675 


F 




T TP9 -Prim Pnl nn 

W V>Z,~X X XXll\_/VjlVJXX 


2806 


734151 


1909N21sz43 243680 


F 




T TP9-PrimPnlnn 

VJ V/^ X X XXIlV_/IJXk/XX 


2807 


553850 

w *h/ vj vy 


1909 C22 sz43 243685 


F 




T TP9-PrimPnlnn 

VJ \~f£* X X XX X LV^f SJl. \JLl 


2808 


736894 


1909.D22.gz43_243686 


F 


M00057223B.B04 


UC2-PrimColon 


2809 


486051 


1909.K22.gz43_243693 


F 


M00057231D:F10 


UC2-PrimColon 


2810 


734553 


1909.N22.gz43_243696 


F 


M00057235C:F03 


UC2-PrimColon 


2811 


473578 


1909.P22.gz43_243698 


F 


M00057237D:D09 


UC2-PrimColon 


2812 


737010 


1909.A23.gz43_243699 


F 


M00057219D:C02 


UC2-PrimColon 


2813 


735514 


1909.B23.gz43_243700 


F 


M00057220C:F08 


UC2-PrimColon 


2814 


418682 


1909.C23.gz43_243701 


F 


M00057221C:F02 


UC2-PrimColon 
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SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORTEN 
T 


CLONE ID 


LIBRARY 


2815 


734562 


1909.F23 .gz43_243704 


F 


M00057225C:F09 


UC2-PrimColon 


2816 


664711 


1909.M23.gz43_24371 1 


F 


M00057234C:D11 


UC2-PrimColon 


2817 


727558 


1909.E24.gz43_243719 


F 


M00057224B:H02 


UC2-PrimColon 


2818 


730028 


1910.C01.gz43_243733 


F 


M00057240A:F03 


UC2-PrimColon 


2819 


611604 


1 9 1 0.E0 1 .gz43_243735 


F 


M00057242C:G12 


UC2-PrimColon 


2820 


558617 


1910.H0Lgz43_243738 


F 


M00057246A-.G11 


UC2-PrimColon 


2821 


733144 


1 9 1 0 JO 1 .gz43_243740 


F 


M00057248B:D11 


UC2-PrimColon 


2822 


647242 


1 9 10.M0 1 .gz43_243743 


F 


M00057252B:E04 | 


UC2-PrimColon 


2823 


733552 


1910.F02.gz43_243752 


F 


M00057243C:D01 


UC2-PrimColon 


2824 


729428 


1910.N02.gz43_243760 


F 


M00057253C:A06 


UC2-PrimColon 


2825 


641680 


1910 . A03 .gz43_243763 


F 


M00057238A:D07 


UC2-PrimColon 


2826 


514838 


1910X03 ,gz43_243774 


F 


M00057251A:F02 


UC2-PrimColon 


2827 


735403 


1910.N03.gz43_243776 


F 


M00057253C:D11 [ 


UC2-PrimColon 


2828 


732859 


1910.104.gz43_243787 


F 


M00057247C:C11 


UC2-PrimColon 


2829 


730046 


1910.C06.gz43_243813 


F 


M00057240B:C01 


UC2-PrimColon 


2830 


650067 ' 


1910E06sz43 243815 


F 


M00057242D:C07 


UC2-PrimColon 


2831 


734884 


1910.F06.gz43 243816 


F 


M00057243C:H11 


UC2-PrimColon 


2832 


550730 


1910.I06.gz43 243819 \ 


F 


M00057247C:F10 


UC2-PrimColon 


2833 


449042 


1910K06sz43 243821 


F 


M00057249D:H09 


UC2-PrimColon 


9834 


647704 


1910 B07 ez43 243828 


F 


M00057239B:F05 


UC2-PrimColon 


2835 


555641 


1910.G07.gz43 243833 


F 


M00057245A:F03 


UC2-PrimColon 


9836 


734226 


1910.D08.gz43 243846 


F 


M00057241D:C04 


UC2-PrimColon 


2837 


551437 


1910.A09.gz43 243859 


F 


M00057238B:F05 


UC2-PrimColon 


2838 


639494 


1910.M09.gz43 243871 


F 


M00057252D:B10 


UC2-PrimColon 


2839 


661194 


1910.P09.gz43 243874 


F 


M00057256B:A05 


UC2-PrimColon 


2840 


734466 


1910 B10 ez43 243876 


F 


M00057239C:E05 


UC2-PrimColon 


2841 


734371 


19 10.110.gz43_243883 


F 


M00057247D:D10 


UC2-PrimColon 


2842 


731392 


1910.Kll.gz43 243901 


F 


M00057250B:A03 


UC2-PrimColon 


2843 


484964 


1910.G12.gz43 243913 


F 


M00057245B:E02 


UC2-PrimColon 


2844 


733991 


1910.H12.gz43_243914 


F 


M00057246C:B12 


UC2-PrimColon 


2845 


585976 


1910M12ez43 243919 


F 


M00057252D:D04 


UC2.-PrimColon 


9846 


729125 


1910 B13 ez43 243924 


F 


! M00057239C:G07 


UC2-PrimColon 


2847 


733855 

/ ~J mJ \J *J 


1910 C13 az43 243925 


F 


M00057240C:A06 


UC2-PrimColon 


2848 


733945 


1910D14ez43 243942 


F 


M00057241D:G01 


UC2-PrimColon 


2849 


555103 


1910 J14 ez43 243948 


F 


M00057249A:C06 


UC2-PrimColon 


2850 


726011 


1910.K14.gz43_243949 


F 


M00057250B:D04 


UC2-PrimColon 


2851 


557353 


1910.N14.gz43_243952 


F 


M00057254A:G07 


UC2-PrimColon 


2852 


457846 


1910.E15.gz43 243959 


F 


M00057243A:H03 


UC2-PrimColon 


2853 


456753 


1910.B16.gz43_243972 


F 


M00057239D:D01 


UC2-PrimColon 


2854 


! 731238 


1910.M17.gz43 243999 


F 


M00057253A:C05 


UC2-PrimColon 


2855 


558332 


1910.O17.gz43_244001 


F 


M00057255C:A07 


UC2-PrimColon 


2856 


649390 


1910.P17.gz43_244002 


F 


M00057256D:A11 


UC2-PrimColon 
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INSDOCID: <WO_0214500A2_L> 



WO 02/14500 



PCT/US01/25840 



Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


2857 


734808 


1910D19ez43 244022 


F 


M00057242AH1 1 

ITIV/VV/J / x#TZ#xTL.X XXX 


UCS-Primfolon 

KJ \-*£* X 1 HllV_sV7XUXl 


2858 


731125 


1910L19ffz43 244030 


F 

1 


M00057252AD1 0 

1t1V/V/V/«J / Z*-J £*£\..U X VJ 


T TP9 -Prim Pnlnn 

\J \~>jL < I 1 XillV^lsXlJxl 


2859 


476455 


1910G20ez43 244041 


F 
1 


M00057945DO09 


KJ X X llllv V7XWXX 


2860 


735754 


1910 M20 ffz43 244047 


F 
1 


M00057253 AFN 0 


V/^> x x iiiiv^uivyix 


2861 


735477 

/ «7«7"T / / 


1910 D21 ffz43 244054 


F 
1 


M00057949BF07 

1VXV/V/V/,J 1 £*t£*MJ,M/\J I 


VJ XT 1 IXXXV^ wlvUl 


2862 


734894 


1910 121 az43 244059 

17lu.lZ>l.^itJ X,* 1 1 W~J Z? 


F 

X 


M00057248 AH1 0 

iviuuuj / z,tor\.iil v 


T Tr 1 ? -Prim Pol nn 


2863 


473588 


1910 J21 az43 244060 

1 7 1 U.tf 1 .g£>*T J X» 1 1 V7 V7 V/ 


F 

X 


M00057240PP07 


T TP? -Prim Pol nn 

VJ \->£* X 1 illiV/ VXVJXX 


2864 


735665 


1910 K21 ez43 244061 




M00057250C Cr09 


T TP 9 -Prim Pol on 

VJ V_^^ 1 1 XL 1 1 V7XU1X 


2865 


725095 


1910 M22 ez43 244079 


F 

X 


M00057253BP06 


T IP2-Pri iTiPrilon 

VJV^Z* X 1 HXIA_/UX *J1X 


2866 


726081 


1910 P93 ez43 244098 


F 


M00057257AH1 0 


I JP2-PrimColon 

vvyA X X XL11\_^V7XV^1X 


2867 


455821 


1910F24ez43 244104 


F 1 


M00057244CE06 


I JP2-PriTnColon 

\7V/Af X X XLXIV^V/X V7XX 


2868 


449537 


1910O24&743 244113 


F 


M00057955DF02 


T TP9 -Prim Pol on 

VJ V>-£> XT 1 1111V-/VXJA-JX1 


2869 


514697 


1910P24ez43 244114 

X7 XV/. X ^'r.&^iTJ 4& 1 11 X~ 


F 

X 


M00057257BC 1 1 « 

IVlv/wvJ / / XJ . V_/ X X 


1 TP9 -Prim Pol on 

VJ XT X XIX1^/V^XV/XX 


2870 


586794 


1911 101 ez43 244123 


F 


M00057269D F02 


TJP2-PrimColon 

V/v/ai X XXXllV^WXVUl 


2871 


648710 


1911 M01 &z43 244127 

X X X •ITIV/ X if\ — " I T x x# / 


F 


M00057275BB02 


lJC2-PrimColon 

W V^Arf X X illlv/V/lx/11 


2872 


630348 

V^«J V7«7~0 


1911C02ffz43 244133 

17 1 l.VVi.gL'TJ X# I I 1*7*7 


F 


M00057260AE05 


UC2-PiimColon 


O / .J 


733-570 1 


1911 F09 <x:t43 944136 


F 
1 


M00057966DR12 

1 VXV7 V/ v/«7 / Z,UU17 .XJ X ^ 


I JP2-PrimPo1oTi 

VJ x i miv^uiuii 


2874 


480723 


1011 J09 JT743 244140 

1711,JvZ<,gi6tJ X**T*TX*TV/ 


F 


M00057271 AF04 


T JP2-PrimColon 


2875 


728756 


1911 C03 az43 244149 

l/il< wVJ ,gZiT-7 Z« f T 1T7 


F 

X 


M00057260AE1 1 


UP2-PiiiTiColon 

VJ V/a> X 1 XL XI V7XUJX 


2876 


735071 


1911E04ffz43 244167 

X 7 X 1 . X—<v> i . rlr / X* IV lUl 


F 


M00057265DB 12 


IJC2-PiimColon 


9877 


630655 


1011 K04az43 944173 

17 11 ,l\Vt ,g/i*T J Z»T*tl / 


F 

X 


M00057973 A C08 


t TP2-PriniPolon 

VJ VA L 1 XL11V_/VJ1V/XL 


9878 


736014 

/ JV/V X*T 


1011 TST04 az43 244 1 76 


F 

X 


M00057279A-G02 

lVlV/UvJ 1 4m 1 7/\.VJ V7^> 


1 TP 9 -Prim Pol on 


987Q 


42 


1011 P04 <*r43 944178 

X 7 X X .1 V7"T ,g^*T^ X* 1 f I/O 


F 
1 


M00057283AE06 


T IP2-Prim Colon 

VJ V^— ^ 1 X IX 11V/V71L'11 


9880 


489 1 45 


1011 D07 o 7 43 944914 


F 

X 


M00057262B-r03 

IVlvUV/J / Z*\J£-i\J . V^V/»7 


T JP2-PrimColon 

VJ \~s X X XXIXV^V^XVXIX 




735834 

/ ,J«70.«7"T 


1011 KIO0743 944960 

17 11.J\Xv.k£^J ^*T*T.Z»V77 


F 

X 


M00057273B-A12 


UP2-PrimColon 

VJ X XXXli.\^VJXVJXX 


9889 


49 


1011 P10ct743 944974 


F 

X 


M00057983BOOO 


T TP9 -PrimPolon 

VJ V.^ilr XT 1 XXXlV-sVJXVJXX 


9883 


736014 


1011 Mil a743 944987 


F 


M00057977B-C00 

1V1UUWJ / Z# / / U. V^V/7 


T TP2-PrimPolon 

VJ V_^^# X XXXXLV^VJXWXX 


9884 
4600*+ 


450609 


1011 Oil or43 944980 


F 

X 


M00057281 AO08 


T JP2-PrimPolon 

VJ \~/A\* XT X XXUV^V/XV/1X 


9885 


4470 


101 1 M19 »v43 944303 


F 

X 


M00057277B-E10 


T TP9 -PrimPolon 

UV/^ x xxxxxv^v^xvyxx 


-OOU 


456065 


1011 019ar43 944305 

l7ll.V/l<C.g£'TJ z.*t*T^\yj 


F 


M0005728 1 AH02 


I JP2-PrimColon 

VJ\w^ X X xixxv^v/xwix 


9887 


489085 


1011 K15o-r43 944340 


F 


M00057973C-F1 1 


T TP9 -PrimPolon 

VJ V^X# XT X XXXXV^V7XV7XX 


9888 


659789 


1011PM 8 ar43 944303 

17 1 1 .VJx 0.g£*r-> A t t 1 rJ7j 


F 


M00057968B-B03 


1 JP2-PrimColon 

V7\_/X» X 1 X111V- /Ulv/li 


9880 


79S784 


101 1 F118 ar43 944304 

17 1 l.XllO.g^H'J ^ t rtJ74 


F 


M00057960PF01 


IJC2-PrimColon 


9800 


79039 1 


1011 1C10O743 944413 


1 F 

JT 


M00057273DA06 


UC2-PrimColon 


9801 


73991 3 


1011 TJ99 oy43 944458 

1711 .XlZ.^.gZH'J Z,*T*T*T-7 0 


F 

JT 


M00057960r -TT06 


T IP2-PrimPolon 

UV/4> 1 XXXLLVxVJXUXX 


2892 


420504 


191 l.B23.gz43_244468 


F 


M00057259D-.D11 


UC2-PrimColon 


2893 


735687 


1911.F23.gz43_244472 


F 


M00057267CG09 


UC2-PrimColon 


2894 


728790 


1911.J23.gz43_244476 


F 


M00057272C:D08 


UC2-PrimColon 


2895 


734131 


191 l.P23.gz43_244482 


F 


M00057283D:B12 


UC2-PrimColon 


2896 


725951 


| 1911.D24.gz43_244486 


F 


M00057265C:F03 


UC2-PrimColon 


2897 


554703 


S 1911.E24.gz43_244487 


F 


M00057266C:G12 


UC2-PrimColon 


2898 


546642 


1911.L24.gz43_244494 


F 


M00057275B:A12 


UC2-PrimColon 
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3NSDOCID: <WO 0214SOOA2 I > 



WO 02/14500 



PCT/US01/25840 



Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORH5N 

T | 


CLONE ID 


LIBRARY 


2899 


730189 


1912.E03.gz43_244535 


F 


M00057291B:H08 


UC2-PrimColon 


2900 


728768 


1 9 12 .D08 .gz43_2446 14 


F 


M00057290B:A02 


UC2-PrimColon 


2901 


420402 


1912.H10.gz43_244650 


F 


M00057299C:A08 


UC2-PrimColon 


2902 


737109 


1912.F12.gz43_244680 


F 


M00057293A:H03 


UC2-PrimColon 


2903 


424672 


1 9 1 2 .H 1 5 .gz43_24473 0 


F 


M00057299D:E04 


UC2-PrimColon 


2904 


558045 


1912.D16.gz43_244742 


F 


M00057290D:G03 


UC2-PrimColon 




733856 


1912.F16.gz43 244744 


F 


M00057293B:H04 


UC2-PrimColon 


2906 


736449 


1912.H17.gz43_244762 


F 


M00057300A:A08 


UC2-PrimColon 


2907 


148201 


1 9 1 2.H1 8 ,gz43_244778 


F 


M00057300A:B06 


UC2-PrimColon 


2908 


639427 


1 9 12 .E 1 9 .gz43_24479 1 


F 


M00057292A:B08 


UC2-PrimColon 


2909 


730187 


1912.B21 .gz43_244820 


F 


M00057287A:H06 


UC2-PrimColon 


7910 


732300 


1912D24ez43 244870 


F 


M00057291B:D08 


UC2-PrimColon 


2911 

s x x 


465528 


1921.A01.gz43 244883 


F 


M00057314A:A10 


UC2-PrimColon 


2912 


549552 


1921.J01.gz43 244892 


F 


M00057331D:D08 


UC2-PrimColon 


2913 


470199 


1921.E02.gz43 244903 


F 


M00057320A:G08 


UC2-PrimColon 


7914 

X"T 


729125 


1921.F03.gz43 244920 


F 


M00057323B:G04 


UC2-PrimColon 


2915 


640341 


1921.C04.gz43 244933 


F 


M00057316D:D02 


UC2-PrimColon 


2916 


77737 


1921.F04.gz43 244936 


F 


M00057323B:H08 


UC2-PrimColon 


7917 

^.-7 X / 


735469 


1921.G04.gz43 244937 


F 


M00057324B:E04 


UC2-PrimColon 


7918 


736733 


1921 B06 az43 244964 


F 


M00057315D:C06 


UC2-PrimColon 


2919 


732969 


1921.I06.gz43 244971 


F 


M00057328C:F11 


UC2-PrimColon 


2920 


635849 


1921.J06.gz43 244972 


F 


M00057332A:C06 


UC2-PrimColon 


2921 


733146 


1921.O06.gz43 244977 


F 


M00057339D:H09 


UC2-PrimColon 


2922 


737114 


1921 A07 gz43 244979 


F 


M00057314B:H06 


UC2-PrimColon 


2923 


732114 


1921.L07.gz43 244990 


F 


M00057334D-.E03 


UC2-PrimColon 


2924 


482788 


1921.P08.gz43_245010 


F 


M00057341D:B09 


UC2-PrimColon 


2925 


554647 


192 1 .M10.gz43_245039 


F 


M00057337C:G12 


UC2-PrimColon 


2926 


552623 


1921 Bll az43 245044 


F 


M00057316A:B04 


UC2-PrimColon 


2927 


447705 


1921.Ell.gz43 245047 


F 


M00057320D:C02 


UC2-PrimColon 


2928 


731748 


1921 Gil ez43 245049 


F 


M00057324C:G05 


UC2-PrimColon 


2979 


727093 


1921 B12 ez43 245060 


F 


M00057316A:D09 


UC2-PrimColon 


2930 

i.7 JV7 


551607 


1921 N12 ez43 245072 


F 


M00057339A:E08 


UC2-PrimColon 


29^1 

1.7 J X 


777013 


1921 G13 ez43 245081 


F 


M00057324D:E10 


UC2-PrimColon 


2937 


777018 


1921 L13 ez43 245086 


F 


M00057336B:E01 


UC2-PrimColon 


7933 


JtOvJ J 


1921 G14 ez43 245097 


F 


M00057324D:H03 


UC2-PrimColon 


2934 


726692 


1921.J14.gz43_245100 


F 


M00057332C:F12 


UC2-PrimColon 


2935 


556430 


1921.D15.gz43 245110 


F 


M00057319D:E06 


UC2-PrimColon 


2936 


726692 


192lA16.gz43_245123 


F 


M00057314D:E09 


UC2-PrimColon 


2937 


473225 


1921.F16.gz43_245128 


F 


M00057323D:E10 


UC2-PrimColon 


2938 


892 


1921. K16.gz43_245 133 


F 


M00057334A:C12 


UC2-PrimColon 


2939 


730997 


1921.C17.gz43 245141 


F 


M00057318B:H05 


UC2-PrimColon 


2940 


51616 


1921.E18.gz43 245159 


F 


M00057323A:F01 


UC2-PrimColon 
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WSDOCID: <WO 0214S00A2 I > 



WO 02/14500 



PC17US01/25840 



Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


2941 


546642 


1 92 1 .F 1 8 .gz43_245 1 60 


F 


M00057324AA09 


UC2-PrimColon 

WX_/x* X A IxU V-/ VUV/xJL 


2942 


643005 


1 92 1 .Gl 8 .gz43_245 161 


F 


M00057325B:D06 


UC2-PrimColon 


2943 


735054 


1 92 1 .HI 8 .gz43_245 1 62 


F 


M00057326D:B10 


UC2-PrimColon 


2944 


727231 


1 92 1 .F 1 9 .gz43_245 1 76 


F 


M00057324A:C08 


UC2-PrimColon 


2945 


733464 


1 92 1 J 1 9.gz43_245 1 80 


F 


M00057333A:D08 


UC2-PrimColon 

V-/ «^X« X X XLXX^/V/XV/XX 


2946 


457092 


1 92 1 .B20.gz43_245 1 88 


F 


M00057316C:A06 


UC2-PrimColon 


2947 


733723 


1921.H21.gz43 245210 


F 


M00057328AF02 


UC2-PrimColon 

\-S V^Xv X X XXXXV^V/XV/U 


2948 


557177 


1 92 1 .12 1 .gz43_2452 1 1 


F 


M00057331C:E10 


UC2-PrimColon 


2949 


734582 


1 92 1 .K2 1 .gz43_2452 1 3 


F 


M00057334B:F01 


UC2~PrimColon 


2950 


448431 


1 92 1 .L2 1 .gz43_2452 14 


F 


M00057336D-.F07 


UC2-PrimColon 


2951 


697006 


1 92 1 .H22 .gz43_245 226 


F 


M00057328A:F10 


UC2-PrimColon 


2952 


452936 


192LI22.gz43 245227 


F 


M00057331C:F10 


UC2-PrimColon 


2953 


727132 


1921.L22.gz43 245230 


F 


M00057337AA06 


UC2-PrimColon 

V-/ A« X X XXX Ix/VlV/U 


2954 


513306 


1921.F23.gz43 245240 


F 


M00057324BA06 


UC2-PrimColon 


2955 


642986 


1921.P24.gz43 245266 


F 


M00057344AG07 


UC2-PrimColon 


2956 


496752 


1924 G03 ez43 245338 


F 


M00043308AD09 


UC2-ColonMetLiver 


2957 


498509 


1924 H03 az43 245339 


F 


M000433 10AF01 


UC2-ColonMetLiver 


2958 


450805 


1924.E04.gz43 245352 


F ! 


M00043304BA10 


UC2-ColonMetLiver 


2959 


451456 


1924.G04.gz43 245354 


F 


M00043308AF06 

IT±V/ V/ v i */ v U2 X* X \J\J 


UC2-ColonMetLiver 


2960 


494625 


1924.M04.gz43 245360 


F 


M000433 17BB12 


UC2^oloTxMetLiver 


2961 


490401 


1924.P05.gz43 245379 


F 


M00043321BE05 


UC2-ColonMetLiver 


2962 


495105 


1924 H08sz43 245419 


F 


M000433 10CB04 


UC2-ColonMetLiver 


2963 


496957 


1924 E09sz43 245432 


F 


M00043304CE01 


UC2--ColonMetLiver 


2964 


494306 


1924.I10.gz43 245452 


F 


M000433 12D* A02 


UC2-ColoiiMetLiver 


2965 


492544 


1924.Jll.gz43 245469 


F 


M000433 13DB04 


UC2-ColonMetLiver 


2966 


451081 


1924Mllgz43 245472 


F 


M00043317DC02 

1TAV V X § X^ . V v*. 


UC2~ColoiiMetLiver 


2967 


495951 


1924.K14.gz43 245518 


F 


M00043315C*D05 


UC2-ColonMetLiver 


2968 


450349 


1924.D15.gz43 245527 


F 


M00043303B:E11 


UC2-ColonMetLiver 


2969 


498951 


1924.L15.gz43 245535 


F 


M00043316CF06 


UC2-ColonMetLiver 


2970 


450999 


1924.C17.gz43J245558 


F 


M00043301DB12 

XTX w V \/ l W «y \m¥ X X-' * J 0 X X^ 


UC2-ColonMvtLiver 


2971 


499693 


1924.G17.gz43 245562 


F 


M00043309AH06 


UC2-ColonMetLiver 


2972 


498242 


1924.F18.gz43 245577 


F 


M00043307B:F11 


UC2-ColonMetLiver 


2973 


499700 


1924 H18 sz43 245579 


F 


M00043311BH08 


UC2-ColonMetLiver 


2974 


479604 


1924L19ez43 245599 


F 


M00043316DF04 


UC2-ColonMetLiver 


2975 


562516 


1924N19ez43 245601 


F 


M00043319AD01 


UC2-ColonMetLiver 


2976 


553779 


1 924.L2 1 .gz43_245 63 1 


F 


M00043316D:F09 


UC2-ColonMetLiver 


2977 


. 493746 


1924.E24.gz43__245672 


F 


M00043306B:A09 


UC2-ColonMetLiver 


2978 


494325 


1924.H24.gz43_245675 


F 


M00043312B:A10 


UC2-ColonMetLiver 


2979 


453078 


1933.H03.gz43_245723 


F 


M00043340B:B04 


UC2-ColonMetLiver 


2980 


492887 


1933J04.gz43_245741 


F 


M00043343D:H03 


UC2-ColonMetLiver 


2981 


450283 


1933.F05.gz43_245753 


F 


M00043335D:E02 


UC2-ColonMetLiver 


2982 


494271 


1933.M05.gz43_245760 


F 


M00043351C:A07 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


"LUSTEP 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 1 


zyoJ 


/MQ 1 1 A 

44ol 10 


1 CXXi f\(\K citA'X 9xl^7£9 


X 1 




T jr2-CoIonMetLiver 


90Q/I 


A 19 A'} 1 
41ZOZ1 


1 0/2*2 FiO./^ rr-yJ.^ 9xlS7fV7 
ISO O.JJU0.gZ4J>_ZHJ /O / 


T7 

X 1 




T 1(^9 -PolonlMT etT J ver 






lOQT. 17i£ rr-yd/* 9x1^779 
X^O J.±UO.gZH , «J_ZH\j / /Z 


r 


1VAU l/vt J> J *rZ \»y . VJ 


1 TP?-r , n1 onA/TetT ,iver 

\J Vi» V> VII V/1XLVJL& LI /A V vl 


zyoo 


AO. 1 0 1 9 
4U IzlZ 


1 Oil T70.7 rr-yJ.^ 94S784 


17 
JT 


M0n04^^^4A*Fl 0 


T JCl-ColonMetLiver 


90C7 

zyo / 


44 /UD / 






\in004^^40D*C03 


I JC2-ColonMetLiver 


OQQO 

zyoo 


/LQ£On.O 

4yoyuy 


1 0/2*2 CIO, ct«7/1^ 94 < 5ft'30 




IVXv V/V/'T-? J A« O^V.A-» lAi 


TJC2-ColonMetLiver 

V-/ vm v/VlViU'AVtUl V VI 




/1G1 Q22 

4yiy3J 




r? 




T TC2-ColonMetLiver 


oooo 

zyyo 


4yoo/o 


IQTJ "Ml/I jttA^ 9d.^RAl 


r 


M0004^^'SSD*E1 2 


UC2-ColonMetLiver 

v UlV/iilVXVLLjl V vl 


zyy i 


3UUU4U 


lQOO r>i 9 rr . 7 /l'2 94^R/;i 
1 7Jj .D X Z .gZIO ZHO OO 1 




Mfl0041^26A'H06 


TJC2^oloiiMetLiver 

\J V/Z/ V VlV71U TXV kUl V VI 


zyyz 


^^2^o>i 
553594 


lQ-ll Ol 9 rrrAI 9xl^R7xl 

1 y J D . %J X Z .gZ4 J_Z4 J 0 / *f 




M0nO4^^'S4D , H0R 

lVXUv/v/ fc T J JJtl/ .X lv O 


T JC2-r , olonMetLiver 

KJ V_/^» v/UlUlli'lvllJl V vl 




Til 

5ol /12 


ly^ J.JN14.gZ4J_Z £ tJ!7UJ 


17 

r 


M0nn4^^S4A-F06 


T Tr*2-Colon MetLiver 

\J\^Z* vUIv/IUtIvII j± V vl 


too/i 


A <AA9*7 

45002 / 


IQOO T1 SL trrjA'X 9x1^017 

lyjj.J ID.gZ4J__Z'tJ57l / 


17 


M0004^^44r , r , l 1 

1VXV/VV/^J J*T*tV-/.V^X X 


T IC2-ColonM etLiver 

VJ V-/ \s vUvIUtIv l»l IX V vl 




/I ^A 1 OQ 

450129 


iyj J .JS.10.gZ4j_Z £ tJi/ 


17 


Mnn04^ r ?4RA-D02 


TJC2^olonMetLiver 


zyyo 


492 / /9 


iqoo r^lT rr?A^ 9xlSQxl9 
XV.2.2.v^X / .gZ4^_ZHJf*Z 


17 


M0004^^9 QR -H07 


T jr2-ColoiiMetLiver 

V 1 vUlvlUTXVlouX V VI 


ioot 
2997 


/l A£Q 

453068 


1 y 3 -5 . v^ 1 o .gZ40_Z43 sO o 


r? 

17 


TVyf C\(\(\A'X ^ 9 0 P • D 1 9 


T TP9 -Col onlVTetT A ver 

W V/Ar vUlUlllVlvl 1 tX V Vl 


oaao 


450804 


1 y33 .r 1 8 .gZ43_Z43yO 1 


•p 


iv>rrtnnzi^ ^ ^ & c • A 04 

IVXU V/ v*r jjjUv', ^VV/*t 


T TP9 -Colon lVTetT ver 

VJ v^-*-»^V^WlV>lllVXvl.l /I V vl 


2999 


1015 


1 CY11 TVT1 Q *2 9/f <0*£Q 

1 933 .IN 1 8 .gZ43_245y oy 


r 


\A(\(\f\A'X1^AT±'C % C\A 


T TP 9 -Pnl onlVfptT .ivf»r 

VJ v^i V^VIlVfXJLLVXwUXjJ. V V/A 


3000 


CxC 1 /I 

561834 


1Q*2*2 TVT9A n^wA *2 9/KOA1 
1 y5 D .IN2U.gZ4^_Z40UU 1 


F 
r 


7v>innn4^^S4R-Fi 9 


T TP9 -PolonlVTetT J ver 

VJ V/Z. v/UlUlliVXU I.I /l V vl 


3001 


552687 


1933 .U23 .gZ45_24oU5 U 


17 
r 




TTP9-r , olonTvTptT ivpr 

VJ V/// V^UlUlllYlULljl V V/l 


3002 


5 17346 


iy 33 .\J24.gZ43_Z40 ,, JDO 


17 

r 




T TC*9 -Colon IvTetT .iver 

VJ V^^ V^VJlV/llXYXvl.I /l V vl 


3003 


491448 


1 0*3< DAs rw/1'2 9/I/i9'5?0 

1 935 .1305 .gZ43__24o2oy 


17 

r 


ivyf nnnzi^ ^ 09 r* -P09 


T TC , 9-Polon'M"ptT .iver 

VJ Vx^r V^VIlV/XLLVXV^LX^l V vl 


3004 


CCOOAA 

553890 


1935.1'v06.gz43_2463 14 


17 


"MrnnnA^^m A-no7 

IVlUUU^t D jU 1A.17v / 


T TC9-C l olon Tvf PtT Jver 

VJ V-^-o V_^ VII Ul 11 V J. v U— (1 V vl 


3005 


635439 


iy35.IN06.gZ4J_Z4O^l / 


17 


A/rn004^ ^041-1 * C 09 

1VAU vvHj ^ W*tX^ . V^VJii* 


T JC 2-ColonMetLiver 

VJ Vm vvlV/lXL>Xv LXJ1 V V-l 


iUOo 


4U2U/0 


1 a*2 c Tn7 nvAl 9JW39K 
iyj J.1U / .gZ4J_Z400Zo 


17 


^0004^40 QC * C07 


UC2-ColonMetLiver 


300/ 


448924 


i yo 3 .nuo .gz4^_z*-to 


17 


1^/10004^406^(^04 


I JC2-ColoiiMetLiver 


■JAAO 

JOOo 


562886 


lAQC AAQ crrrAI 1 AfSX 

iyo3."UUo.gZ4J_Z400 JU 


JT 


A/T0004'* SOSRGO^ 


T JC2-ColoriMetLiver 

VJ V-^ -i— V> V/1W 111 »X v U— f X V Vi 


3009 


558055 


IQ-jr AAQ „ rr yl'2 9/1^*2^^ 

i y3 5 . uuy .gz4 ^_z4oj o o 


P 




T JC2 -ColonMetLiver 

VJ VVIUI JU.TXV ill /I V vx 


3010 


451456 


1 y jj .rll 3 .gz 4 K>__z*i-o*t j j 


p 

JT 


A40004^407r , H08 


UC2^olonMetLiver 


3011 


555399 


iy55.IN J.0.gZ40_Z404 / / 


P 


MOnO4^SO4P*F0^ 

1VX v V/\/'*T»J J Ut w ■ l—i\J-J 


UC2^olonMetLiver 


3012 


498629 


iy35»xll / .gZ45_Z404o / 


P 

r 


Mnn04^4n7D*G06 


T TC2-ColoixMetLiver 

VJ V^^- VV/lVlliTlVU-Jl V VI 


o a i o 
3013 


/I AAOA< 

490805 


lA-Jf 17 1C **rrA1 9/1/^ OA 

1 y35 .Hlo .gZ4o_Z40 D UU 


p 

JT 


lV/f 0004^401 A'FOO 


T JC2-ColoixMetLiver 

VJ <w V_x vlvlUVlv H /l V VI 


OA1 y| 

3014 


450335 


iy35.r 18.gZ4J_24o3Ul 


P 
I* 


Ayfnnn4^An^*R- A 1 9 

IVXU vU*r J *t V/O X> . r\. 1 


UC2-ColoixMet^Liver 

vXr VyV/lvUiTlVULU V VX 


3015 


446964 


10*2^XT99 rrrrAI 9/1^^7*2 
1 y 35 .JN 22. gZ43_Z403 


P 

r 


MOO 04^ *S 04F> * G0X 


UC2-ColoiiMetLiver 


1 A 1 

3016 


522703 


iy35.Jr24.gZ45_240OU / 


p 

r 




IJC2-ColoiiMetLiver 

VJ V^^« VV71V/1UTXVUJ1 V VX 


3017 


C A noc *5 

549853 


1 A*}/: 17A*; -5 0/1/C/CQ9 

1936.K05 .gz43_246682 


17 


1VXUUU J *f *f ^H-rV.xT-v/H 


T TP9 -Colon lvf etLiver 

VJ vVJlUlilYlv I I tx V vl 


3018 


560275 


1936 M05 ez43 246684 


F 


M00054497B:C10 


UC2-ColonMetLiver 


3019 


481057 


1936.P05.gz43_246687 


F 


M00054501C:E08 


UC2-ColonMetLiver 


3020 


414739 


1936.C08.gz43_246722 


F 


M00043508D:C01 


UC2-ColonMetLiver 


3021 


559720 


1936.O08.gz43„246734 


F 


M00054500A:F04 


UC2-ColonMetLiver 


3022 


554908 


1936.Mll.gz43_246780 


F 


M00054497D:A04 


UC2^oloiiMetLiver 


3023 


449000 


1936.Nll.gz43_246781 


F 


M00054498D:F01 


UC2-ColonMetLiver 


3024 


556768 


1936.F13.gz43_246805 


F 


M00054488A:F01 


UC2<:oloiiMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTEE 


SEQ NAME 


0RIE 
NT 


CLONE ID 


LIBRARY 




44007R 


1036 T1 5 €7^43 946X40 

1 Zf3 D.l 1.7 .gZH.7 ZHOoHVJ 


JT 


AAnnn^4409 a -1^104 

IVIU UU D 44 y Z/V. JLFU4 


U l^Z-^01011iVlcLL»lvcr 


3096 




1Q36 T16 rr-74.3 9468^7 
17jOJ 10.gZ4J ZWoj / 


r? 
r 


IVIU UU 3 44y O \-/ .HU4 


u ^z-^oionivieiJLfiver 


3097 


S69R93 


1036 131 8 <7743 946881 


JT 




u z- v^oioiuvie LJL/i ver 


3098 
OUZO 




1036 F91 o"t43 946039 


Jt* 


tv/rO0O^44ft7r*- A 0 1 


u U/Z-i^oioiuvicLxjiver 


3090 


^S9Q90 

JJZ7ZU 


1036 H93 <*t43 946Q74 
X 7jO. UZj .gZ4J Z40y / 4 


T? 

17 


A/TOOO^A^Ol A -PI 1 


T TP 7 P /^r» A /T/»*T -i<irov 

u v^z ~ v^oionivic u^iver 


3030 


SS90R6 


1071 A01 <t:t43 946009 




A/T000^47 1 7 A • A 0^ 


T TP7-Pn1r»nA/Tp'tT iirw 


3031 


S^3193 


1071 nni cr^43 946005 


r 


Ayf 000^479 OP- A 0 1 


TTP9 PnlnnA/l^T ix«»r 
U v^Z " V^OlOIliVlCLLfl vcr 


3039 


561636 


1071 FT01 ot43 946000 




M000S4794H-P0S 


T TP 9 -Pnlnn A/ff*tT ivpr 


3033 


406779 


1071 ICOl o74^ 947009 


F 


^000^47971^-^06 


TTP9~Pri1nnA/f^»tT tx/iai* 
U \^Z~V^010nxV16U-AV©X 


3034 


•7*ry»/»Jl/ 


1Q71 AO^ cy-743 947H94 
1 y / 1 .AUj .gZ40 Z4 / UZ*f 


t? 

r 1 


A/I 000^47 1 7 A - P07 


T TP 9 „Pr»l nn A/f «r#»r 


303^ 


JJJOJ / 


1071 PH^ a-743 947057 
iy / 1 .D\JD .gZHO Z*r IKJD 1 




1V/T0005471 R"R-r*0'^ 




^o^6 

jUjO 


^771 A 
JJ / / 14 


1Q71 rr-rA^ 9A70^G 

1 / 1 .UUj .gZ4.5_Z4 / U3 y 


r 


A/rooo^A77or , -m 0 


TTP9 Prtl/A«A/T<a+T it/or 


^0^7 


44t>yyy 


1071 "PH^ rr-rA^ OA7H6A 
iy / 1 XUj .gZ4-7 Z4 /UOU 


17 

r 


\/TOOO^A771 P-Fll 1 
1Y1UUUD4 /Z 1^.1-11 1 


U v^Z - v^OlOIUVlclJulVcr 


jUjo 


A^OAI o 
4DU41U 


1071 ThH^ rrrrAQ O A7H/^ 1 


17 

r 


A/T000^A777r , -F*0 1 


¥TP9 Pyxl/^-r%A/T«fT iirar 




AQAAAO 
4o4U4«3 




T7 

r 


A/FOOO*CA77tYR- A Aft 


u LxZ -\^oioiuviciJ-fivcr 


on/in 


A^9A£ft 
43Z4oo 


1071 XTH^ rrrrAO OA7AAO 




A/TAOO*A7'3 1 A »n07 
1V1UUUD4 / D 1A.UU / 


TTP7_Pr^1/«*nA/r<Kl-T nrar 


^OAI 
3U4 1 


A^097fi 
43UZ /o 


1QT1 An^rr-rA^ 0A7A70 
1 y / 1 .Al/O .gZ4 J__Z4 / U / Z 


T7 

r 


A/TOOO^I A7 1 7 A •fT-OO 


T TP9_P^x1 r\nA/f/i+T « rA r 


^OA9 




1Q71 FinA cttA^X lAICYlS*. 
iy / 1 .LiUO.gZ4j__Z4 IK) ID 


r? 
r 


A>fO00^A77Or*-M0 1 


TTP9 Prtlrt«AA**tT 

u v^z - v^oionivieuLiivcr 


^OA^ 


<<7A7£ 
DO /4ZO 


iy / 1 .JjUO.gZ4J_Z4 /Uoj 


17 

r 


A/TO O O ^ A77 Q"R • "R O ^ 


u i^z-^oioiiivicix#xver 


3U44 


!)D331o 


iy / x.rUy.gZ43_Z4/lZ3 


r 


A/TAAO^A777Tl'r^nft 


T TP9 _Pr>1r^r»AyTia+T ixror 


"3A/K 
oU4j 


1 5 

JJJOlO 


1CY71 TAO rrrrAI OAT 10ft 

iy / 1 .iuy.gz4o__z4 / izo 


r 


A/TO0O^A77/^ A -FlOO 




-5U40 


jDiuyo 


1071 tTin t^A'X iaiiai 
iy / 1 .rllU.gZ4o__Z4 / 14 j 


17 

r 


A>fOAO^A77^ A • A O-^ 


TTP7_Pi-k1^»-r»A/Trrf-T itrdf 

u v^z«^oionxvieiXiivcr 


OU4 / 


cc/r/rcA 
DD0OD4 


iy / l.r'lU.gZ4j_z4/ ID 1 


17 


A/TO 0 O 5 A 7*2 AT3 • C 1 06 


TTP9 PAl/-»«A/Ti*4T iirAr 


OA/IO 

oU4o 




iy / ijtii i.gz4^_z4/ ioy 


17 

r 


A/TOOO<A77^ A -POO 
1VIUUU34 lz,DJ\.r Uy 


TTP7_Pi-fc1/^*%AA«*-T iiror 


OA AO 

3U4y 




1Q71 1 rrrrAI OATI^A 

1 y / 1 .IVl 1 1 ,gZ4 J__Z4 / 1 04 


17 


A/rOOO^A7QOT3«F1 1 




^o^o 


oOl / D 


iy / X.rllZ.gZ40 Z4/ 1 ID 


Jr 


AyTOOO^ A77 ^ A -FT06 


TTP9_Pr»lr»nA/T<*fT ii«*r 


30^1 




1071 T 19 0^4*3 947 170 

iy / i.i^iz.gz4o / 1 /y 


p 
r 


1^000^4790^-^08 


TTP9-PrklnriA>l**t-T ivpr 


"30S9 


A91Q^Q 


1 Q7 1 r^l 4 tfy-rA^ 9479 H6 
1 y / 1 . VJ l^f .gZ4^ Z4 / zuo 


p 


Tv/T000^4794 A -IT-09 


T TP9_PnlnTiA/fp»tT 




AA£A^ 
44o43.j 


1071 "MIA n-rAI 7A771^ 
iy/ I.lN14.gZ4^ Z4/Z10 


P 


A/T000^47^ 1 13-r>04 


T TP 9 «Pr»1 r*n A/Tp»tT ii/i»r 

u ^Z" v^oioxiivxciX/ivcr 


*20^a 

OUJ4 


JOZZOj 


1Q71 T K /v«AQ 7A7777 
iy / l.l-fl3.gZ4^__Z4/ZZ / 


p 

F 


A/TOOO^ A77 0^ -^07 


TTP9_P/Vlrt«A/I**<T tT7*»^- 


once 


DDO / j)J 


1071 T1/£ /v«A7 OA77AA 
iy / 1.110.gZ4«j Z4/Z4U 


p 

F 


A/T00O^A77^Tl-PO8 


TTP9-Prv1<^r»A/f<*fT ixre*r 

u v^z *-^oioiiivi6XJ-fi ver 




jj / /4/ 


1Q71 T17 rtT/lQ 7A7756 
iy /l.xl / .gZ4o_Z4/ZjO 


p 
r 


AAOO O <J A77 6TI • PO O 

iviuu u j 4 / zdjd . f u y 


T TP9 _Pr*l/^r» A/f/atT n/pr 

u \^z ■- v^oioiuvictJ-#xver 


/ 


14j /3 


ly /l.JN 1 / .gZ43_Z4/zol 


17 
F 


A/TOAA^A7'2 1 "D.#^107 
JV1UUU34 Id IJd.OUZ 


u i^z oioiiivie uLrivci 




t AO 


1Q71 D17 ~— /I Q 7/170iC' , 2 

i y / 1 .F 1 / .gZ4 j_z4 /ZO J 


17 

r 


A/f AAA^A7*5AP- A07 






-C^/f A/l O 


1071 V 1 Q rt-^/ll 7/1777/1 

iy / l.ivlo.gz43_z4/z/4 


17 

Jr 


A/TAAA^/1 77 QTVT5A7 


u v^z oioiiiVietJLrivcr 


3060 


55031 S 


1071 T 18 c*7*n 94727^ 


F 


M0005472QDD05 


UC2-ColonMetLiver 


3061 


449035 


1971. 12 l.gz43 247320 


F 


M00054726C:D10 


UC2-ColoiiMetLiver 


3062 


524721 


1971.122.gz43_247336 


i f 


M00054726C:E07 


UC2-ColonMetLiver 


3063 


555571 


197 1 .A24.gz43_247360 


i F 


M00054718A:D11 


UC2-ColonMetLiver 


3064 


553869 


197 1 .N24.gz43_247373 


F 


M00054731C:H12 


UC2-ColonMetLiver 


3065 


552686 


1971.P24.gz43_247375 


F 


M00054735A:G09 


UC2-ColonMetiLiver 


3066 


424723 


1972.103.gz43_247416 


F 


M00054743A:C07 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


"LUSTEF 


SEQ NAME 


ORDE 
NT 


CLONE ID 


LIBRARY 


3o/£7 


4Q05 1 7 
*f 77 j 1 / 


1 079 P0^ cr74^ 24742*3 


F 

X 


M00054750DH12 


UC2-ColonMetLiver 


30£8 


^1441 


1079 T 04 0743 74743 S 


F 

X 


M00054745DG09 


UC2^oloiiMetLiver 




8£14^ 
oOX*K) 


1 079 P04 o-743 9474^0 


F 


M0005475 1 A* A05 


UC2-ColonMetLiver 


j\J fV 


J-JOIO / 


1 079 P0S ay4^ 947442 


F 

X 


M00054737BH11 


UC2-ColoiiMetLiver 


3071 


^3834 


1079 TOS ar43 947440 


F 

X 


M00054744A*G10 


UC2-ColoAiMetLiver 






1079 P06 a-743 9474-58 


F 

X 


M0005473 7C ' BO 1 


UC2-ColoAAMetLiver 


jU / D 


4£A93^ 


1079 FOR o^n 247402 


F 

X 


M00054739CB12 


UC2-ColonMetLiver 


3074 
D\J /*t 


^£9393 


1 079 O0Q cr?43 947S 1 8 


F 


M00054750AG10 


UC2^olonMetLiver 


307^ 


4^0149 


1079 F19ct743 247556 


F 


M00054739CE05 


UC2-ColoiiMetLiver 


307A 


33 I JOO 


1 079 0 1 2 $T743 247566 


F 


M00054750BF02 


UC2-ColoiiMetLiver 


jU/ / 


^77379 


1079 R13 0^74^ 247560 


F 


M00054736B:H03 


UC2-ColonMetLiver 


3078 


55867Q 

JJOQ / 7 


1 079 Tj 1 3 a743 24757 1 


F 


M00054738D:F01 


UC2^olonMetLiver 


3A70 


A59957 


1079 Tl^ ©-743 247576 


F 


M00054743CE11 


UC2-ColoiiMetLiver 


jUoU 


^737^3 


1 079 7\T1 3 ot4^ 94758 1 


F 

X 


M00054748BG10 


UC2-ColonMetLiver 


308 l 




1 079 Tsll 4 a^4^ 947507 


F 


M00054748BH09 


UC2-ColonMetLiver 


OUoZ 


^3707 
/UZ 


1079P14crr43 947500 

17 / Z.x iT.giTJ / J77 


F 

X 


M00054751BF12 


UC2-ColonMetLiver 


0/1Q3 


40^837 


1079ni5o741 94760*3 


F 

X 


M0005473 8D GK)7 


UC2^oloiAMetLiver 


3Uo4 


4Q3 1 77 

t+yj izz 


1079 F1S a?43 947605 


F 

X 


M00054740CH08 

lTlv/V/V/*/ «^ f ^tW^«a A\/u 


UC2-ColonMetIiver 




4031 3<; 


1079 fx15 cr^43 947606 

1 7 /Z.VJ1D .gZHJ Z*+ / \J\JyJ 


F 

X 


M0005474 1 CD08 


UC2^oloiiMetLiver 


3UoO 


3oz/y^ 


1 079 K" 1 5 o-^43 9476 1 0 

l7 / Z.1V1.} .gZHO ZH- /VJl'U 


F 

X 


M00054745CB12 


UC2-ColonMetLiver 


3Uo/ 


Do /5D*t 


1Q79 Tl£ »-7 , 43 94769 S 

17 / Z.J 10.gZ*T^ ^r*T / O^-J 


F 


M00054744CD02 


UC2-ColonMetLiver 


OAQQ 

JUoo 




1079 018 rr^43 947669 


F 

A 


M00054750CD01 


UC2-ColonMetLiver 




551 778 
331/ fo 


1Q77 Ol Q rr-7'43 947670 
17 /Z.'<Jl7.gZ A t-?_Z A f- /O /U 


F 


M00054741DC05 


UC2-ColoiiMetLiver 




555 ATT 
DDDU/ / 


1Q77 TIO rr«y43 947677 
1 7 /Z.ll7.gZ*t j__Z*f /O /Z 


r 


M0005474^DF04 


IJC2-ColonMetLiver 




557871 
DD /o / 1 


1 Q77 IT 1 Q rr-743 947674 
17 /Z.lS-X7.gZ*fD Z*+ / D / *+ 


F 
r 


M00054745CH02 


UC2-ColoiiMetLiver 

V—j'^-^ V/\/X V/1XX. TAW bW. » W^* 


0U7Z 


5 £09 89 
DOUZoZ 


1079 P91 947608 

17/ Z.'C/Zl.g^'TJ ^»*t / U70 


F 

X 


M00054738BE12 


UC2-ColonMetLiver 




5fY3457 
DUD*fDZ 


1079 T 99 a743 94779^ 

17/ Z.JuZZ.gZ'T^ ^»*T / / 


F 

X 


M00054746DE05 


UC2-ColonMetLiver 


•2AO/1 


555773 
DDD / ID 


1 079 H94 0^43 947747 

17 / Z.l/Z t T.gZ t TJ / /*t/ 


F 

X 


M00054739AF07 


UC2^oloiiMetLiver 


OAQC 

DVzfD 


4743 no 


1079 M94 0-74*3 947756 

17 /Z.lvl^'r.gZ.T'J ^.*r / / JU 


F 

X 


M00054747DB05 


UC2-ColoiiMetLiver 




55^01 O 


10R1 T01 or43 947760 

170 1 JUl .gZ*T*> Z.'T / »VJ7 


F 


M00054764DF01 


UC2-Ck)loiiMetLiver 


3U7 / 


477 1 OA 
*r /ZX7O 


1081 T09 ot4^ 947784 

l70l.l\7Z.#4Z*T-? _^»" ' / O^T 


F 

X 


M00054763A*A10 


UC2-ColonMetLiver 




5 1 3A37 


1Q81 r.03 *vy43 94770-5 
1 70 1 ,I/Uj ,gZ*tD z*t / / 7J 


F 

r 


M00054755DE05 


UC2-ColoiiMetLiver 




553 1 58 
DD j IDo 


1081 T703 rrrr43 947707 
IVol.r UD .gZ z tD_ZH / / 7 / 


F 


M00054750AB08 


i UC2-ColoiiMetLiver 

V^V/XvXXXTXVH /* ▼ 


3 1UU 


4Q8££7 
4tOOOZ 


1 08 1 T A3 rr-743 947803 
1 70 1 ,gZ*f D Z*r / OUJ 


X 


M00054766P-B08 


UC2^oloiiMetLiver 

v/Zv v/\yxvxxx f xw ihWi * 




551 £03 


1 O8 1 r*A4 rr r^43 9478 1 n 


F 


M000^4754HF1 1 


UC2-ColonMetLiver 


3102 


554021 


1981 J04.gz43_2478 16 


F 


M00054763C:D07 


UC2-ColonMetLiver 


3103 


550402 


198 1 .004.gz43_247822 


F 


M00054769B:D12 


UC2-ColoiiMefLiver 


3104 


448510 


1981.P04.gz43_247823 


F 


M00054770B:D09 


UC2-ColoiiMetLiver 


3105 


558900 


1981.105.gz43_247832 


F 


M00054763C:F10 


UC2-ColoiiMetLiver 


3106 


554294 


1981.K05.gz43_247834 


F 


M00054765D:D05 


; UC2-Q)loiiMetLiver , 


3107 


450840 


1981.106.gz43_247848 


F 


M00054763C:H04 


UC2-ColoiiMetLiver 


3108 


561487 


1981.J07.gz43_247865 


F 


M00054765B:C03 


UC2-ColoiiMetLiver 
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1NSDOCI0: <WO_0214S00A2_l_> 



WO 02/14500 



PCT/US01/25840 



Table2 



SEQ 
ID 
NO 


CLUSTEB 


SEQ NAME 


ORIE 
NT 


CLONE ID 


L_ LIBRARY 


3109 


552541 


1981 L07 sz43 247867 




M00054766CE01 


\J V^^~\^IJX\JXJUVxCLx-#l VOI 


3110 


528775 


1981 P07 ez43 247871 


F 


M00054770CA04 


TIC^-CnloTiiVrptT iver 

V7 Vs^r VvV7AV7iXlVXt'LX-fJ. V Vvl 


3111 


553918 


1981B08ez43 247873 


F 


M00054753C H02 


TTP9-Pn1riTiiV/fi*|-T ivf*r 

V7 Vx» Vw^ VJ JV7iXi VXC LXjI V C*l 


3112 


477046 


1981 B09 ez43 247889 


F 

X 


lVX\/vf\/»7*T / JJx/.rxUJ 


vJ v^Z, V^lJlVJxxlVXC IXjI vox 


3113 


551289 


1981 D09 ez43 247891 


F 

X 


M00054756A-C ] 2 


TTP9-Pn.nniV/I>tT twr 

U V»y Lll V/lxiVIC L1-/1 V CI 


3114 


263800 


1981 G09 ez43 247894 


F 

X 


M00054760DB0^ 


U V,/ X. V_/ U xUlXxVAC LX-/X V CI 


3115 


495942 


1981 A10 ez43 247904 


F 


M00054752RH06 

iVXV/V/V/-/T f -J XJ . X X V7 \J 


T TP9-r , n1r»nA/fptT ivpr 

V7 Vw/^» VA^lUUlVICIXxl VC1 


3116 


556183 


1981 H10 az43 247911 

JL ^ JL- » JL JL JL V/ mC^trnt* 1 — ^ T / ^ J. JL 


F 


M00054762AD09 


\J\**J^ VvUlvUIVXwlXvl Vvl 


3117 


448357 


1981N10sz43 247917 


F 


M00054768D-A01 


T TP? -Colon IVfptT .ivpr 

V/ V> ^ V^V/lV/XXi VXvUjJ V wl 


3118 


554161 


1981 P10 ez43 247919 


F 

X 


M00054770C'C04 


TIP?-T , o1onTV/fptT iver 

V7 V.^ ^# V71 V7XX1 VX V UL/X V CI 


3119 


256179 


1981 Bll ez43 247921 


F 


M00054753DC 1 2 


V7 V^-^r V^ V71 V/xXx VX Ks LX-/1 VCI 


3120 


555478 


1981Pllez43 247935 


F 


M00054770CD05 


UC2-ColonMetT,iver 

V/ V/Z/ V> v/1 SJXXl VX & Lvl /I V V/A 


3121 


554581 


1981 P 12 ez43 247951 


F 


M00054770CF 1 0 


TTP9-T 1 o1on*MfptT Jvpr 

V7 V^V/lV/XXlYXv UL/X V CI 


3122 


553922 

w mJ -S .4— r 


1981 B13 sz43 247953 


F 


M00054753DH1 0 

lVXV/V7V/w'~ # JJJL/.lXxl/ 


T Tr , 9-r!nlnnM'ptT i*vf*r 

C v^X< V^fCXV7XXiVXCLX-/l VCI 


3123 


554336 


1981 E13 ez43 247956 


F 

X 


M00054758AF03 

1VXV/V/V/»7*T / JOxl.l V/«7 


TTP?-PolonA/fpfT ivpr 

V/ V/Z. V-sV/XV/XXlYXV/lX-sX V CI 


3124 


560529 

\J \J +-s *-d 


1981 F13 ez43 247957 


F 


M00054759CG1 0 

XVX V7 \J \J -J ~ # J7V/.VJ XV/ 


T Tr , ?-Po1onM« : *tT .ivpr 

w J— Vw> V7XV7XX1 VX V 1 1 '* V CI 


3125 


558900 


1981 H14 ez43 247975 


F 


M00054762BF02 


UC2-ColonMetLiver 

W V/ X# V> V7X VJXXJL rlv LJ *X V vl 


3126 


447667 


1981 M14 ez43 247980 


F 


M00054767C D03 


V/ V^- V«/V/AwXXXVX&ULwX V vl 


3127 


551288 


1981 D15 ez43 247987 


F 


M00054756CC08 


UC2 -ColonMetLi ver 

\J V—' V7X V7XX1 VXvULu V Vl 


3128 


122169 


1981 E15 ez43 247988 


F 


M00054758BC 1 0 

XVX \J \J \J %J r I UXJ . X V/ 


T JC 2-ColonMetLiver 

V/ V_y X* V> V/X WJLXl VX^LX- /I V Wl 


3129 


551617 


1981 H15 ez43 247991 


F 


M00054762B *F07 

ivxv/v/v/*/"T / \J i.j.x v/ / 


T JP?-r*o1oniVTetT .iver 

\J V^X*^V— /V/XWXXiVXwtX-^X V vi 


3130 


260558 


1981 L15 ffz43 247995 


F 


M00054766DH02 


T TP?-PoIon IVrpf T .iver 

W V_/ \w V/XV7X XL VX V L 1 /X V Wl 


3131 


558609 


1981 B16 ez43 248001 


F 

X 


M00054754BF04 


T TP 9 -Pnl nnlVfptT jver 

VJ V_^Z» V^V7XV7xXxVXt/vX_/X V Wl 


3132 


554352 


1981 E16 2z43 248004 


F 

X 


M0005475 8B D03 

AVXV7VyV7*7*T / JOJJ.X7VJJ 


TTP9-PolonlVfpfT ivpr 

V/ CZr VA/IUXllVXwllJiVWl 


3133 


556064 


1981 F17 ffz43 248021 

1701,1 X / .^£tTmJ ZitOvZd 


F 

X 


IVXV/V/V/fc/^T / ~f Zs X-J .X-i\J x 


TTP9-Pn1on*\/fptT iver 

KJ V> Z. V^CIL/IIIVIC LX-/1 V Wl 


3134 


508126 


1981 K17 ez43 248026 


F 

X 


M00054766AH1 0 


T TP9 -Polon IVf etT Jver 

V7 VwV/lW^XXXVXWIJUfX V Wl 


3135 


549576 


1981 Ml 7 2z43 248028 


F 


M00054767CH06 


T JP2-.Po1on TVf etT J ver 

W V/^ WvivXUVlv H a V WX 


3136 


554000 


1981 A18 ffz43 248032 


F 

X 


M00054753A-A0S 

xVXWV/V/— » r / J J/V.xVV/J 


T TP9 -Polon IVf pfT .ivpr 

V7 V> JL, V7XUXX1VXC LX_j1 V Wl 


3137 


554049 


1981 E18 az43 248036 


F 

X 


M0005475 8B *H0^ 

1VXV/V/V/-/T / JOU.XlUJ 


T TP9-Po1onlVrptT .iver 

V/ V^X> WCXCXXLVXCIJU/I V Wl 


3138 


558503 


1981 HI 8 ffz43 248039 


F 

X 


M00054762C* A 1 2 


r TP9 -PolonMpfT -i ver 

w V-/ V71V7XXXVXW LXvX V WX 


3139 


451993 


1981 D19 ez43 248051 


F 

X 


M00054756DF05 

1VXV/V7 / •/ VIX^.X V/*7 


T TP? -Pol on IVf ptT .iver 

V/\_/^> V^V/IV/XXXVXWLX^XV Wl 


3140 


558949 


1981019ffz43 248062 


1 F 


M00054770AC06 

J.VXV/V/V/— 'T / / V/i\.Vvv 


T TP? -Poton lvf etT .iver 

V/ WZi \wV/lV/XXIVXW UU1 V Wl 


3141 

«/ X~ X 


553131 

*J *J *J XmS X 


1981 J20 m43 248073 


F 


M0005476Sr*F 1 0 

1VXV/V/V*7~ # VJ«7V>.X XV/ 


TTP?-Po1onlVfptT iver 

w C^i UlvlUVlw LX-f 1 V WX 


3142 


555343 


1981 D21 ffz43 248083 


F 


M00054757A-H07 


T TP? -Pol on MptT jver 

w V> J- Wvl WXXXVXW UX-f 1 V Wl 


3143 


512432 


1981 N21 ez43 248093 


F 

X 


M0005476Q A -frl 0 

xVXx/V/v/— '"r / V/~^x. VJ X V/ 


T TP?-Po1onA/fptT .iver 

KJ V^Z* V_^V/lWXXiVXWUX_/X V Wl 


3144 


550580 


198 1 .B22.gz43_248097 


F 


M00054754C:H09 


UC2-ColoixMetLiver 


3145 


551117 


1981.F22.gz43_248101 


F 


M00054760A:A12 


UC2-ColonMetLiver 


3146 


556475 


1981.L22.gz43_248107 


F 


M00054767A:F08 


UC2-ColoixMetLiver 


3147 


552549 


1981 .M22.gz43_248 1 08 


F 


M00054767D:G09 


UC2-ColonMetLiver 


3148 


470351 


1981.P23.gz43_248127 


F 


M00054771A:E01 


UC2-ColoixMetLiver 


3149 


561535 


1981.F24.gz43_248133 


F 


M00054760A:D10 


UC2-ColoiiMetLiver 


3150 


1093 


1981.G24.gz43_248134 


F 


M00054761C:E02 


UC2<:oloiiMetLiver 
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3NSDOCID: <WO 0214500A2J_> 



WO 02/14500 



PCT/US01/25840 



Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


D X-J X 


2284 

LH3T 


1981 H24 ez43 248135 


F 


M00054762D:C06 


UC2-ColonMetLiver 


31S9 

J X JL 


5S5958 

JJJ7J O 


1982F01ez43 248149 


F 


M00054778A:D01 


UC2-ColoiiMetLiver 


31 S3 


492779 


1982 A02 ez43 248160 


F 


M00054771B.F12 


UC2-ColoiiMetLiver 


31S4 


SS3848 


1982P02ez43 248175 


F 


M00054806A:G04 


UC2-ColoiiMetLiver 


31SS 


SSS686 


1982 C03 <*z43 248178 


F 


M00054774B:A07 


UC2-ColonMetLiver 


31 S6 


SSSS24 


1982H03&z43 248183 


F 


M00054780B.B06 


UC2-ColonMetLiver 


31S7 


SS1068 

_J«7 J wuu 


1982 A04 sz43 248192 


F 


M00054771C:A11 


UC2-C^lonMetLiver 


31 S8 


SS9183 


1982 004 ez43 248206 


F 


M00054805A:H04 


UC2-ColonMetLiver 


3 ISO 


SSI 003 


1982 F05 ez43 248213 


F 


M00054778A:F08 


UC2-ColoiiMetLiver 


3160 


46S907 


1982G05ez43 948214 


F 


M00054779B.A07 


UC2-ColoiiMetLiver 


3161 


S33901 


1982 H05 ez43 248215 


F 


M00054780B:E03 


UC2-(^lonMetLiver 


3169 


SS3868 


1989 fCOS 0743 248218 


F 


M00054786DF08 


UC2-ColonMetLiver 


3163 


SS3774 


1982BO6 0z43 248225 

X 7Q^il/vv»ftAiTJ L~o^x^-y 


F 


M00054772D *G1 2 


UC2-ColonMetLiver 


3164 


SS4079 


1 982 C06 0z43 248226 


F 


M00054774B*G08 


UC2-ColonMetLiver 


316S 


SS4898 

JJtOZ.0 


1 989 F06 &743 248228 


F 


M00054776C:G06 


UC2-ColonMetLiver 


3166 


SS0879 


1982G06&z43 248230 


F 


M00054779B:B07 


UG2-ColonMetLiver 


3167 


401 S70 


1 982 H06 &z43 24823 1 


F 


M00054780B.G10 


UC2^oloiiMetLiver 


3168 

J 1UO 


97S 


1Q89 M06 0743 248236 


F 


M00054802C : A07 


UC2-ColonMetLiver 


3160 


SS901 9 


1982C07ez43 248242 

170LiV/v' »ftf 


F 


M00054774C:A03 


UC2-ColoxxMetLiver 


3170 
.JX /u 


SS61 83 


1982G07ez43 248246 


F 


M00054779B:B11 


UC2-ColonMetLiver 


3171 


SS9S3S 


1 989 T07 f>743 248249 


F 


M00054784CD11 


UC2-ColonMetLiver 


3179 


SS171 8 


1 982 A08 ez43 248256 


F 


M00054771D:G01 


UC2^olonMetLiver 


3173 
D L I D 


SSI 1 60 


1 982 B08 &z43 248257 


1 F 


M00054773A:E09 


UC2-ColonMetLiver 


3174 


SS1080 


1982F08ffz43 248261 


F 


M00054778B.A02 


UC2-ColonMetLiver 


317S 


S17146 


1982 P08 ez43 248271 

j. _y u w » x v/u .ftxi~»/ Xrf^w*- * x 


F 


M00054806C:C12 


UC2-ColonMetLiver 


3176 


409 1 47 


1982 B09 ez43 248273 


F 


M00054773A:H05 


UC2^olonMetLiver 


3177 

Di.it 


SS7269 


1982 D09 ez43 248275 


F 


M00054776A.D10 


UC2-ColonMetLiver 


3178 


S303S3 

D D-s D ~J ^ 


1982N09&z43 248285 


F 


M00054804A:H04 


UC2-ColonMetLiver 


3170 


S60S90 


1989 f>10 0^43 248291 


F 


M00054776AF01 


UC2-ColonMetLiver 


31 an 


S7S094 


1 089 Tl 0 Q743 248296 


F 


M00054782D D 12 


UC2-ColoiiMetLiver 


J iol 


^S4S89 


1089 T10 0743 248297 


F 


M00054784D-B01 


UC2-ColonMetLiver 




SSS6Q6 

JJJ07U 


1089 M10 0743 248300 

X70Z.XVXX V.g^*T«7 itOJuu 


F 


M00054802D * A09 


UC2<)oloiiMetLiver 


31 83 


SS68S6 


1089 O10 0743 248302 


F 


M00054805B:E06 


UC2-ColonMetLiver 




447096 


1089 Till 0^43 948307 


F 


M00054776AF07 


UC2-CoIoiiMetLiver 


3 1 ft^ 

.5 JLoD 


D3j /HO 


1QS9T711 cr-743 948308 


F 

X 


MOOOS4776D*G09 


UC2-ColonMetLiver 


3186 


553979 


1982.Nll.gz43_248317 


F 


M00054804B:E07 


UC2-ColonMetLiver 


3187 


497912 


1982.OH.gz43 248318 


F 


M00054805B:E11 


UC2-CoIonMetLiver 


3188 


412621 


1982.D12.gz43 248323 


F 


M00054776A:G01 


UC2-ColonMetLiver 


3189 


497477 


1982.M12.gz43_248332 


F 


M00054802D:C02 


| UC2<:oloiiMetLiver 


3190 


552188 


1982.J13.gz43 248345 


F 


M00054785B:B07 


UC2-ColonMetLiver 


3191 


456052 


1982.M13.gz43 248348 


F 


M00054802D:C03 


UC2-CoIoixMetLiver 


3192 


143346 


1982.H14.gz43 248359 


F 


M00054780C:G11 


UC2-ColonMetLiver 
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INSDOCIO: <WO 0214500A2J_> 



WO 02/14500 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORE 

NT 


CLONE ID 


LIBRARY 


3193 


553237 


1982 N14 ez43 248365 


F 


M00054804C-F04 


T TP?-Pn1nn"M>tT iT/f*r 

w V^^"V^UIUIllVxCLJL/l VCI 


3194 


556856 


1982.014.gz43 248366 


F 


M00054805BG02 


V^UIVJIxIVxCLXjIVCI 


3195 


586600 


1982.A15.gz43 248368 


F 


M00054772BB01 


TTC^-CnlnnlVfptT ivpr 

V/i' V^wllJxXI.VX&IJLfXVwl 


3196 


562263 


1982 J15 ez43 248377 


F 


M00054785CC02 


T Tp ? -P nl nn A/f f*tT ivf»r 
vj \~t£* v^vjiuxiivxcixji vci 


3197 


555462 


1982.C16.gz43 248386 


F 


M00054775AD07 


UC2-ColoiiMetT ivpr 

w Vvai \J l\J± XL VX W VI— iL V CI 


3198 


394168 


1982.016.gz43 248398 


F 


M00054805BG10 


TJP2-Cn1nnMptT ivpr 

V> V^X> V_/ VJ A UXXLVXw Lul V Vvl 


3199 


472672 


1982.A17.gz43_248400 


F 


M00054772B:D04 


UC2^oIonMetLiver 


3200 


595181 


1 982. C 17 .gz43_248402 


F 


M00054775A:G03 


UC2-ColoiiMetLiver 


3201 


554627 


1982.G17.gz43 248406 


F 


M00054779D:F07 


UC2-ColonMetLiver 

\-/ Vy ^ V> W 1 V_f XXI VX\a/ I.I /X Vvl 


3202 


552325 


1982.I17.gz43 248408 


F 


M00054783CC03 


UC2-ColonMetLiver 


3203 


462511 


1982J17.gz43 248409 


F 


M00054785CG11 




3204 


552813 


1982.K17.gz43_248410 


F 


M00054788C:G04 


UC2-ColonMetLiver 


3205 


175758 


1982.M17.gz43 248412 


F 


M00054803AD08 


UC2-Cn1onMetT .iver 

\J <t* vXWXXLtX w UX</1 V vl 


3206 


493085 


1982.A18.gz43 248416 


F 


M00054772BF03 


UC2-T , o1onMetT A ver 

W v- ' ^ V/1V/XX1VX v LX-/1 V vl 


3207 


557834 


1982B18az43 248417 


F 


M00054773CF09 


v* V^vlVXllVXv ULll V vl 


3208 


548998 


1982.F18.gz43 248421 


F 


M00054778CF09 


UC2-CoionMetI Aver 

KJy^^* vvlUUlYlvLLJl Vvl 


3209 


551425 


1982 G18 ez43 248422 


F 


M00054779DF08 

XTX \J\J \J*S^T i 1 S , A. \J VJ 


UC2-Co1onMetT ..ver 

w VxVlWlllTXvtX-/l V vl 


3210 


490414 


1982 K18 ez43 248426 


F 


M00054789AA02 


UC2-T t n1onivfptT iver 


3211 


551553 


1982 Ml 8 ez43 248428 


F 


M00054803A-E10 


T TP ? -Pnl nn lvTpfT ivpr 

\J V^VJ1VJ111VXvLX-/1 VC/1 


3212 


555571 


1982 N18 ez43 248429 


F 


M00054804D-D07 


W \w/ vJlUlxl Vx v Lx^l VCI 


3213 


448629 


1982 B19 ez43 248433 


F 


M00054773CF 1 0 


T TP? -Pn1nnA/i>tT twr 


3214 


552005 


1982 J19 ez43 248441 


F 


M00054785D • A07 


T TP?-Pn1on KTptT fi/f»r 


3215 


554116 


1982 M20 ez43 248460 


F 


M00054803BB 1 2 


TTP9-Pri1rm"M>tT iv<>r 

v» V^VJIUxlIVlClJ-rl VCI 


3216 


505792 


1982H21ez43 248471 


p 


M0005478 1 AH09 


T JP2-rVi1r>nMf»tT ivf»r 

uvA X^rV/lvXliVXCLxjlVCl 


3217 


561422 


1982 J21 ez43 248473 


F 


M00054786A-G1 1 


UC2-C , n1onMptT iver 

w V_^x* V-tiUJlll VI v LX-/1 V vl 


3218 


551755 


1982 M24 cz43 248524 


F 


M00054803 C GO 1 

IT JL V V/V */ ^ U w«/ V/.VJV/ X 


UC2-Co1onMetT^iver 

V* Vi ^UlUlllVlVULjl V vl 


3219 


550063 


1945 J03 ez43 248963 


F 


M000545 13 AB08 


UC2-ColonMetLiver 


3220 


388055 


1945 D04 ez43 248973 


F 


M00054506BA07 


I JC2-Co1ftnMetT Jver 

*J V-/^. CI1 vxXl VX v t.I /I Vvl 


3221 


572807 


1945 C05 ez43 248988 


F 


M00054505AG12 


T TP?-Pn1nnK4pfT ivf>r 

v» V^Zr V^UIUXXLVXCULf 1 V CI 


3222 


455814 


1945 J05 ez43 248995 


F 
i 


M000545 13 AF09 


\J V^VJlVJIIIVxClXwI VCI 


3223 


452471 


1945 H07 ez43 249025 


F 


M000545 10C-G07 


UC2-ColonMetT J iver 

\J V-/ \JX VJlXl V 1 v LX-fl V vl 


3224 


551518 


1945 J07 ez43 249027 


F 


M000545 13BE1 1 

IVXV/ V/vy»y "T«y X«JX*.JL/X X 


T TP 9 -Pnl nnA/T^tT iv^r 

V-/ V^X. wvxvTXIVXCIJLjI VCI 


3225 


555660 


1945 J09 ez43 249059 


F 
i 


M00054513C-A01 


\J V^VJlV/XXiVxCvXwl V CI 


3226 


524721 


1945 Bll ffz43 249083 


F 


M00054504BA04 


\J \_/^,~V>*Jl Will VXCLXjI VCI 


3227 


550986 


1945 Dll ez43 249085 


F 
i 


M00054506BH01 

1VX V/ v/vf ^"Tw' VyVJXJ .X XV/ X 


TTP9-Pn1nn'Mf*tT ixr#*r 

VJ X-xZ. *" v>vJ1vJII1V1C ULrl VCI 


3228 


499696 


1945.Gll.gz43_249088 


F 


M00054509C:G01 


UC2-ColonMetLiver 


3229 


554611 


1945.Nll.gz43_249095 


! F 


M00054519A:C04 


UC2-ColonMetLiver 


3230 


557681 


1945.F12.gz43_249103 


F 


M00054508B:E08 


UC2-ColonMetLiver 


3231 


550678 


1945J13.gz43_249123 


F 


M00054513D:F04 


UC2-ColonMetLiver 


3232 


554246 


1945.K13.gz43_249124 


F 


M00054515B:E11 


UC2-ColonMetLiver 


3233 


550129 


1945.F15.gz43_249151 


F 


M00054508C:B04 


UC2-ColonMetLiver 


3234 


555771 


1945.F17.gz43_249183 


F 


M00054508C:C08 


UC2-ColoiiMetLiver 
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3NSDOCID: <WO 0214S00A2J_> 



WO 02/14500 
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Table 2 



SEQ 
ID 
NO 


3LUSTEE 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


3235 


572992 


1945 018 ez43 249208 


F 


M00054521D:H01 


UC2-ColonMetLiver 


3236 


461313 


1945 J21 ez43 249251 


F 


M00054514C:B01 


UC2-ColonMetLiver 


3237 


554456 


1945.L21.gz43 249253 


F 


M00054516D.F09 


. UC2-ColonMetLiver 


3238 


547866 


1945 F23 ez43 249279 


F 


M00054509A:C01 


UC2-ColonMetLiver 


3239 


458974 


1945 P23 az43 249289 


F 


M00054523C:A11 


UC2-ColonMetLiver 


3240 


554839 


1945 C24 gz43 249292 


F 


M00054506A;D05 


UC2-ColonMetLiver 


3241 


487658 


1945.I24.gz43 249298 


F 


M00054512D:H08 


UC2-ColoiiMetLiver 


3242 


556940 


1946 HOI gz43 249313 


F 


M00054533C:B04 


UC2-ColonMetLiver 


3243 


493575 


1946.N03.gz43 249351 


F 


M00054541C:C10 


UC2-ColonMetLiver 


3244 


466020 

vy v 


1946.H05.gz43 249377 


F 


M00054533C.E07 


UC2-ColoixMetLiver 


3245 


552006 


1946.C06.gz43 249388 


F 


M00054527B:A07 


UC2-ColonMetLiver 


3246 


551157 


1946.E06.gz43 249390 


F 


M00054529C:D11 


UC2-ColoixMetLiver 


3247 


559574 


1946.G06.gz43 249392 


F 


M00054532D:A01 


UC2-ColonMetLiver 


3248 


556216 


1 946 .106 .gz43_249394 


F 


M00054534D:D02 


UC2-ColonMetLiver 


3249 


559676 


1 946 .B07 .gz43_249403 


F 


M00054525D:H05 


UC2-ColonMetLiver 


3250 


550018 


1946.J07.gz43 249411 


F 


M00054536B:A03 


UC2-ColoiiMetLiver 


3251 


526984 


1946.G08.gz43 249424 


F ; 


M00054532D:E07 


UC2-ColonMetLiver 


3252 


481594 


1946N09gz43 249447 


F 


M00054541C:F11 


UC2-ColonMetLiver 


3253 


550618 


1946P09ffz43 249449 


F 


M00054544B:E03 


UC2-ColonMetLiver 


3254 


454336 


1946 A10 ez43 249450 


F 


M00054523D:G09 


UC2-ColonMetLiver 






1946J10ez43 249459 


F 


M00054536B:D02 


UC2-ColonMetLiver 


3256 


464154 


1946 M10 ez43 249462 


F 


M00054540A:H07 


UC2-ColonMetLiver 


3257 


550475 


1946 Bll ez43 249467 


F 


M00054526A:E04 


UC2-ColonMetLiver 


3258 


584071 


1946Fllez43 249471 


F 


M00054531D:F05 


UC2-ColonMetLiver 


3259 


550730 


1946 Jll ez43 249475 


i F 


M00054536B:F08 


UC2-ColonMetLiver 


3260 


552638 


1946 012 ez43 249496 


F 


M00054543B:E06 


UC2-ColonMetLiver 


3261 


560014 


1946 L13 ez43 249509 


F 


M00054538D:H11 


UC2-ColonMetLiver 


3262 


553338 


1946 114 az43 249522 


F 


M00054535A:G03 


UC2-ColonMetLiver 


3763 


556809 


1946 B15 ez43 249531 


F 


M00054526B:C09 


UC2-ColoiiMetLiver 


3264 


498433 


1946 H15 ez43 249537 


F 


M00054534A:B06 


UC2-ColonMetLiver 




455821 


1946 K16 ez43 249556 


F 


M00054538A:E10 


UC2-ColonMetLiver 


3766 


82864 


1946J17ez43 249571 


F 


M00054536C:D12 


UC2-ColonMetLiver 


3267 


560519 


1946 LI 7 ez43 249573 


F 


M00054539B:D06 


UC2-ColonMetLiver 


3268 


409262 


1946N17ez43 249575 


F 


M00054542B:B01 


UC2-ColonMetLiver 


3269 


560717 

•J \J\J / X / 


1946 P17 sz43 249577 


F 


M00054544C:F04 


UC2-ColonMetLiver 


3270 


550172 


1946.K18.gz43_249588 


F 


M00054538B:B05 


UC2-ColonMetLiver 


3271 


552753 


1946.018.gz43_249592 


F 


M00054543C:F01 


UC2-(^lonMetLiver 


3272 


549734 


1946.L19.gz43_249605 


F 


M00054539B:G03 


UC2-ColonMetLiver 


3273 


550714 


1946.G20.gz43_249616 


F 


M00054533B:F04 


UC2-ColonMetLiver 


3274 


585380 


1946.D21.gz43_249629 


F 


M00054529A:H01 


UC2-ColonMetLiver 


3275 


550204 


1946.F21.gz43_249631 


F 


M00054532B:C05 


UC2<:olonMetLiver 


3276 


560205 


1946.I22.gz43 249650 


F 


M00054535B:H08 


UC2-ColonMetLiver 
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1NSDOCIO: <WO_0214500A2_I_> 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE H) 


LIBRARY 


3277 


312036 


1946.E23.gz43 249662 


F 


M00O5453ODC1O 


TTP9-r , r»1mi"M'f i tT iv£»r 

UVyX» VyvJlvJlXLVXCtX-/! VC1 


3278 


553002 


1946E24ez43 249678 


F 


M00054530DH07 


T TP?-Pn1nnA/fpfT iwr 

Vj VyZ. V_y\JlvJlJLLVJ.CLX-»l VCI 


3279 


394567 


1948 O01 ez43 249704 


F 


M00054583D-E04 


Vy \y j£f V^L/1 v/111 VAC LX-fl V CI 


3280 


468613 


1948 C02 az43 249708 


F 


M00054569DE01 


T TP? -Cnl nn A/TpfT ivpr 

Vy V^^> V-HJlvJlXLVXCLX-/ X V CI 


3281 


378610 

m* i v vy X \J 


1948 102 sz43 249714 


F 

X 


M00054576D • T07 


U \y^" V^UlvJlLLVxCLJyl VCx 


3282 


559752 


1948 M02 ez43 249718 


F 


M0005458 1 E* A0 1 

1V1UUWJTJ O X XJ .yvVy x 


%J VyZ. V^UlL/lliVXCti—fl V CI 


3283 


550804 


1948 K03 ez43 249732 


F 


M00054579AG1 0 


T TP? -CnlrmlVFptT ivf^r 

Vy Vy^ Vy V/XVrXXLVXClX-f X V CI 


3284 


549956 


1948 O03 ez43 249736 


F 


M000545 84 A- A07 


TTP9-Pn1nnA/lf*t-T ivf*r 

Vy v>y_» Vy vJIvJJXiVXCLI—il VCI 


3285 


556065 


1948 A04 2z43 249738 


F 


M00054567CB03 

i.vxv/ vy vy^"T»y vi # Vy 


Vy \yi Vy vJivJlliVXC U-yl V CI 


3286 


550397 


1948M04sz43 249750 

X y^O.XVXVJ"T..£ j X/"Ty Z»*Ty / — 'VJ 


F 

X 


M000S4^ 8 1 R*D0^ 

1V1V/WUJTJ O lU.L/vJ 


VJ Vy " Vy UIL/IIiVIC LJ-yl V CI 


3287 


550874 


1948 M05 ez43 249766 


F 


M00054581 RG1 0 

J. » x vy V/ \y y T y vJ XXJ . VJ X Vy 


T TP9_Pn1nnA/TptT iv^r 

\J Vwa» VyVylVfXXLVXCLlyl VCI 


3288 


533812 


1948 O05 ez43 249768 

x r vj . v/u^' .fjf i~y r y / vy 


F 


M00054584AB03 

ItIv \y vy y ~y UT/v. JJ Vy y 


TTP9-Po1ntiMptT ivf»r 

V Vy^ V^VylvJXXiVXOlJyX V CI 


3289 


549597 


1948 C08 ez43 249804 


F 


M00054570A-F02 


T TP? -Pnl nnA/f ptT ivpr 

VJVy^ V_AJ±v/xx±VIC LJyl V CI 


3290 


553749 


1948 K08 2z43 249812 


F 


M00054579RD1 0 

1VJLvV/V/J*TJ / yjy.xy XVy 


TTP?-P<i1onMptT iver 

UV/Z. V^vJlvJXXLVXCLXyl VCI 


3291 


446814 


1948G10ez43 249840 

X y"TVy. VJ lU.gLiTJ Z^^T Z* ty~ Vy 


F 


M00054574DF1 1 


T TP9-Pn1nn\f p*T tvf*r 

Vy VyZ» V^vJlvJlLlVICLXyl VCI 


3292 


555000 


1948 M10 ez43 249846 

A y 7 i U . i.YX X Vy . J^iLjTy ^» i y o~ w 


F 


M00054581DC12 

xYXVyV/V/y T«y O 11/, Vy X^> 


TTP?-Pn1nnM>fT ivf»r 

VJ Vy V^VJlVJJLLLVXCLIyl V CI 


3293 


554887 


1948 O10 ez43 249848 


F 


M00054584BA03 

XTXvy \y v/y » y vj~xj . urxvyy 


T TP2-Cn1fmMefLi ver 

Vy V^ y- v_* uiuixi vxc U-y J. v Vyl 


3294 


561876 


1 948 El 1 ffz43 249854 


F 

X 


M000^4S7?r*F09 


T TP 9 _Pr»1 on TVyTRtT iv<»r 

VJ v>Z> Vy VJlVJlllVXC LJyl VCI 


3295 


556488 


1948 Ml 1 ez43 249862 


F 


M00O5458 ID DO 1 

iTiv vv>/ w ljy . iy vy x 


T TP?-Pfi1onTVTetT ,ivpr 

VJ Vy Vy VJ1 VJXXLV XC 1 .1 ll V CI 


3296 


560080 

*J KJ\J\JV3 Vy 


1948 G12 cz43 249872 


F 


M00054575AB06 

J.VX V/ V/ Vy y i y / y A . Xy Vy W 


TTP?~Pn1ntiMpfT twr 

Vy Vy ^> Vy VJlUiXlVXCLXyl VCI 


3297 


556336 


1948 Ml 2 ez43 249878 


F 1 


M00054581DE04 


T TP?-Pn1nnTVfptT iver 

VJ VyX. VyVJlVJJLXXVXCLXyl VCI 


3298 


558729 


1948 113 ez43 249890 


F 


M00054577BA09 

xvxvy vy vy y ~y / t xj .Auy 


T TP?-Pn1nnMpfT ,ivpr 

VJ VyXrf Vy VJlVJxLLVXCUly 1 V CI 


3299 


549716 


1948 A14 ez43 249898 

x y i o .A x~»^j«r<"»y yoyo 


F 


M00054568AG02 


T TP?-Pn1nnMptT Jvpt 

VJ Vy^r VyVJlvJXXLVXCLXyX VCI 


3300 


568467 


1948 M14 ez43 249910 

iy~U)iTxi~.gtiTj ry y x \y 


F 


M000545 82 A- AOS 


T TP2-Pn1riTiMel-T >iver 

VJ VyZ> Vy VJ 1 VJxXJL VXC LXy X V CI 


3301 


550694 

-J Vy V/ y™ 


1948 N14 ffz43 24991 1 


F 

X 


M00054583AF05 


T Tp 9 _Pn1 nnA/fptT ivpt 

VJ VyZr ~ VyVjlVjlIiVlCULyl VCI 


3302 


549994 


1948 M15 sz43 249926 


F 


M000545 82 A * A07 


TTP?-Pf>1finMf*fT ivpr 

VyVyZ- VyvJIUXXiVXCLX-/! VCI 


3303 


549858 


1948 A16 ez43 249930 


F 


M00054568AH03 


TTP9-Pn1onlVjfptT iwr 

VJ Vy ^— Vy VJ1VJ XXI V XC 1 I. il VCX 


3304 


554793 


1 948 A 1 7 ©*t43 24QQ46 


V 

X 


MO0OS4^68B- A07 

XVXwVJWy "y VJOXJ .yriLV/ / 


T TP? -Pnl nnlVlptT ivpr 

VJ Vy ^i. Vy VJ1 vJXXLVICLXyX V CI 


3305 


466020 


1948 017 d-743 949960 


T7 

X 


M000S4S84DP01 


T TP?-Pn1nnTv/TF k tT ivpr 

VJ Vy Zr "* Vy vJlVJlilVlC ULyl VCI 


3306 


509202 


1948 T1Q crz4^ 949Q86 


X 


M000S4S77BF0 1 

IVXVy VJVyy T^y / / XJ.x VJ 1 


T TP? -Pr*1 rinlxjTetT ivpr 

Vy 1 VyZ» VyVjlvJXXiVICLXyl VCI 


3307 


450963 


1948A21az43 250010 


F 


M00054^68r-D06 


TTP2-ColnnMetT iver 

VJ Vy Z»^y VJXVJIXLVXCLXyl V CX 


3308 


451049 


1948 P91 w43 2^0019 


F 


M000S4S7 1 A B 1 0 

1VXW v/vjy *ry / irv.xj x VJ 


TTP?-Pn1nnMpfT ivpr 

VJ Vy VyvJlVJXXLVXCLX— rX V CX 


3309 


528404 

y X» U r VJ" 


1948 K21 az43 250020 


X 


MO0OS458OA-B1 1 


T TP?-Pn1onMetT jver 

VJ Vy^ Vy VJXVJJJJ- VXC LXw X V CI 


3310 


160320 


1948 091 0743 9 S 0094 


F 

X 


M000S4S8SA-F07 


TTP?-Pn1onMptT ivpr 

VJ Vy A* Vy VJJ.vJXXLTXCLXyX V CX 


3311 


492139 

i-^^ X y y 


1948 K22 0z43 250036 


F 

X 


M00054580A-C1 0 

xvxvy VJVyy *ry O V/yTV. Vy X Vy 


TTP?-Po1oniVfptT -ivpr 

VJ Vy^ Vy WlVJAAJLVXt/LXyl V CX 


3312 


557656 


1948.A23.gz43_250042 


F 


M00054568C:G12 


UC2-ColonMetLiver 


3313 


570248 


1948.K24.gz43_250068 


F 


M00054580A:D04 


UC2-ColonMetLiver 


3314 


561836 


1957.G01.gz43_250080 


F 


M00054594A:D04 


UC2-ColoiiMctLiver 


3315 


558332 


1957J01.gz43_250083 


F 


M00054597A:C11 


UC2-ColonMetLiver 


3316 


558981 


1957.J02.gz43_250099 


F 


M00054597A:D08 


UC2-(yOlonMctX,iver 


3317 


535129 


1957.B03.gz43_250107 


F 


M00054587C:G02 


UC2-ColonMetLiver 


3318 


549781 


1957J03.gz43_250115 


F 


M00054597A:G08 


UC2-ColonMetLiver 
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3NSDOCID: <WO 021 4S00A2_I_> 



WO 02/14500 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


3319 


555920 


1957.C04.gz43 250124 


F 


M00054589A:D09 


UC2-ColonMetLiver 


3320 


550044 

«_/ «y vy vy i r 


1957 E04 ez43 250126 


F 


M00054591C:A01 


UC2-ColonMetLiver 


3321 


447356 


1 957.104 .gz43J250 1 3 0 


F 


M00054596B:C01 


UC2-ColoixMetLiver 


3322 


560751 


1957 A05 ez43 250138 


F 


M00054586C:H02 


UC2-ColonMetLiver 


3323 


550322 


1957 B05 ez43 250139 


F 


M00054587D:D04 


UC2-ColoiiMetLiver 


3324 


416884 


1957N05sz43 250151 


F 


M00054602A:E06 


UC2-ColonMetLiver 


3325 


559857 


1957.A06.gz43 250154 


F 


M00054586D:A03 


UC2-ColoiiMetLiver 


3326 


446164 


1957 C06 ez43 250156 


F 


M00054589B:A07 


UC2-ColonMetLiver 


3327 
~j ^ i 


550088 


1957D06az43 250157 


F 


M00054590B:B11 


UC2-ColonMetLiver 


3328 


554764 


1957 E06 ez43 250158 

X s *J I .x—/ vy Vy . fj« it «y _ ^ 


F 


M00054591C:H09 


UC2-ColonMetLiver 


3329 


552907 


1957O07ez43 250184 


F 


M00054603A:G06 


UC2-ColonMetLiver 


3330 
~j -j ~j \j 


498827 


1957 D08 ez43 250189 


F 


M00054590C:A03 


UC2-ColonMetLiver 


3331 


498777 


1957 G08 ez43 250192 


F 


M00054594C:E06 


UC2-ColOxxMetLiver 


^39 


559883 


1957 L08 €tz43 250197 


F 


M00054599D:B07 


UC2-ColonMetLiver 


^^^^ 


554869 


1957 M08 ffz43 250198 


F 


M00054600D:H07 


UG2-ColonMetLiver 


^34 


SS9419 

JJ7T 17 


1957A09az43 250202 


F 


M00054586D:G01 


UC2~ColoiiMetLiver 




473859 

*T / JOJ7 


1957 F09 ffz43 250207 

lyj / .a i/y.fti i J ^i»»y vy^rvy / 


F 


M00054593B:B03 


UC2-ColonMetLiver 




SS5949 


1957 M09 pz43 250214 


F 


M0005460 1 A:E08 


UC2-ColonMetLiver 


^^7 


S5991'} 


1957O09ffz43 250216 


F 


M00054603A:G12 


UC2-ColonMetLiver 




55 1463 


1957 CIO ez43 250220 

X J7 »y / ,Vylv.g£iT^ z<yv^u 


F 


M00054589B:F05 


UC2^olonMetLiver 


^39 


550855 


1957 Bll ez43 250235 

X J7^y / . XJ X X .g4»Ti/_4»»/ \/x»-y*y 


F 


M00054588A:G08 


UC2-ColonMetLiver 


•J -7 "TV/ 


526733 


1957 Oil ez43 250248 


F 


M00054603B:B04 


UC2-ColonMetLiver 


3341 

•7-7"T X 


557970 


1957 F12 ez43 250255 

x-y^y / .x i£<igu . y «y 


F 


M00054593B:E03 


UC2-ColoiiMetLiver 


334? 


549665 


1957 H12 ez43 250257 


F 


M00054595C:G06 


UC2-ColonMetLiver 




446503 

■ f • i vj~y vy _y 


1957 M12 ez43 250262 


F 


M00054601B:D08 


UC2-ColonMetLiver 


3344 


549599 


1957P12ez43 250265 


F 


M00054604C:F03 


UC2-ColonMetLiver 


3345 
«y 


558413 


1957 A13 ez43 250266 


F 


M00054587A:A08 


UC2-ColonMetLiver 


^46 


553331 


1957 B13 £z43 250267 

x *y «y / . X-f X «y . ~ 


F 


M00054588B:B12 


UC2-ColonMetLiver 


3347 


549609 


1957 114 ffz43 250290 


F 


M00054596C:F09 


UC2-ColonMetLiver 


^348 


455998 


1957 115 £z43 250306 


F 


M00054596CF10 


UC2-ColoiiMetLiver 


^49 


563514 

JUJJ J. » 


1957P15ez43 250313 

X-y^y/.X X«y ,Q/ir-J^£~,^/\J~J X»y 


F 


M00054604D:A04 


UC2-ColonMetLiver 




471 181 


1957 A1 6^743 750314 


F 


M00054587AF06 


UC2-ColonMetLiver 


^51 

JJJ J. 


487176 


1957N16ez43 250327 


F 


M00054602CC12 


UC2-ColoiiMetLiver 




lift 


1 957 Tl 7 az43 250338 

17J / ,11 / .£jA<T-y Z.w> Vy J *y t> 


F 


M00054596CG09 


UC2-ColonMetLiver 




4.6171 8 

HOI / AO 


1957 T17 oz43 950^39 

X ✓J / . J X / ,gZi"T«y Z-JUJJ ,y 


F 


M00054597DE12 


UC2-ColonMetLiver 


3354 


559554 


1957.L17.gz43_250341 


F 


M00054600A:G12 


UC2-CoIonMetLiver 


3355 


452506 


1957.A18.gz43_250346 


F 


M00054587A:F09 


UC2-ColoixMetLiver 


3356 


551975 


1957.D18.gz43_250349 


F 


M00054590D:C12 


UC2-ColonMetLiver 


3357 


485029 


1957.H18.gz43_250353 


F 


M00054595D:A10 


UC2-ColonMetLiver 


3358 


549816 


1957.L18.gz43_250357 


F 


M00054600A:H03 


i UC2-ColonMetLiver 


3359 


408130 


1957.O18.gz43_250360 


F 


M00054603C:G09 


I UC2-ColoiiMetLiver 


3360 


558927 


1957.P18.gz43_250361 


F 


M00054605A:A03 


I UC2-ColoiiMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTEF 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


3361 


550217 


1957 B19 sz43 250363 


F 


M00054588C-F04 

lVlVVVJtJOOV/,1 \/*T 


w ~ V^UiUIxIVXCLXjX vex 


3362 


556376 


1957 C20 ez43 250380 


F 

1 


M00054589DP06 

IVXV/ Uv J *T J O 7XJ . VJ VJ 


TTr , ?-Pr*1nnA4'ptT ivpr 

lJ V^Z. ~V^IJ10JXLVXCULj1 VCI 


3363 


549964 


1957 E20 ez43 250382 


F 

X 


M00054592P ■ A05 

XVXV/V/V/ J*T J 7i\^ ,.iV\J J 


T TP? -Col nn TVfp+T ivpr 

Uv/Z, V^V^lUiliVXwLXvXVCX 


3364 


549228 


1957 K20 sz43 250388 


T7 

X 


M00054599BP06 

1V1vv/V/J*tJ77xJ.V/vv 


TTP?-Prk1nn"MV>tT i\;pr 

vJ V^Z~V^01UIliVXCUL/lVCl 


3365 


459581 


1957 L20 ez43 250389 


F 


M00054600CB 1 0 


T TP?-Po1fiTiTVlptl .ivpr 

UV/Z, V_/\JlVJxliYXC LX-/1 V d 


3366 


562722 


1957 N20 ez43 250391 


F 


M00054602D * A 06 


T IP?-Pr»1nn1VffttT ,i ver 

«J V^v/xV/XIXVXWIXjXVv71 


3367 


552121 


1957 020 ez43 250392 


X 


M00054603DB02 


vJ V^Z,~V^vJlL/xxiVXCLL*l VCI 


3368 


4473 1 1 


1957 C?3 sz43 250428 


F 


M00054590 A- AO 1 

XYXWV/ J"T J 7 UA.AU X 


T TP? -Pnlnn TVTp1T .iver 


3369 


467035 


1957 123 ez43 250434 


F 


M00054597AA07 


T JP2-Pr»1nnMefT .iver 

V— ' V/l ViXIXVX V/ I.I <X V vl 


3370 


549611 


1957 K23 ez43 250436 


F 


M00054599BF09 


UC2-Po1onMetLiver 


3371 


54973 1 


1957 H24 ez43 250449 


F 


M00054596 AG1 1 


UC2-Co1onMetL,iver 

W X# V-> V/lV/XlXVXw I t /X V vl 


3372 


380127 


1957 P24 ez43 250457 


F 


M00054605BF10 

lflVVVk/~vVJU,l X \J 


UC2-Po1nTiMetT>iver 


3373 


405042 


1958 H01 ez43 250465 




M00054614BE01 


T TP?-Pn1nnlVTptT iver 

UvZi V^UxlJ'XXLVXvLX-fX V vl 


3374 


552418 


1958 J01 ez43 250467 


F 

X 


M000546 1 6D-r 1 0 

lVlV/vv/ J*7U X UL/.V1V 


TTP?-Pn1nnMf*fl ivpr 

^J V^^Cr V^/ OlV-ZlXi VXt/ LX-rl V vl 


3375 

J J 1 J 


55864? 


1958 N01 <r741 ?S0471 


X 


M 000546? 1 0-^06 


TTP?-Pn1r«iMH'T iver 

vJ v^L/lvJXllVXClX-/l Vvl 


3376 


549649 


1958 A0?az43 250474 


F 

X 


M00054605BG0 1 


TTP?-Pn1r«iMptT .iwr 

vJ Vw^ ^ V-/ UxU'XLLtXv/ LX-fl Vvl 


3377 


557100 

J J / J. 37V/ 


1Q58 T30? a743 750475 


F 1 
x^ 


M00054606D-B05 

1VXV/VI \J J *T U V/ UXJ . X* V/ J 


TTP7-Pn1onT\/rptT ivpr 

U V^wlr v^(JlvXllVXCLi«rX VCI 


3378 

J J / o 


549370 


1Q58 TM0? dt41 750487 

X 7 J O . ±> V/X. .gitJ X* J UtO / 


X 


TVT000546?1DTI1 1 

lVXwWWj*T\JjC#XX-/.i-/X X 


TTP?_Pr»1nnTV/TRtT iver 

lj V/ Z/ vJlvJJxiVXC LX-/1 V vl 


J J I 7 




1QS8 T01 ot4^ ?S0400 

i7JO.JvJ.gitJ iJu i r77 


F 
r 


M000S461 6TYfrOQ 


T TP9-Pr»1nnlV/fp'tT iver 


3380 


561 1 16 


1958 M03 ff743 25050? 

X 7J O .Ivxv J .g^TJ ZrJ V/ J v/Z, 


F 

X 


M000546?0r>-n06 

1VIUvV/J*tU^V/X/.X/vU 


T TP?-Pn1r»nA/fetT iver 

KJ VZ> VJUJlllVXV/ LX-#1 V vl 


3381 

J JO X 


4731 1 1 


1958M04(*743 750518 

17JO .JLYXWr ,gZ.*T J Z.JV/J X O 


F 

X | 


M000546?ODT>1 1 

xVXwV7\» J*rLI^V/X-/.X-r X X 


T TP? -Pnlnn"M"etT iver 

\J V/ii. "V^lJUJxXLVXw tX^l VCX 


3382 

J J OjZ* 


558103 


1958 P04 az43 7^0571 

X 7 J O .X uT.gtjTJ AfJvJL X 


F 

X 


TVT000546? 5 A T)07 

1V1V7 V/Vf J*TViZr JX\.I/ V / 


T TP? -Pr»1onA/TpfT .iver 

VJ Vy^ V^tIJIvXXLtXvIXvI V vl 


^8^ 


548864 


1958 A05 ffz43 ?505?? 

17JO.AuJ.gtiTJ Z,JUJZ,Z. 


F 

X 


M00054605f t - A 04 

lVlUUUJtUvJV>.AUt 


T TP? -PnlnnTVf ptT iver 

VJ V_> ^ Wl will VX V LX-/1 V vl 


3384 

J J 0*T 


56086? 


1958R05ffz43 7505?^ 

17JO.JJvJ.gCTJ <6JUJZ.J 


F 

X 


M00054607AB06 


TTP?-Pn1nnlvfetT iver 

VJ V_^^ V> V»l*JlxiVXC/LX_/l V vl 


3385 

JJ O — J 


557199 

J w/ / A 7 7 


1958P05&743 2 50537 

17JO.rvJ.g£i > TJ -6JV/J J / 


F 

X 


M 000546? 5 A -F05 

1VX \J \J \J J *T vl J -ZTL . U/ V/ J 


T TP?-Po1nnTVfetT .iver 

VJ Vy^ V^*JlUlli.VXv LXjI V vl 


3386 


188753 

X OO / J J 


1958A06ffz43 250538 

17 J O .xlUU .gZ.*T J ^<JUJJU 


F 

X 


M00054605C-D03 


T TP?-Pn1nnTVletT .iver 

VJ v^ V-' vlV^xll VXt>lX-/l Vvl 


3187 


550177 


1Q58 G06 ^74^ ?50544 


F 

x^ 


TVT0005461 ^ ATKV7 

lVXWUV , J*rt/X Ji^V-.X-Zv / 


T TP?-Pr»1r»nA/fetT iver 

VJ V^WIVJIXIVXvLXjI V vl 


3388 

j j o o 


556896 

JJUU7U 


1958 M06 ffz43 250550 

J.7J0.1VlV/U,gii*TJ X» J V/ J J \J 


F 

X 


M00054620DF1 1 


T TP ? -Pn1 nnTVfetT .iver 

VJ V-/ U 1 L/1X1 VX V Uul Vvl 


3389 


552823 

J J O.^, J 


1958R07az43 250555 


1 F 

X 


M00054607A-rS02 


T TP2-Po1oTiMetT J iver 

VJ V./ ft \s VJXVJxU. VX V 1.1 /I V VI 


3390 

J J V/ 


395411 

•J / J" X X 


1958 M07 ez43 250566 

17JO. J.VXV/ / .gtiTJ Z.JUJUU 


F 

X 


M00054620DG1 1 


T IP2-Pn1on Met T ,iver 

VJ Vi» \~s WlVJxxl VX V LX_fX V VX 


3391 


10397 

X\/ J 7 / 


1958 N07 ffz43 250567 

17JO,l , iv# .gbTJ X-Jv/JV / 


F 

X 


M00054622AD06 


TJP2-Pn1nnMe*Liver 

VJ \^s^* VI vXXLVXV 1 1 / J. V VI 


3392 


551250 


1958P07&74^ 750569 


F 
i 


M00054675BB02 


T IP? -PnlnnA/letl .iver 

VJ V^< vl VIXLVXV l«Xwl V vl 


3393 


561892 


1958 C08 ez43 250572 

x yjo.wu,g£i*rj £>jvj / x. 


F 


M00054609AF0 1 

lflUV/\/J*tU\/7/x.l v/X 


UC2-ColonMetLiver 

VJ vx vxlxVX V UJl V VI 


3394 

J J 7" 


549347 


1958F08&743 750575 

X 7 JO .X UU.gZtTJ Z.JV/J / J 


F 

X 


M000546 1 ? A -D 1 2 

IVXVfV/V/ J*tU 1^.^1..J7 1 ^ 


T IP? -Polnnlvf etl .iver 

l/V<C» VxVlVUllVXVLxwlVvl 


3395 

JJ7J 


561068 

jVJ 1 VJVIO 


1Q58 N08 &743 75058^ 


F 


M000546?? A TT0 1 


TTP7-Pn1nriivfetT iver 


3396 


549912 


1958.C09.gz43 250588 


F 


M00054609A:H04 


UC2-ColonMetLiver 


3397 


549114 


1958.F10.gz43_250607 


F 


M00054612B:B11 


UC2-ColonMetLiver 


3398 


548965 


1958.J10.gz43_250611 


F 


M00054617B:A09 


UC2-ColonMetLiver 


3399 


491644 


1958.N10.gz43_250615 


F 


M00054622B:F05 


UC2-ColoiiMetLiver 


3400 


550018 


1958.G1 l.gz43_250624 


F 


M00054613B:H04 


UC2-ColonMetLiver 


3401 


556530 


1958.C12.gz43_250636 


F 


M00054609B:E01 


UC2-ColonMetLiver 


3402 


562749 


1958.L12.gz43_250645 


F 


M00054619D:C10 


UC2-ColotxMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


OWE 
NT 


CLONE ID 


LIBRARY 


3403 


556308 


1958.N12.gz43_250647 


F 


M00054622B:H09 


UC2-ColonMetLiver 


3404 


551305 


1958.O12.gz43_250648 


F 


M00054623D:C12 


UC2-ColonMetLiver 


3405 


507349 


1958.C13.gz43_250652 


F 


M00054609B:H11 


UC2-ColonMetLiver 


3406 


549626 


1958 E13 ez43 250654 


F 


M00054611B:F12 


UC2-ColonMetLiver 


3407 


549388 


1958.J13.gz43 250659 


F 


M00054617B:D06 


UC2-ColonMetLiver 


3408 


478511 


1958.E14.gz43_250670 


F 


M00054611B:G09 


UC2-ColonMetLiver 


3409 


555371 


1958.K14.gz43_250676 


F 


M00054618C:H02 


UC2-ColonMetLiver 


3410 


553758 


1958.115.gz43_250690 


F 


M00054616A:H01 


UC2-<^lonMetLiver 


3411 


553204 


1958.L15.gz43_250693 


F 


M00054620A:C09 


UC2-ColonMetLiver 


3412 


497477 


1958.H16.gz43_250705 


F 


M00054615B:E03 


UC2-ColonMetLiver 


3413 


481360 


1958.116.gz43_250706 


F 


M00054616A:H03 


UC2-ColonMetLiver 


3414 


552437 


1958.K16.gz43_250708 


F 


M00054618D.D04 


UC2-ColonMetLiver 


3415 


550164 


1958.O16.gz43_250712 


F 


M00054624A:B11 


UC2-ColonMetLiver 


3416 


555660 


1958.E18.gz43_250734 


F 


M00054611C:E01 


UC2-ColonMetLiver 


3417 


562932 


1 95 8 .N 1 8 .gz43_25 0743 


F 


M00054622D.C02 


UC2-ColonMetLiver 


3418 


490393 


1958.B19.gz43_250747 


F 


M00054608B:D08 


UC2-ColonMetLiver 


3419 


490890 


1 958 .K19.gz43_250756 


F 


M00054618D:E06 


UC2-ColonMetLiver 


3420 


451009 


1 95 8 .K20 .gz43_25 0772 


F 


M00054618D:E11 


UC2-ColonMetLiver 


3421 


552194 


1958.L20.gz43 250773 


F 


M00054620B.B10 


UC2-ColonMetLiver 


3422 


451025 


1958.M20.gz43_250774 


F 


M00054621C:G03 


UC2-ColonMetLiver 


3423 


551463 


1958.N20.gz43_250775 


F 


M00054622D:D10 


UC2^oloiiMetLiver 


3424 


553028 


1958.G21.gz43 250784 


F 


M00054613D:H09 


UC2-ColonMetLiver 


14? S 


485504 


1958N21ez43 250791 


F 


M00054622D:F05 


UC2-ColonMetLiver 


3426 


562712 


1958.P21.gz43 250793 


F 


M00054625D:F06 


UC2<:olonMetLiver 


3427 


490152 


1958 C22 gz43 250796 


F 


M00054609D:E12 


UC2-ColonMetLiver 




552364 


1958F22ez43 250799 


F 


M00054612D:B12 


UC2-ColonMetLiver 


3429 


448276 


1958 L22 cz43 250805 


F 


M00054620C:C08 


UC2-ColonMetLiver 






1958N22ez43 250807 


F 


M00054622D:G1 1 


UC2<:olonMetLiver 


3431 


461734 


1958 P22 az43 250809 


F 


M00054625D:H07 


UC2-ColonMetLiver 


3432 


557947 


1958 H23 ez43 250817 


F 


M00054615C:D09 


UC2^olonMetLiver 


3433 


560491 


1958 L23 ez43 250821 


F 


M00054620C:C12 


UC2-ColoiiMetLiver 


3434 


554810 


1958.O23.gz43_250824 


F 


M00054624D.B06 


UC2-ColonMetLiver 


3435 


549829 


1958 C24 ez43 250828 


F 


M00054609D:H06 


UC2-C^loiiMetLiver 




518049 


1958 M24 ez43 250838 


F 


M00054621D:A10 


UC2-ColonMetLiver 


1437 


484669 


1777O01ez43 251861 


F 


M00042831D:G06 


UC2-PrimColon 


3438 


537506 


1777.C03.gz43 251881 


F 


M00042811C:B06 


UC2-PrimColon 


3439 


545162 


1777.F03.gz43 251884 


F 


M00042816B:C08 


UC2-PrimColon 


3440 


456793 


1777.Cll.gz43_252009 


F 


M00042812B:C01 


UC2-PrimColon 


3441 


448340 


1777.Ell.gz43 252011 


F 


M00042815C:C02 


UC2-PrimColon 


3442 


448793 


1777.E13.gz43 252043 


F 


M00042815C:F10 


UC2-PrimColon 


3443 


449718 


1777.H14.gz43 252062 


F 


M00042819C.H02 


UC2-PrimColon 


3444 


735534 


1777.E17.gz43_252107 


F 


M00042816A:C09 


UC2-PrimColon 
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SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


OWE 
NT 


CLONE ID 


LIBRARY 


3445 


449258 


1777.K17.gz43J252113 


F 


M00042825AB05 


UC2-PrimColon 


3446 


485020 


1777.018.gz43 252133 


F 


M00042833CG05 


UC2-PrimColon 


3447 


649735 


1777.A23.gz43 252199 


F 


M00042809DC 12 


V** v^Z^ x 1 Xlxlv^V/xV/Ix 


3448 


498504 


1923.B01.gz43 252616 


F 


M00042352DF11 


UC2-ColonMetLiver 


3449 


424723 


1923.C01.gz43__252617 


F 


M00042355AH09 


UC2-ColonMetLiver 


3450 


451124 


1923 E01 ez43 252619 


F 


M00042444CE02 


UC2-ColonMetLiver 


3451 


553312 


1923 O01 gz43 252629 


F 


M000425 1 6DH08 


UC2--ColoiiMetLiver 


3452 


257547 


1923.D02.gz43 252634 


F 


M00042442BA07 

JLTJL\^ 1 ^ I ■ 1 ^«JL 3k \/ / 


UC2-ColonMetLiver 


3453 


492292 


1923 J02 gz43 252640 


F 


M00042455BG09 


UC2-ColoiiMetLiver 

V-/ VyviViUTlVVlJi V VI 


3454 


638444 


1923 O02 ez43 252645 


F 


M000425 16DH09 

1TXW V/ ■ X V/JL/ .X X\/ ^ 


UC2-ColonMetLiver 


3455 


451009 


1923 C03 ez43 252649 


F 


M00042355BB07 


UC2-ColonMetLiver 

V-/ VVlVfiUVlv l>X_i£ V VI 


3456 


451518 


1923 .103 .gz43 252655 


F 


M00042453CD12 

J. ▼ JL v V/ Vy • 9**S # JL^ JL 


UC2-ColonMetLiver 


3457 


450255 


1923 J03 ez43 252656 


F 


M00042455CD 1 1 

J.TX V X/ X</ a Xw^ • JL JL 


UC2-CoIonMetLiver 


3458 


556804 


1923 M03 ez43 252659 


F 


M00042460BD 10 


UC2-ColoiiMetLiver 


3459 


497400 


1923 C04 ez43 252665 


F 


M00042355BE10 


UC2-ColoiiMetLiver 


3460 


451618 


1923 H04 ez43 252670 


F 


M0004245 1DH10 


UC2^oloiiMetLiver 


3461 


490890 


1923 J04 sz43 252672 


F 


M00042455CE04 


UC2-ColonMetLiver 


3462 


492982 


1923 P04 ez43 252678 


F 


M000425 19DH07 


UC2-ColoiiMetLiver 


3463 


498777 


1923 A05 ez43 252679 


F 


M00042351CG01 

X T JL V/ X*f X/ ffMk/ JL X/ • X/ X 


UC2-ColoiiMetLiver 


3464 


493575 


1923 B05 ez43 252680 


F 


M00042353BA1 1 


UC2-ColonMetLiver 


3465 


556916 


1923.L05.gz43 252690 


F 


M00042459B:F03 


UC2-ColoiiMetLiver 

Xsh/ X— ^ x^ vy AxyjuLi, t jl w l« i /x ▼ xvx 


3466 


551755 


1923 N05 ez43 252692 


F 


M00042460DH06 

xt jl vy vy \/t^a*~w JL-' • jl jl v/ xy 


UC2-ColOxxMetLiver 


3467 


495074 


1923 O05 sz43 252693 


F 


M000425 17CB04 


UC2-CoIonMetLiver 

^^^^ WIVUiTlVliUl T VX 


3468 


487745 


1923 E06 ez43 252699 


F 


M00042445A:B04 


UC2-ColoiiMetLiver 


3469 


448663 


1923 F06 ez43 252700 


F 


M00042447C HI 0 


UC2-ColOxiMetLiver 

V>" ^ V/V1V1U TX VI. 1 1 ■ ▼ vx 


3470 


468729 


1923 L06 ez43 252706 


F 


M00042459BF12 


UC2-ColonMetLiver 


3471 


451470 


1923 C07 ffz43 252713 


F 


M00042355C-G09 


UC2-ColonMetLiver 


3472 


489426 


1923 K07 ez43 252721 


F 


M0004245 7D ■ D02 


UC2^oloiiMetLiver 


3473 


450302 


1923 F08 ez43 252732 


F 


M00042447D-E04 


UC2-ColonMetLiver 


3474 


559575 


1923 M08 ez43 252739 


F 


M00042460BG12 


UC2-ColoiiMetLiver 


3475 


496446 


1923 A09 ez43 252743 


F 


M0004235 1DD02 


UC2-ColonMetLiver 


3476 


492110 


1923 F09 ffz43 252748 


F 


M00042447D *G 1 0 


UC2 -ColonMetLiver 


3477 


495941 


1923 G09 ez43 252749 


F 


M00042450AD02 


UC2-ColoiiMetLiver 


3478 


446621 


1923 J09 ez43 252752 


F 


M00042455D *H08 

lllv v/ V/ ¥ ^y™^ «y X-' .X XV7 VJ 


UC2-ColonMetLiver 

\-* X_/ X» WW /III TXVliJWA T VX 


3479 


562115 


1923 N09 ez43 252756 


F 


M000425 16AA10 


UC2-ColoiiMetLiver 


3480 


494300 


1923.O09.gz43_252757 


F 


M00042517D:A12 


UC2-Q)l0xxMetLiver 


3481 


451429 


1923.A10.gz43_252759 


F 


M00042351D:F08 


UC2-ColOxxMetLiver 


3482 


486912 


1923.H10.gz43_252766 


F 


M00042452C:A09 


UC2-ColoiiMetLiver 


3483 


456492 


1923J10.gz43_252768 


F 


M00042456A:C08 


UC2-ColonMetLiver 


3484 


562060 


1923.N10.gz43_252772 


F 


M00042516A:C08 


UC2^olonMetLiver 


3485 


4244 


1923.P10.gz43_252774 


F 


M00042S20AF09 


UC2-ColonMvtLiver 


3486 


451429 


1923.All.gz43_252775 


F 


M00042351D:H05 


UC2-CctoxMelLiVvr 
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3NS0OCID: <WO 0214500A2_I_> 



WO 02/14500 



PCT/US01/25840 



Table 2 



SEQ 
ID 
NO 


3LUSTEB 


SEQ NAME 


ORJDE 
NT 


CLONE ID 


LIBRARY 


3487 


449974 


1923 Hll ez43 252782 


F 


M00042452C.C10 


UC2-ColonMetLiver 


JtOO 


59512 


1923 Kll ez43 252785 


F 


M00042458B:B08 


UC2-CoIonMetLiver 


3489 


550362 


1923 Mil gz43 252787 


F 


M00042460C.A02 


UC2-ColoiiMetLiver 


3490 


211273 


1923 Oil ez43 252789 


F 


M00042517D.G04 


UC2-ColonMetLiver 


3491 


450400 


1923 112 ez43 252799 


F 


M00042454A.F02 


UC2-ColonMetLiver 


3492 


562627 


1923.L12.gz43 252802 


F 


M00042459C:E06 


UC2-ColonMetLiver 


34.93 


494393 


1923 B13 ez43 252808 


F 


M00042353D.B08 


UC2-ColonMetLiver 


3494 


451049 


1 923 .F 1 3 .gz43_2528 1 2 


F 


M00042448B.C04 


UC2-ColonMetLiver 


3495 

»y*T^»y 


558729 


1923 K13 ez43 252817 


F 


M00042458B:G05 


UC2-ColonMetLiver 


3496 


526733 


1923 L13 az43 252818 


F 


M00042459C:G02 


UC2-ColoDMetLiver 


3497 


558559 


1923 N13 ez43 252820 

X.-'j^.-J .X i U.txjT^-/ r »« /x^*JX» V/ 


F 


M00042516B:E03 


UC2-CblonMetLiver 


3498 


451518 


1923 A14 ez43 252823 


F 


M00042352A:G05 


UC2-ColonMetLiver 


3499 


349 


1923 M14 sz43 252835 


F 


M00042460C:B02 


UC2-ColonMetLiver 


3 son 




1923 A15 ffz43 252839 


F 


M00042352A:G09 


UC2-ColoiiMetLiver 


3S01 


497101 


1923 D15ez43 252842 


F 


M00042443D:E01 


UC2-CyOlonMetLiver 


3509 


488680 

tOOUOu 


1923 H15 ez43 252846 


F 


M00042452D:C04 


UC2-ColoiiMetLiver 


3S03 


559936 


1923L15cz43 252850 


F 


M00042459D:B07 


UC2-ColonMetLiver 


3504 


450823 


1923 B17ez43 252872 


F 


M00042354B:A07 


UC2-ColonMetLiver 


3505 


446963 


1923D17ez43 252874 


F 


M00042444AC10 


UC2-ColonMetLiver 


3506 


492094 


1923 H17 ez43 252878 


F 


M00042452D:G06 


UC2-ColonMetLiver 


3^07 


559581 


1923K17ffz43 252881 


F 


M00042458C:E06 


UC2^oloiiMetLiver 


3508 


494362 


1923 G18 ez43 252893 


F 


M00042450C:B09 


UC2-0)lonMetLiver 


3509 


560420 


1923 L18 ez43 252898 


F 


M00042459D:E03 


UC2-ColonMetLiver 


3510 


569399 


1923 N18 ez43 252900 


F 


M00042516C:C04 


UC2-ColonMetLiver 


351 1 


487522 

*TO # ^ 


1923 018 ez43 252901 


F 


M00042518D:A08 


UC2-ColonMetLiver 


351? 


498194 


1923 A19 ez43 252903 


F 


M00042352B:F03 


UC2-(^lonMetLiver 


3513 

JJ X«y 


451126 


1923 B19 sz43 252904 


F 


M00042354C:F04 


UC2-ColonMetLiver 


3514 

«J»y XT 


450507 


1923 E19 ez43 252907 

x -y ^« «y . x^y x * (Z) 


F 


M00042446B:G02 


UC2-ColonMetLiver j 


351 5 

^y,J X-J 


450723 


1923 119 cz43 252911 


F 


M00042454D:H10 


UC2-ColonMetLiver 


3516 


492981 


1923 J 19 ez43 252912 


F 


M00042456C:H07 


UC2-ColonMetLiver 


3517 


451054 

*T«/ XV«/*T 


1923 F20 cz43 252924 

17Z>J.j- ^» vy . <~ ~y <b>y<wy4{<^ 


F 


M00042448C:C09 


UC2-ColonMetLiver 


351 8 

D-J XO 


453667 


1923 J20 ez43 252928 


F 


M00042456D:B06 


UC2-ColonMetLiver 


3519 


569588 


1993 K20 ez43 252929 


F 


M00042458D-E06 


UC2-ColonMetLiver 


3590 


597679 


1 923 L20 fiz43 252930 


F 


M00042459D:G04 


UC2-ColonMetLiver 


3^91 


494133 


1093 B22 ff743 252952 


F 


M00042355AA12 


UC2-ColonMetLiver 


3522 


492304 


1923.122.gz43_252959 


F 


M00042455A:G12 


UC2-ColoiiMetLiver 


3523 


100821 


1923 . J22.gz43_252960 


F 


M00042456D:D07 


UC2-ColoiiMetLiver 


3524 


562603 


1923.M22.gz43_252963 


F 


M00042460C:G08 


UC2-ColonMetLiver 


3525 


562275 


1923.N22.gz43 252964 


F 


M00042516D:B11 


UC2-ColonMetLiver ! 


3526 


451302 


1923.A23.gz43 252967 


F 


M00042352C:G01 


UC2-CoIoiiMetLiver 


3527 


451126 


1923.B23.gz43_252968 


F 


M00042355A:C03 


UC2-ColonMetLiver 


3528 


494130 


1923.C23.gz43 252969 


F 


M00042442A:A12 


UC2-ColonMetLiver 
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INSDOCIO: <WO 021 4500A2_I_> 



WO 02/14500 
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Table 2 



SEQ 
ID 
NO 


CLUSTEF 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


3529 


562587 


1923.K23.gz43 252977 


F 


M00042459AE04 

J.TX \y v/vy • jtx. x-#v/ • 




3530 


562543 


1923.M23.gz43 252979 


F 


M00042460CH10 


TTC"?-fY»lr»nMf»tT iv<»r 


3531 


562398 


1923.N23.gz43 252980 


F 


M00042516DC01 


Vy Xv Vy V/1V/XXL VlvUjl V Vyl 


3532 


497517 


1923.P23.gz43 252982 


F 


M00043296AE08 


TJP2-Pn1nnMefT iwr 

Vy VyXV> V^VylVylXlTXVvLXyl VVvl 


3533 


450902 


1923.A24.gz43 252983 


F 


M00042352D-A1 1 


VJ vyX» Vy vylVJlXl VXVvLXyl V vyx 


3534 


499178 


1923.C24.gz43 252985 


F 


M00042442AG04 


IJC2-C l olonMetT ivpr 

\J Vy X* V^VJl VJXU.VX Vv LJL/1 V wl 


3535 


450507 


1 923 .E24.gz43_252987 


F 


M00042446D*F04 


UC2^olonMetLiver 

vy v/A< vy \J iv/uiTiv LXyi V Vv l 


3536 


448556 


1923.F24.gz43_252988 


F 


M00042449A:H10 


UC2-ColonMetLiver 


3537 


454575 


1923.124.gz43_252991 


F 


M00042455B:D05 


UC2-ColonMetLiver 

vy v^Xrf vy vx viu txvu_/i v wx 


3538 


275 


1 923 .L24.gz43_252994 


F 


M00042460B:C06 


UC2^olonMetLiver 


3539 


452719 


1 934.C0 1 .gz43_25300 1 


F 


M00043360CE07 


UC2-ColonMetLiver 

Vy V/X VyL/XV/lXiT±vLlyX V wX 


3540 


449842 


1934.G02.gz43 253021 


F 


M00043369DB01 


UC2^olonMetLiver 

Vy VyX V/VfXV/iiXVlw XI A V vl 


3541 


489506 


1934.N02.gz43 253028 


F 


M00043384DD05 


I JC2-ColonlVTetT .i ver 

uv/x v^v/xuiixviv/ixvivwi 


3542 


453893 


1934.P02.gz43 253030 


F 


M00043388AD05 


V/ V/x VyVylVylllVXVvULyl V vl 


3543 


452586 


1934.A03.gz43 253031 


F 


M00043356BE12 


UC2-ColonMetT iver 

Vy Vy x Vy vJXVIXU VX Vv LXy A V Vv 1 


3544 


491177 


1934.C03.gz43 253033 


F 


M00043360C*F11 


\J Vy X^ V> V/XWXU Vlw LXy 1 V Vvl 


3545 


496586 


1934 M03 az43 253043 


F 


M00043 3 82C D07 

i»lv vy vy"«y J \j \^/ .ly vy / 


Vy Vy x< Vy KJl vJJXLVX C LJLyl V vx 


3546 


490393 


1934 O03 ez43 253045 


F 


M00043 3 87A-E02 


TTP?-Pn1onA/fptT ivpr 

\J Vyx V> VJlVIXLLVXVy LXy 1 V V/l 


3547 


450756 


1934 D04 ez43 253050 


F 


M00043363BA1 1 


iy VyX. ~Vy UIUXXIVXVvLXjI V CI 


3548 


456492 


1934 104 ez43 253055 


F 


M00043373DG04 


TTP?-PnlnnA/fptT iwr 

UV/X Vy VylVlXlVXVy LXyl V Vvl 


3549 


490395 


1934 O04 ez43 253061 

a y <*v t . vy vy i « £3 x*.y vy vv x 


F 


iTivy Vy vy^«y «y ly / rl.ljUJ 


VJ \y x« V/ vJlVJXll VA Vv LXyi V vl 


3550 


489275 


1934 P04 ez43 253062 

x y -y r « x vy r »5~>« tT+j xy y \j vjx* 


F 




U wx^~VyVJlvJXXlVXOLXylVCl 


3551 


489368 


1934 COS ez43 253065 


F 


M00043360DD01 

lTlvy\yv/"^^VyV/X*'.X-'V/ X 


T TP ? -T 1 rvl on IvTpfT ivpr 

Vy Vy x# "\y VylvJlii VXVv U_yl V Vvl 


3552 


450756 


1934 EOS ez43 253067 


F 


M00043364DH1 1 

lTlVvv/™J*/vrTX/.lll X 


TTP?-Pn1r«iivTptT ivpr 

VJ VyX "Vy VUUXU VX Vv LXy 1 V Vvl 


3553 


496084 


1934 K05 ez43 253073 


F 


M00043378DD12 


TTP9-Pn1nnMetT ivpr 

V-/ VyX» VyVyl Vy 111 VXVv LX-il V Vvl 


3554 


450193 


1934 L05 ez43 253074 


F 


M00043380DE10 

ITlVVV'T./J Ov/l/.JUlv 


T TP 9 -P nl nn A/TptT ivpr 

VJ VyX. Vy VJlVJlllVXVvLXyl V Vvl 


3555 


499240 


1934 M05 ez43 253075 


F 


M00043382CG09 

1TXV/ V/V/"T«y ^ vJyi* Vy .VJ Vy J 7 


TTP7-Pn1nnlvTptT ivpr 

vy Vy X» Vy VJ 1 Vy 111 VXVv LXy 1 V Vvl 


3556 


448230 


1934 A06 ez43 253079 


F 


M00043356CF03 

iTXvy \/v/"T«y «y «y vy v^.x vy ^y 


Vv VyX Vy Vy 1 VJU1 VXVv LXy 1 V Vvl 


3557 


455707 


1934 H06 sz43 253086 


F 


M00043372BB01 

iTiv/v/v/T*/*/ / rill >iyv x 


T TP2-PolonMet'Li ver 

Vy VyX. Vy Vy 1V/111 VXVv LXy 1 VVvl 


3558 


496760 


1934N06sz43 253092 


F 


M000433 85 AE0 1 

ITIV/V/Vnj •/ Uk/ix.Xyv X 


T IP2-Po1nnlvfetT .iver 

Vy Vyx Vy VFXV/1XLVX Vv LXvl V Vvl 


3559 


447429 


1934A07gz43 253095 


F 


M00043 3 5 6C H07 

i t x vy v/ vy r — ' «y vy v .X xvy # 


T TP2-Pnlon M etl Jver 

Vy VyX Vy VJ1 Vy 111 VXVv LXy 1 V Vvl 


3560 


450551 


1934.G07.gz43 253101 


F 


M00043369DG10 

lTXV/v\/~«/^ vyl/, VJ x vy 


UP2-PolonMetLiver 

Vy Vyx Vy VI Will VXVv LXy 1 V Vvl 


3561 


451144 


1934J07az43 253104 


F 


M00043376B-C03 

ITIV/UV/TJ J / VyXJ.V — /Vy»y 


TTP9-Pnlon\fptT iver 

VJ VyX VyV^l Will VXV/LXyl VVvl 


3562 


491402 


1934C08ez43 253113 

X. y «y ~T . Vy vy VJ wy y JL JL *J> 


F 


M00043361BF02 


T TP9-Pnlnnlvf pfT ivpr 

vy Vvx \y vilvJlllVXvvLXyl V Vvl 


3563 


449956 


1934 E08ez43 253115 


F 


M00043365A-C06 

J, Yl V V/ V~ J J Vlu'xVt V/ V/U 


TTP9-PnlAniVfptI ivpr 

VJ Vy X Vy VJ1 VJ111 VX Vy LXy 1 V Vvl 


3564 


449959 


1934.G08.gz43_253117 


F 


M00043370A:C08 


UC2-ColonMetLiver 


3565 


560213 


1934.108.gz43_253119 


F 


M00043374A:E05 


UC2<k>loiiMetLiver 


3566 


450482 


1934J08.gz43_253120 


F 


M00043376B:F06 


UC2-ColoiiMetLiver 


3567 


449795 


1934.B09.gz43_253128 


F 


M00043359B:A04 


UC2-ColonMetLiver 


3568 


486856 


1934.O09.gz43_253141 


F 


M00043387B:A03 


UC2-ColoiiMetLiver 


3569 


450080 


1934.C10.gz43_253145 


F 


M00043361DA02 


UC2-ColonMetLiver 


3570 


510272 


1934.110.gz43_253151 


F 


M00043374A:G04 


UC2^oloiiMetLiver 



157 



3NSDOCID: <WO_021 4S00A2_I_> 



WO 02/14500 



PCT/US01/25840 



Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


3^71 

j J 1 1 


4Sosa3 

^JV/JOJ 


1 034 A 1 1 <yr43 753 1 SO 


F 
i 




UC2-ColonMetLiver 


^77 


4^0103 


1034 R11 cr743 7S3160 




M0004335913D06 

lVlvV/V/*t J J J 7XJ .1/ V/U 


I JC2-ColonMetLiver 


j j / J 


1 34307 


1034 M11 0743 7S3171 

J." j't.IVXI 1 .Q^rr D 


F 

X 


M00043383RF12 


UC2-ColoijMetLiver 




t r70ZJ < T 


1034 "Mil o?43 7*53177 

I 7 J*T.XN X X .gZ.tJ ArJj L / JZ* 


F 

X 


M00043385CD06 


UC2-ColonMetLiver 


3^7S 


4K7R03 


1034 A17&743 75317S 

1 J' J*t,r\.iA..gZi*TJ Z>JJ1 / J 


F 

X 


M00043357B-B02 

lVlV/VVf J J J / U.UV/Ar 


UC2-ColoiiMetLiver 


3^76 


401 S07 


1034ni2ff743 253178 


F 

X 


M00043363DF07 


UC2^oloiiMetLiver 


^S77 


733R14 


1034 T12 a743 253184 


F 

X 


M00043376DA12 


UC2-ColonMetLiver 


j j / © 


4S01 80 

*T JVJ 107 


1034 A13 0743 753101 

1 7J*T,rVU.g£>*TJ J J X 7 X 


F 

X 


M00043357B-D01 


UC2-ColoiiMetLiver 


^7Q 

jj /7 


tOoD Ij 


1034F13«tt43 7*5310S 

l7Jt,DlJ.gi*tJ ^JJI7J 


F 

X 


M00043365BC08 


TJC2-ColonMetLiver 


jjoU 


4S101 1 


1034 K13 ot43 753701 

17J*T,J\lJ.5&iTj Z>JJ^.U X 


F 

X 


M00043379CB09 


UC2-ColoiiMetLiver 


J JO 1 


4S0707 


1034 G14 ez43 253213 


F 


M00043370BD08 


UC2-ColonMetLiver 


3^R7 


4S1 1 

*+ J 1 X OJ 


1934 T14ez43 253216 


F 


M00043376DD12 


UC2^oloiiMetLiver 


3SR3 


SS1380 

J J XJOV 


1934L14ez43 253218 

17J7,JjlTig£i~J ^.JJ^xU 


F 


M00043381BE10 


UC2-ColoiiMetLiver 


3SK4 


40S143 

t7J 1HJ 


1934 M14 ez43 253219 

17 J™.1»X1™.K^~J X^JJ^X.^ 


F 


M00043383DC07 


UC2-ColonMetLiver 


3^as 

JJOJ 


4X71 83 


1034 014 crr43 253721 


F 


M00043387CA1 1 


UC2^oloiiMetLiver 


JJOO 


631S76 

Uj J. JZ,U 


1934T15ffz43 253231 

17JT.llJ<g&<"J ^#J jAfJ A 


F 


M00043374DD07 


UC2-CoIoiiMetLiver 


3^R7 


44Q7R0 


1 034 F 1 6 cr 7 43 253243 

1 7JT.lilv;,gZ. L TJ ajjaHJ 


F 

X 


M00043365CA09 


UC2-ColoiiMetLiver 


35£8 


40 1 407 


1034 F16 cr743 253744 

17JT.I IV/.gZ/TJ iJJLII 


F 

X 


M00043368AF03 


UC2-ColonMetLiver 


3S£Q 

JJ07 


404000 

H7HU77 


1 034 K 1 6 &z43 253249 

17JT.JVlv.gZi*TJ 


F 

X 


M00043379DA05 


UC2-ColonMetLiver 


^oo 
jjyy 


4SOS10 


1034 1SJ16 o^43 7S37S2 


F 

X 


M00043386AA1 1 


UC2-ColonMetLiver 


^Ol 

JJ71 


4&&34Q 


1034P1fio743 7S37S4 

l7J*T.Ir l\J.j3j£*TJ LJjZJt 


F 

X 


M00043389DB10 

lVlU\/\/*TJJlJ7X/.U XV/ 


UC2-ColoiiMetLiver 


JJ72 


^1 ^473 


1034T17o743 7S3763 


F 

X 


M00043374DH09 


UC2-ColonMetLiver 


jjyj 


4QO£/L£ 
*+7VO*tO 


1034 PI 8 o«?43 7*53773 

l7JT,V^10.gZHj Z>JJ^ / J 


F 

X 


M00043362CE02 


UC2-ColonMetLiver 

Vri/ V^j Vj\J1\/1XJ. T JL t> 1 *X ▼ V-'X 


j jy** 


jjOj 1 1 


1034 T 18 ot43 7S3787 

l7J*T.Jul0.5i*rJ z,JJZO^ 


F 

X 


M000433 8 1 C El 0 


UC2-ColoiiMetLiver 


JJ7J 


487437 


1034M18ov43 7S3783 

17 j*t.xY11 O.J*jE/tJ LJJiOJ 


F 

X 


M00043384BA04 


UC2-ColoiiMetLiver 

V-/ V_J \JVlwlllf XWUIWX T V^X. 


JJ7U 


4Q1 740 


1034F10ot43 7S3701 

1 7J < T.JM7.gZ, < TJ AJJ^71 


F 

X 


M00043365CF06 


UC2-ColoiiMetLiver 


3^Q7 


4^771 0 

HJZ / 1U 


1 034 HI 0 ot43 7 S3 704 

l7J*T.XXl7.gZ.*TJ Z.JJ^7t 


F 

X 


M00043373AG1 1 


UC2-ColoiiMetLiver 


JJ70 


44QQ<tQ 
*f*f 77J7 


1034 T70 o^43 7S3317 

lyJT.JZ.U.^Z.T'J LJJJ 1Z* 


F 

X 


M00043377CA1 1 


UC2-ColonMetLiver 


JJ77 


440Q7K 

*T*t 77 1 o 


1034P70ot43 7S3318 

1 7j*t.x x.\J.g]c*tJ Z-jjjXO 


F 

X 


M00043390BC04 


UC2-ColoiiMetLiver 

V_J^_^ V/VlVxUTXVUi-ix ▼ V^-*- 


^£oo 


4^0747 


1034 A71 a^43 7S3310 

i7J*T.A£.l.giC'TJ ^JJJ 17 


F 


M00043358AD02 


UC2-ColoiiMetLiver 


jOU 1 


4^1 704 
Hj 1 ^7*+ 


1034T71 ar43 7S3377 

l7j 4 t.XZrl.gZ'TJ ZJJJL / 


F 

X 


M00043375A-E02 


UC2-ColoiiMetLiver 


^£07 


4^nofto 


1034 1^77 a-743 7S3341 


r 


A/T00043371BC , 10 


UC2-ColonMetLiver 


3603 

JVJvJJ 


4^1 361 

Hj iJOl 


1034T77&743 753344 


! F 

X 


M00043 3 77D EO 1 

1YX vUv~J J / / i/.J^vl 


UC2-ColoiiMetiLiver 


3604 


4ftQ707 


1034 M77 ot43 7S3347 

l7J^.lVX^X».giCTTJ iJ J Jt / 


F 

x^ 


M00043384CC02 


UC2-(jolonMelLiver 


3^os 


45*8030 


1034 077 &743 7S3340 

17JT,V/Ai*'.5i"j z»jjj*t7 


F 

X 


TVT00043 3 87D B02 


UC2-ColonMetLiver 


3606 


449908 


1934.G23.gz43_253357 


F 


M00043371C-.B02 


UC2-ColoiiMetLiver 


3607 


450211 


1934.023.gz43_253365 


F 


M00043387D:D06 


UC2-ColoiiMetLiver 


3608 


402916 


1934.124.gz43_253375 


F 


M00043375B:A04 


UC2-ColoriMetLiver 


3609 


494450 


1934.J24.gz43_253376 


F 


M00043378B:B05 


UC2-ColoiiMetLiver 


3610 


492627 


1934.M24.gz43_253379 


F 


M00043384C:G01 


UC2-ColoiiMetLiver 


3611 


550246 


1947.J01.gz43_253404 


F 


M00054555D:C03 


UC2-ColoiiMetLiver 


3612 


550952 


1947.106.gz43_253483 


F 


M00054555A:H09 


UC2-ColoiiMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


OREE 
NT 


CLONE ID 


LIBRARY 






1QA7 AlAo™!'* O^^Q 
iy4 / .A.LU.gZ43 Z33337 


17 


IViU UU3 43 43 . x5Uy 


T TOO sA A /Tr-v+T i , 

u L/Z-^oiotiMeLLiver 


A 

jO 14 




1QA7P91 9«70.A 
iy4 / ,rJL 1 ,gZ43_Z33 / 3U 


X* 


iV1UUU34300JJ.vjU5 






J XJ / v / 


1 Q47 IT09 otA'* 7^^7^7 
157** / AJZZ.gZ43__Z33 /3 / 


T7 


lYlUUU34333XJ.liUy 


T TOO P/Vl/Y**A/Tt*+T nrar 




SS9AQ9 


iy3y.PU3.gZ43 Z335XO 


J7 


1VXUUU3403 IXJ.v^UZ 


T TOO O^xl/vfkA/fa-fT i-iror 


jO 1 / 


34yziu 


1 Q^O AAAA. CrrrAI 7^7ft^Q 
.JVlU4.gZ43 Z33o3y 


X* 


1V1UUU3 404UX3 . t^U3 


1 TOO OyNly-k**AAzi+T Mra«. 


JU lo 


33 3yZ3 


iy3y.l-JUo.gZ4 3 Zjjo7H 


X* 


A/T A A A ^\ A A.0 O r 1 A & 
lVlUUU340Zyi^.vJU0 


T TOO Or\1/v»-»A4ia-fT ivris-r 

u cz-uoioiiivietLiver 


30iy 


A£1 
401 


ickq par 0-7A7 9^o.qa^ 

1 y 3 y . 1^ U O ,gZ43_Z3 3 z? U O 


Xr 


A/1 A A A^ A£A A"R • T7 A7 
1VXUUU34044X3.J7 UZ 


T TOO O fA A Acx+A itnaf 




JJ iy /3 


1QSQ TITO orvAl 9^AA7A 
1¥ 3y..LJiy.gZ43 Z34U/U 




A/fAAA^AMATl- A A£ 
1VXUUU3 403 UX5 .AUO 


1 TOO Oj»»1 A /Te»*T nra«- 


jOZI 


34y3 lO 


1 Q^Q AJ90 r^A^ O^AIOft 
iy3y.INZZ.gZ43_Z341Z5 


17 
r 


A/TAAA^A^idOT^'iri 1 
IVlUUU3404ZiJ.x 4 1 1 


XTOO 0^>1^**A/T«*-T 


jDZZ 


333y4y 


1 Q^Q TTOA rr»»AQ O^A 1^7 
iy3y.J>kZ4.gZ43_Z3413 / 


17 
X* 


A/TA A A ^ /l/^ STV 17 AO 
JVLUUU3403 oU.Jr UZ 


U CZ-ColoiiMetLiver 




iy3yy 1 


1 Q£A T A3 citA'X 0*iAOA£ 
1 yOU .lwU3 .gZ43_Z34ZUO 




A/TAAA</1£/£1 A •TJA'J 
1V1UUU3400 X A.X5U3 


u L^z-^oionMetiwiver 




3013yU 


1Q^H TA7 <Y-7/t^ O^AO#C7 
lyOU.lU / .gZ43_Z34ZO / 


17 
X* 


A/TAAA</l^^^r , »17 1 A 
J>aUU U3 403 O . xi 1 U 


u L/Z-i^oioiiJNieti-»iv6r 


jOZj 


A < Aft £7 


1 QAA O 1 9 <y-wIO 7^/17Al 
1 yOU.L/ 1Z .gZ43_Z3434 1 


17 
X* 


A/f AAA^/i*C/Lftr , 'T-J1 A 
xVlUUU3404o . XT 1 U 


TTOO 0«1^«Aif^+T «mm> 

uuz-uoioiiMetJLiver 


3ozo 


7/£A^7^ 
Z043 / 3 


I VOU. IN 14.gZ43_Z343 o4 


17 


A/f AAA</f /C/^/lFl- A A 1 
JVlUUU34004xJ .AU x 


TTOO PaIamX/T^T 

u uz^oionMetiwiver 


jOZ / 


3UZ053 


1 OAO "DO 7 crrrAI 9^/1^1/^ 

1 yOU .UZ3 ,gZ43_Z343 1 0 


TO 

r 


"\ /rn A A ^ A &A 1T\ •T4" AO 
JYLUUU3404 / U.XTUZ 


T TOO _0 y-it-» A AcA-X MfAi* 


jOZo 


43ZO0Z 


torn t70/l 9^/1 
iyOU,JiZ4.gZ43 Z34333 


X* 


A>fAAA</i^^ i n.nno 
JVlUUU34o3 1JJ. JJUZ 


TTOO 0^1^«A/Ti=*+T 




^ 1 O^Q 
331ZOy 


lyoy.r Kjd .gZ43__Z343o4 


17 

r 


A/TAAA^/1 /C'7/1T>-X5A'2 

JVIUUU340 /4X5.X5U3 


TTOO 0^1^*%A4"a*T 


303U 


^70^.51 1 

d /y4oi 


1 yO y.l^U / .gZ43_Z34043 


17 

jt 


A>f AAA^AA7 1 A *XJA7 
1V1UUU340 / 1 A.xtU / 


T TOO -OrtlrtwA/Tcrf-T nrar 


303 i 


43UOZ4 


1 O^O Al/if^/l^ 0^/1 ^Ol 

i yoy .Aiu.gz43_z34oy i 


17 

r 


A/TAAA^/K^$ A 'TIA1 
JVLUUU3400O A.JL/U 1 


TTOO Oi-»1i^**AA«4-T 

u cz-uoionMetlviver 


3D3Z 


A1 «0A 
4 133ZO 


i yoy ,in 1 1 .gz43_Z34 /zu 


T7 

r 


A A A A A < A /Z Q < T\ • T\ AO 

JVlUUU34oo3-LI.JJUy 


TTOO 0^1^*>A>f^+T 




3 ID IDD 


lOAO i^1A rr-rAI 7 < A0 17 

iyoy.v^z4.gz43_z34y i / 


17 


TiJfnf\(\<A£H 1 TV AIO 
JYLUUU340 / 1JJ.A1Z 


TTOO Or«k1/«»»%A/r«*-T »,a« 




33 io3y 


I y OD ,JJU 1 .gZ4«5__Z34y 34 


17 

r 


A/TAAA<AQ 1 1 r\-T7A1 
J\lUUU34o 1 1 Ij.r U 1 


T TOO O sA i^t-» A A ^»+T 




34y5lO 


1 y 53 .HU0 .gZ43_Z33 U 1 3 


17 
X* 


A/TAAA^Aft 1 OTA*TJTA< 
JYlUUU34o 1ZU.XTU3 


TTOO O /vr» A A/a+T « rA r 




33 /3Z3 


iqqo lino K*zf\/ZA 
iyoD .r Uy .gz43_Z33Uo4 


x? 
r 


A>f ArtA-Cyl Q1 A A. .XTAO 

MUUU345 14A.r U / 


TTOO Or%l^*»AA«*T » rA «> 


0,£00 


AA1A(\^ 
44 /4U3 


iyo3.1Vl X.gZ43__Z33 1U1 


X* 


A/fAAA^A80'3'R»T?A7 
IVXUUU345Z3JE5.XiU / 




WO 


A.QAA 7 1 
4y44/ 1 


iqoo pin nrrAI 7^^001 

i yo3 1 y .gz43_z33zz i 


17 

r 


A>f AAA^Aft 1 1 "D.XJ1 O 
JVlUUU34ol lo.XTlZ 


TTOO 0/-k1/*»»AA*»+T rtro* 




4303ZU 


1QS7 T71Q rr-yA"5 7^^'7'7'2 
1 y o 3 . Ji 1 y .gZ43_Z3 3 ZZ3 


17 


AAAAA^Aftl 713'T^I 1 
JYIUUU34513X5.U1 1 


TTOO Orx1/Vr»AA«fT iirar 


3D4U 


^q^oa 

DDDDZKj 


1 OBA XTA7 /vtA7 7<\^*3A/1 
iyo4.INUZ.gZ43_Z33344 


17 


A/fAAA^AftASU»T?AO 
JV1UUU34545X5 .JtSUZ 


TTOO OAl/VMAAirfT 

u ^z-L^oioiuvxeu-fiver 




30Uy54 


I y 54 .r'UO.gZ43_Z334 1 U 


17 
X* 


A>TAAA<A ft^ 1 TD-T7AQ 
JVLUUU3453 lx5.XiU3 




7£AO 
304Z 


^^o** ao 

30Z3UZ 


lOQ/t 7<:</l 17 

iyo4AiU/.gZ43_Z3341 / 


r 


A/TAAA^I/I CaOT\.17AQ 

JVlUUU34o3yU.r Uo 


TTOO OvVl/v*%AA/*fT 


3043 


334iy0 


iy54. AlZ.gZ43_Z334y 1 


r 


~\/fr\r\f\^AQi i a -Xi 1 1 
JYLUUU34o3 lA.xil 1 


TTOO 0/-\1/v*%AA*»+T 




33 / /U3 


1 y54. vjZ 1 .gZ4 j__Z33o4 1 


X7 

r 


A A A A A < A Q A A T% • "D A 1 

JVLUUU34o4Ul/.xJU3 




3043 


30ZZ43 


1 y y4 ,li 1 3 ,gz43_Z3 3yz / 


T7 

r 


A/fAAA^ylOTO A .T7A1 

jviuuu34o/ /A:rui 


TTOO 0*-»1rv»<»AA***T mmm* 




33^33 i 


iyy4.Aiy.gz43_z33yo / 


r 


A/TAAA<T>i 077 A -TT1 f\ 

JVlUUU34o /ZA.xt1U 




7£A7 


44/4o3 


1 yy4.r ZZ .gz43_Z3o04U 


r 


JVLUUU34o /oxj.vjU3 


TTOO 0/%Ti-v*»AA4»fT 


3648 


54Q3Q1 


1 QQ4 K94 a^4^ 9^^077 


x^ 


IVIuuvJtOOJV/.UvU 


TIC^-ColonMetT ivpr 

V/Zr V_/UlVJlJJ.YJ.t/ 1 1 /I V vl 


3649 


550204 


1995.C02.gz43_256101 


F 


M00054893D:F07 


UC2-ColonMetLiver 


3650 


554604 


1995.K02.gz43_256109 


1 F 


M00054904A:F07 


UC2-ColonMetLiver 


3651 


558326 


1995.N02.gz43_256112 


F 


M00054908D:A07 


UC2-ColonMetLiver 


3652 


557218 


1995.A03.gz43_256115 


F 


M00054891D:H05 


UC2-ColonMetLiver 


3653 


562152 


1995.C03.gz43_256117 


F 


M00054893D:G09 


UC2-ColonMetLiver 


3654 


562679 


1995.F03.gz43_256120 


F 


M00054897C:F03 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


XUSTEE 


SEQ NAME 


ORD3 
NT 


CLONE ID 


LIBRARY 


3655 


543855 


1995 G03 ez43 256121 


F 


M00054898D:H12 


UC2-ColonMetLiver 


3656 


555701 


1 995 . J03 .gz43_25 6 1 24 


F 


M00054903A:G07 


UC2-ColonMetLiver 


3657 


549376 


1995 K03 ez43 256125 


F 


M00054904A:H10 


UC2-ColonMetLiver 


3658 


522220 

mJ t Iff ft ft\J 


1995 M03 ez43 256127 


F 


M00054908A:F10 


UC2-ColonMetLiver 


3659 


550710 


1995 003 ez43 256129 


F 


M00054922A:C11 


UC2-ColonMetLiver 


3660 


562672 


1995 .E04.gz43_256 135 


F 


M00054896B:E09 


UC2-ColonMetLiver | 


3661 


559088 


1995.104.gz43_256139 


F 


M00054901C:A04 


UC2-ColonMetLiver 


3662 


447858 


1 995 .D05 .gz43__25 6150 


F 


M00054895B:D09 


UC2-ColoiiMetLiver 


3663 


492256 


1995 LOS az43 256158 


F 


M00054906D:D06 


UC2-ColonMetLiver 


3664 


550047 


1995.B06.gz43_256164 


F 


M00054893A:F08 


UC2-ColonMetLiver 


3665 


562579 


1995 P06 ez43 256178 


F 


M00054923D:A08 


UC2-ColonMetLiver 


3666 


556804 


1995F07ez43 256184 


F 


M00054897D:E08 


UC2-ColoxxMetLiver 


3667 


553102 


1995 107 ez43 256187 

X _x ^/ ♦ X vy / * f\* * * ^ ***** x^ ju xy * 


F 


M00054901C:G01 


UC2-ColonMetLiver ! 


3668 


559118 


1995.P07.gz43 256194 


F 


M00054923D:D04 


UC2-ColoiiMetLiver 


3660 


559554 


1995 H08 cz43 256202 


F 


M00054900B:D12 


UC2-CoIoiiMetLiver 


3670 




1995 K08 ez43 256205 


F 


M00054904C:B06 


UC2^olonMetJLiver 


3671 


567402 


1995 N08 ez43 256208 


F 


M00054910A:H12 


UC2-ColoixMetLiver 


367? 


553975 


1995 O08 ez43 256209 


F 


M00054922B:A08 


UC2-ColonMetLiver 


3673 


561525 


1995 N09 sz43 256224 


F 


M00054910B:F07 


UC2-ColonMetLiver 




1 S0830 


1995 O09 ez43 256225 


F 


M00054922B:A10 


UC2-ColoiiMetLiver 


3675 


45 1 049 


1995 E10 sz43 256231 


F 


M00054896C:F01 


UC2<:oloiiMetLiver 


3676 


560668 


1995H10ez43 256234 


F 


M00054900B:G10 


UC2<:oloiiMetLiver j 


3677 


555260 


1995 110 ffz43 256235 


F 


M00054901D:B01 


UC2-ColonMetLiver 


3678 


555785 


1995 J10 ez43 256236 


F 


M00054903C:F03 


UC2-ColonMetLiver 


3679 


82864 


1995 Dll ez43 256246 


i F 


M00054895C:B07 


UC2-ColonMetLiver 


3680 


559554 


1995 Hll gz43 256250 


F 


M00054900C:B04 


UC2-ColonMetLiver 


3681 


556404 


1995.C12.gz43_256261 


F 


M00054894C:B11 


| UC2-Cc>loxiIVl^iver | 


3682 


516691 


1995H12ez43 256266 


F 


M00054900C:C03 


UC2-ColonMetLiver 


3683 


556499 


1995 E13 ez43 256279 


F 


M00054896D:C01 


UC2-ColonMetLiver 


3684 


453715 


1995 F13 ez43 256280 


F 


M00054898A:G09 


UC2-ColoiiMetLiver 


3685 


556753 


1995 H13 ez43 256282 


F 


M00054900C:D08 


UC2-ColoiiMetLiver 


3686 


549847 


1995 J13 ez43 256284 


F 


M00054903C:G03 


UC2<:olonMetLiver 


3687 


549822 


1995 K13 ez43 256285 


F 


M00054904C:G09 


I UC2-ColonMetLiver 


3688 


480001 


1 995 Ll3 ffz43 256286 


F 


M00054907B:E07 


UC2-ColonMetLiver 


3680 

JU07 


S69080 


1005 PI 3 az43 256290 


F 


M00054924B:E10 


UC2-ColonMetLiver 


3690 


555998 


1995.A14.gz43_256291 


F 


M00054892C:G05 


UC2-ColonMetLiver 


3691 


562106 


1995.B14.gz43 256292 


F 


M00054893C:A06 


UC2-ColonMetLiver 


3692 


557326 


1995.E14.gz43 256295 


F 


M00054896D:C11 


| UC2-CoIoiiMetLiver 


3693 


558935 


1995.J14.gz43 256300 


F 


M00054903C:G04 


UC2-ColonMetLiver ! 


3694 


399766 


1995.L14.gz43 256302 


F 


M00054907B:F01 


UC2-ColonMetLiver | 


3695 


553976 


1995.014.gz43 256305 


F 


M00054922C:A11 


! UC2-ColonMetLiver 


3696 


554002 


1995.B15.gz43 256308 


F 


M00054893C:A07 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


3697 


553691 


1995 P15 ez43 256322 


F 

1 


M00054074T3-F07 


\J V^UlULLLVlCLl_/lVCr 


3698 


556365 


1905 T316 az43 256324 

A 7 7.7 .XJ A\7.£^Z»f«7 


i 


M00054803r-R0? 

1V1 \J v/ W J *+ O 7 -> V> . X> UZ 


TTr , 9-Pn1r*nlV/f^tT \wr 
KJ V^Z,- V^OIOILLVAC LL/lVCr 


3699 

.J VJ 7 7 




1995 D16 ez43 256326 


i 


M00054805D130? 


KJ V-' Z, ~ V^UlUiilYJLC LIjI VCI 


3700 


553534 


1905 016 P743 956320 


F 
i 


A/T00054800r-ni 1 

IVlv/V/V/ J*+077V^.JL/ 1 1 


V/ V^Z,~VyUIUJJULYACLLfiVCl 


3701 


557004 


1905 HI 6 ez43 256330 


F 
i 


M00054000HF1 1 

1VA\J VJ V7 ^ *T 7 V7 \JxJ . A 1 1 


T TP ?-f , n1rwi"Mf*tT i\rt*r 

KJ V^Z/"" V^UIUIIIYACLLJ. VC1 


3702 


555082 


1995 M17 cz43 256351 

177J.1V11 / .££Zt~T«7 £*+j\JmJmJ 1 


F 

A 


MO0O54O08P- A 1 0 


vj v^z,"V^LHv}iiivieii«iivcr 


3703 


562714 


1995 P17 ffz43 256354 

X Zs Zs .X 1 / .gZ/T.J ^,JWJJT 


F 
i 


M00054074D F0 1 


vJ V/Z^~ V^UlUAilVlwULfl VOX 


3704 


552122 


1995 D18 sz43 256358 


F 


M00054895DC 1 0 


V/ V/^r \*jr v^lVJALlVlw LAjI V vl 


3705 


411272 

TllZ,/L 


1995 K18 ffz43 256365 

177J,J\10,gZi7J Z.JUJUJ 


F 
i 


M00054004DF07 


KJ V^Z< " V^UlUlAlVlC UL/1 VC1 


3706 


454812 


1995 A19 ez43 256371 

177J,rV17.g&iTJ £*J\JmJ / 1 


F 
i 


M00054809DD06 

lVlwVVJ*T07Z'l/,iyvU 


V/ V^Z."V/UlUlAlVACLLfl V CI 


3707 


558177 


1995 J19 ez43 256380 

1 7 7 J . J 1 7 .g^>~ J ^fJVUOvr 


F 
i 


M00054003DO01 

1 VI V/ v \7*7 *T 7 \JJXJ . VJ V7 A 


TTr , 9-Pn1nnM'ptT iwr 

l-J V--Zr "V^vlUJAlVlCllvlVCJl 


3708 


556852 


1995 P19 ez43 256386 


F 


M00054925AB08 


TTP9-Po1rmMptT ivpr 

V7 V/^r V^UlLUAlVl&U^il V wl 


3709 


551463 


1995 B20 ffz43 256388 


F 


M00054893 C -E04 


V7 V^Zr V'UlUlllVl&LL/lVvl 


3710 


558011 


1995 O20 az43 256393 

177J tVIAxltfMff J \JmJ 7«7 


F 
i 


M00054890D-C1 1 

1V1V/VV/ J*T0771/. V/ 1 A 


TTr*9-rVi1nnMf*tT ivpr 

V V-"Jt» VAJIvJIAIVAwLAjI V Vl 


371 1 


559088 

J J7UOO 


1905 FT90 $yz43 956394 

1 77J J.lA>v ( g£i7J Z*~)\J*J 7*T 


F 
i 


M00054001 A-T304 

IVAV/V/ \J*s *TZ7 \J 1 JZ\. . U V/*T 


v v^z,"v>uiuiiiviclj_j1 vcr i 


371? 
•j # i^> 


5Q5066 

7.7 V7VJ\J 


1005 T90 **z43 956305 


F 

A 


M00054009Rrr09 

1V1 v/ UUJt 7W^,D . VJ V7Z» 


Uv/Z ~ V/\J1U111 VIC LL/1 V CI 


3713 


558868 


1995 K20 ez43 256397 


F 
i 


M00054904D ■ G04 

1V1UUV/JT7VT17 . VJV/~ 


TIP2-Pn1onMetI iver 

V» V/A» V^UlUlllVlwIJUilVCl i 


3714 

/ 1*T 


458035 

tJOvjJ 


1005 F91 <*r43 956408 


F 

r 


M00054808R Ti04 

1V1V/ \J\J -7 *T O 7 O JJ . \J V*T 


TTP7-pMlr»n*MptT ivpr 

V V^^ V^L>lUlMYlCULyl VC1 


371 5 


555837 


1905 FF?1 ar43 756410 


r 


M00054001 A-F01 

lYlUVi V J*T7\J 1A. r V/ 1 


TTP7-Py\1rmlV4ptT fi/*»r 
U V^Z,~V^t)lUlllYlC uLrl V CI 


371 6 

O / JLO 


4^?1R? 


1O0^ T?1 0^43 9^^411 

177J .1Z*1 ZJOHll 


j? 

r 


Mnnn^4on?r*- at? 


T TP9-PnTr»T>lV/r**tT «7*>r 


3717 

DILI 


555967 


1005 T91 or43 956419 


17 
J7 


M00054003Tyfr1 0 


TTP9-Pf>1rmA/f^fT ivf»r 
V V^Z,-V^U1uIJLLV1CL1j1 vci 


371 8 

J / lO 


474 1 63 


1005 "R99 0743 956490 


r 


M00054803Prr06 


TTP9-Pn1rm"MV»tT iver 

VJ Vx Z. '"VAJlOJAlYlClAjl V CI 


371 0 
o / iy 


S5616Q 


100S 1499 crr43 9S6496 




M00054001 A-FT1 1 


U V/ZV- v^OIOIaIVIcIJL/IV ci 


3790 


558770 

JJO i iy 


1005 P?3 or43 756437 


i^ 


M00054805R* A09 

1V1 \J v \JJ *tO 7JD 


T TP9-Pr»1n-nA/TftT ivf»r 
V7 V^Zr V^UIULLIYICLL/IVCI 


3791 


555830 


1005 tr?3 <*^43 756441 


T? 

r 


MO0054800D*F07 

lvl\/v/WJ*T077JL'.Jrl/ / 


TTP9-PoTr*nTV/fetT iver 
V VxZ.~ V^U1UILLY1CLL/1 VCI 


3799 
j / z,z, 


974 
y /*t 


1005 "M73 aM3 756448 

l77J.l^ZJ.gi6*TJ ZJUttO 


F 
r 


MO0054071 P-n01 


TTP7-Pn1rttTN/ff»rT iver 

KJ \s Z, ~ V>UlUIliYIC VXjl V CI 


3793 


556639 


1005 1374 az43 75645? 

i 77 J .DZi*r.g£it J ZJU t tJZ 


F 
r 


M00054803TVD 1 2 

lVlV7Viv«7*t07-7l/.l-/ 1^> 


TTP?-Pr»1nnTV/fetT iver 

V/ V^^ V^L/lLllAlVACLAvl V Cl 


3794 


596800 


1005 P?4 <r^43 756453 

177 J . V^Zf^T.gZrr^ ZJD'tJJ 


F 


M00054805R- A08 

MUUU J*t O 7 JD ./\vO 


T TP9_Pr\1r»n1vTp»tT iver 
KJ K^ Z, ~ VAilOIIlYlC UL/1 V Ci 


3795 


405365 


1 005 TsT?4 oz43 756464 


F 

A 


M000540?irn06 


TTP9-Pn1nnTvTetT iver 

V7 Vii V^LIlUlliVAClAjl V t/1 


3796 


447639 
*t*t / kjd -7 


1005 (174 az43 756465 

177J .\JZ.*T.g£rtD LJUtUJ 


F 

17 


M00054073 A -H02 

lVlWvrV/J*T7ZrOzV.llWx. 


TTP9-PnlnnKfetT iver 

KJ \-s ■£* VxVJl Ulil VAC LL/1 VC1 


3797 


41851? 

*tlOJ IZ 


1006 K0? a^43 756517 


F 

Mr 


M00054038A-T308 


TTP9-Pn1r*nA/TetT iver 

V V^UIUIIIYIC Ll^i V CI 


37951 


559798 


1006 A04 o^43 756530 




M000540? 5 -R0 1 


T TP9-Pn1nnA/ff*rT iver 

C V>Z,~V>OIOIIIV1CIXjI VCI 


379Q 
/ z,7 


67060 


1006 PO^ ot43 756570 


F 

XT 


M00054047P-R04 

IVlw V/ v/ J*r 7*t Z> V^ . J3 


T TP9-Pr»1nnTV/fRtT iver 

KJ V^Z," V/\7lUlUYlCLLfI VC1 


3730 

J 1 DKI 


549171 


1 006 F06 or43 756575 

177VJ.lJ/WO.gii*TJ ^JOJ/J 


F 


MO0054O31P-F10 

1Y1UV/UJ*t7j a V/.JT Iv 


TTP9-Pn1nnA/letT iver 

KJ V-/Z. V^UILJIII VIC UL/1 VC1 


3731 

J 1 J L 


569550 


1006 O06 ar43 756585 

177U.\JUU.5Zi*TJ 


F 
r 


M00054041 r-no4 

IVlv/v/V/ J 4 t7 t T 1 Vx . VJ V/*T 


TTP9-Pn1nrTMetT iver 

KJ V^Zr~V/UIV7lllYlCLL/l VCl 


3732 


552566 


1996.M08.gz43 256615 


F 


M00054939D:E03 


UC2-ColonMetLiver 


3733 


473512 


1996.B13.gz43_256684 


F 


M00054927C:C03 


UC2-ColonMetLiver 


3734 


553535 


1996.N16.gz43_256744 


F 


M00054940D:E06 


UC2-ColonMetLiver 


3735 


555739 


1996.118.gz43_256771 


F 


M00054936B:G02 


UC2-ColoiiMetLiver 


3736 


873 


1996.N18.gz43i.256776 


F 


M00054941A:A08 


UC2-ColonMetLiver 


3737 


549805 


1996.119.gz43_256787 


F 


M00054936C:B08 


UC2-ColonMetLiver 


3738 


550782 


1 996.B20.gz43_256796 


F 


M00054928A:C11 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTEF 


SEQ NAME 


OKIE 
NT 


CLONE ID 


LIBRARY 


37^9 


562106 


1996 F20 ez43 256800 


F 


M00054933BC02 


UC2-ColonMetLiver 




481166 

TO J. X\J\J 


1006 P20 ez43 256810 


F 


M00054943AH11 


UC2-ColonMetLiver 


3741 


447585 

II / JOJ 


1996 121 ez43 256819 


F 


M00054936CG03 


UC2-ColonMetLiver 


3749 


561878 


2005 K03 ez43 256919 


F 


M00054954A*F12 


UC2-ColonMetLiver 

vy v*w vyv/JV/i-Li'XwiJwi. t wx 


3743 


554833 


2005 C07 ez43 256975 


F 


M00054945BG09 

x t jl vy vy vy «y f »y « »^ j * * vy 


UC2-ColonMetLiver 


3744 


126 

XX. V7 


2005 J08 ez43 256998 


F 


M00054952DA1 1 

1YJ.VV v/^n^>'*y A*iy #x -x. jl 


UC2-CoIonMetLiver 


3745 


4Q6870 


2005 110 ez43 257029 


F 


M00054952A-B 12 


UC2-ColoiiMetLiver 


3746 


555526 


2005 J12 ez43 257062 


F 


M00054953B:D07 


UC2-ColonMetLiver 


3747 


569053 


2005 A 14 ez43 257085 


F 


M00054943D *F 1 2 


UC2-ColonMetLiver 


3748 

-J / to 


450594 


2006 B02 ez43 257278 


F 


M00054961CH07 

XTxv \y v/ ^y ■ -X xy ju ^y * jw v/ # 


UC2-ColonMetLiver 


3740 


557780 


2006 P02 ez43 257292 


F 


M00054974CA04 

x ▼ jlv/ v/ vy f ^y # r\y »a r 


UC2-ColonMetLiver 


3750 


456705 

*t»7C7 / ~«7 


2006 A03 pz43 257293 


F 


M00054960CC03 

I" Xv v/v*/ i y vy \y • vy 


UC2-ColoiiMetLiver 


3751 


556Q54 


2006 E03 ez43 257297 


F 


M00054964AF09 


UC2-ColonMetLiver 


3759 

-7 / -J JL 


553091 


2006 G03 ez43 257299 


F 


M00054966A:C11 


UC2-ColonMetLiver 


3753 


494306 


2006 H03 cz43 257300 


F 


M00054966D:H12 


UC2-ColonMetLiver 


3754 


908278 


2006 103 ez43 257301 


F 


M00054967D:F08 


UC2^olonMetLiver 


3755 


504400 


2006 L03 ez43 257304 


F 


M00054970C:C08 


UC2-ColonMetLiver 


3756 


456903 


2006 H05 ez43 257332 


F 


M00054967A:D11 


UC2-ColonMetLiver 


3757 


500019 


2006 C06 ez43 257343 


F 


M00054962C:D02 


UC2-ColonMetLiver 


375R 

o / JO 


555304 


9006 D06 az43 257344 


F ! 


M00054963CC08 


UC2-ColonMetLiver 


3750 


503173 


2006 G06 az43 257347 


F 


M00054966A:D09 


UC2-ColonMetLiver 


3760 


509041 


2006 T06 az43 257349 


F 


M00054968AC1 1 

xtiw/v*/ i y vvx »■* ^ ju jl 


UC2-ColonMetLiver 


3761 


554370 


2006 K06 ffz43 257351 


F 


M00054969DD04 


UC2-ColonMetLiver 


3769 


555103 


9006 O06 o^z43 257355 


F 


M00054973BA10 


UC2-ColonMetLiver 


3763 


556193 


9006 T08 0^43 257382 


F 


M00054969AE05 

xt jl v/ v/ vy »y -y %y ^ x jw» ji 


UC2-ColonMetLiver 


37 64 


557491 


2006 B09 ez43 257390 


F 


M00054961D:H08 


UC2-ColonMetLiver 


3765 


450032 


2006 K09 ez43 257399 


F 


M00054969D:G11 


UC2-ColonMetLiver 


3766 


553904 


2006E10ffz43 257409 


F 


M00054964C*B07 

X t jl vy \y vy «y ^y vy i ^y * ■ vy # 


UC2-CoIonMetLiver 


3767 


556196 


9006 G1 0^743 257411 

/jUUU.VJ lU.gZiTJ ^.J/TXX 


F 


M00054966A<j02 


UC2-ColonMetLiver 


^76R 




9006 Ml 1 az43 257433 


F 

X 


M00054971CC11 


UC2-ColoiiMetLiver 


^760 


556559 


9006 PI 1 <y7-43 957436 


F 

X 


M00054974DC1 1 


UC2-ColoiiMetLiver 

x«y vyJw \y v*v/iiiTAwuwx • 


377H 


554594 


9006"M19az43 257450 


F 


M00054972B-E06 


UC2-ColonMetLiver 


3771 
Dill 


550047 


9006P19oz43 957452 


F 

X 


M00054975 A- C08 


UC2-ColonMetLiver 


^770 
D 1 I jL 




9006P15crz43 957500 


F 

X 


M00054975BB06 


UC2-ColoiiMetLiver 

V^ Vy>C* WV/JV/AJJi f AWULyj> T 


3773 


556740 


9006 A 17 07-43 257517 


F 

X 


M0005496 1 B D07 


UC2-ColoiiMetLiver 


3774 


531145 


2006.C17.gz43_257519 


F 


M00054963A:C12 


UC2-ColoiiMetLiver 


3775 


557883 


2006.F18.gz43_257538 


F 


M00054965C:C04 


UC2-ColonMetLiver 


3776 


470386 


2006.K18.gz43_257543 


F 


M00054970A.G03 


UC2-ColonMetLiver 


3777 


596152 


2006.M18.gz43_257545 


F 


M00054971C:H06 


UC2-ColonMetLiver 


3778 


556552 


2006J19.gz43_257558 


F 


M00054969B:F12 


UC2-ColonMetLiver 


3779 


560696 


2006.C20.gz43_257567 


F 


M00054963A.E02 


UC2-ColoiiMetLiver 


3780 


218113 


2006.K20.gz43_257575 


F 


M00054970B:B02 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


3781 


557559 


2006 L20 2z43 257576 




M00054Q71 A-r>04 


U V^Z~V^U1UIUVXCLXj1 VCl 


3782 


129715 


2006 B22 ffz43 257598 


T? 


M00054Q69 A -H07 


\J V^Z^V^UlUJUlVXCULrl VC1 


3783 


553976 


2006 C22 ffz43 257599 




M00054963 A -Crl 9 


T Tr* 9-PnlrrtiMf*tT iv<*r 

V-/^. V^OHJXXLVXCLXjI V wl 


3784 


561984 


2006 D22 ez43 257600 


F 


M000 54Q64 A -RO0 


U V^Z.*" v^rUIUXUVXCU-rl V CI 


3785 


497938 

~Z7 / -7 o 


2006 G22 or43 257603 


T7 

X 


M00054066DP05 

1VX V/ v V/ J*T j7\Jt>XJ . V^\J J 


V^Z _ v^UlUlXiVXCLX_/lVCl 


3786 


556951 

~J ^J\J Z* X 


2006 J22 az43 257606 


F 

X 


M00054969Pfr00 


\J wZr**\»^UlUJXIVXO tXw! vex 


3787 


553983 

~J *J ~J \j *J 


2006 N22 ff743 957610 


F 

X 


M00054979T> A 1 0 


U V-rZ-— V^UIUJXIVXCULwl VOX i 


^ / oo 


549570 


9006 P99 a^43 957619 


x^ 


1^/100054075^-1^08 


u v^z-v^uioiiivicix_/ivcr 


^780 

D 1 Oz7 


555135 


9006 R93 <rz43 957614 


1/ 


M0H05zl069R- A07 


U V^Z"-V^010I1IVXGIJL#I V CX 


^700 


471957 


9006 P93 a?43 95761 5 


T7 

r 


1V/10005 J.Q6^ A -MOA 


u v^z-v^uionivicLXji vcr 


3701 

O /57I 


5^7080 


9006 193 0-743 957699 

ZuUDJZJ.gZHj Z.J /OZZ 


r 


^000^4060^*1^0^ 
lVXU\JUJH"0^\_y.xxU*5 


u i^z*-\^oioiuvxeiJL»ivcx 


3709 

^ / yz. 


554643 


9006 M93 957695 

Z.UUU.IVXZO .gZ*rJ Z J / OZ.J 


F 


A/T0005ilQ7 1 Ti «F04 


T ^r^r^lnnTV/Tfsfl iT/f»i- 
U L^Z -x^OlOIHVXCLLvlVCr 


3703 


SSR707 

J JO / U / 


9007 AOl o«t4.3 957645 


F 


IV/f OOO^ 407 ^ P '"P09 


u v>-z-^oionivxcxx-/ivcr 


3794 

/ -7*-t 


557490 

«J_J / *TZ,U 


9007 M09 crr43 957673 




M0005400^T>F04 


U V^Z V^UlUIXLVXCLLfl V CX 


3705 


489500 


9007 A 03 cr743 957677 

ZUU / ,AVJ .gZ*T«? Z J IVJ 1 i 


r 


1^/100054075(^-006 


U V^Z-V^UIUIliVXt/LLfi VCX 


3706 


471 897 


9007 103 a7-43 957686 




M00054080R-P1 1 

IVxUUVJ*+5*027XJ.^ X I 


U V-/Z ~ V_/ U1UI11V1 C Llul V CX 


3707 


^56841 


9007 lc"03 o^43 957687 

ZUVJ 1 .IXAJO .gZHO Z.J I OO / 


r 


TV/rOOOS4Q01"R-F1 1 


u z-v^oioiiivie Li-fi vex 


3708 


507057 


9007 T 03 cr«743 957688 




1^/1000^40091^-1^ 1 9 


u v^z v^oioiiivxcxx-#ivcr 


3700 


556768 
J JO / oo 


9007 P04 a-74.3 957605 
z-uu / . v^u 4 r.gz 4 T.j zj /oyj 


F 


1^000^40781^ -T7 07 
IVXUUU JH7 / OV>. J1U / 


u v^z - v^oionivic u-fi v cr 


3800 


1 70450 


9007 FH4 o<z43 957608 

ZUU / ,JT U'+.gZTO ZJ / U70 


p 
r 


A/T0005408 f* • F05 


U V_y Z ~ V^UlUiliYlC IX-fl V CX 


jOvx 


555503 

JJJJ7J 


9007 TO/1 cry4^ 957701 


rr 


A/T000^4Q871 : l*r>06 

1YIUUUJ't70 /X^.X-rUO 


u v^z- v^uiunivxciJ-fi vex 


3809 


5568*39 


9007 T04 a^43 957709 


P 


M0005408QR-F04 


u \^z~v^uiuxllvxclxj1 vex 


3803 


551Q67 

J J I Z7\J I 


9007 A05 cr74^ 957700 

ZUU / .rWJJ ,gZ.*T J ZJ / / \JZ7 


P 


M00054076 A • A 0^ 

IVXUUU mJ^rzf 1 OrV./\UJ 


i u v>z*"V^uiuiiivxciJ-(i vex 


3804 


550814 


9007 T05 ar43 957717 

AUU / .1V7J .^Z<*T«7 Z«<^ till 


p 


M00054087R-rT05 


V^Z> V^LVlUXliVXCUL/X V CX 


3805 


55781 1 


9007 T.05 cr743 257790 


F 

X 


lvf00054992r)F06 


vj v^^r v>L/iv/xixvxei>xvx vex 


3806 


559580 


2007 J06 ^743 257734 


F 

X 


lV100054Q89r-r 1 9 


T TP9-Pr»1nnMpfT J ver 


3807 


554764 


9007 1ST06 ay43 957738 


F 


M00054005R-O1 9 
ivx\j u yj J*t 2/ 37 j x-> . \j x z 


U V-/Z~ V^UlUxlIVXCU-fi VOl 


3808 


554305 


9007 P06 o743 957740 


p 


M00054007H F0 1 

IVXUUU DtZfz? i U . JZ>U 1 


TTP9-Pn1nnA/Ti*tT ivf»r 

U V-/Z~v^UIUIXLVXCUL/l vox 


3800 


555630 


9007 WH7 957748 

Z,V/V/ / .Jnv/ / .gZ*r J / /Ho 


P 


TuT000^4086Pl -r>07 
IVXUUU J 570OX5. J-/U / 


vJ v_^z - VvUionivxouXji vex 


3810 

DO l\J 


556561 


9007 FH8 cr74^ 957761 


p 


1V/T000^4Q8 1 ri-P06 
IVXUUU D*r2f Oll/.V/UO 


T TP9-Pn1nnMptl ivpr 
\J V^Z*-V^U1UIXLVXCIXj1 vox 


381 1 


4544£3 


9007 TH8 o-y/T* 95776^ 


p 


Tur000^4087r** A 1 1 
IVXUUU J *r 27 o / V_/./YI I 


U \^Z"V^OIOIlIVXoll-rIVOX 


381 9 


454463 


9007 "K"08 <T-y4.^ 9 S 7767 


i f 

r 


iv/rooo^4oo i r 1 -T70 1 

IVXU UU3t , 57j7 X . XIU I 


u v^z-v^uiuixivxcix-fi vex 


3813 


569750 


9007 POO tr-yzt^ 95777^ 

ZUU / .V/U7.gZH-J ZJ / / / J 




IV/f OOO ^4Q78TI -TT00 

IVXUUU»> 4 TJ7 / OJ-/.XXU27 


T Tr*9-Pr»1nnM^*rT ivpr 

KJ V^Z»-V->UIUIliVXCLl-/l VOX 


3814 


560659 


9Hft7 1700 #rrr4^ 9^7778 


r 


lV4WifK408 ^ n -FIO^ 
IVxUUU o J x-7. J-/U J 


%J V^Z~V^OxUIIIVXCIXjI VOX 


381 ^ 


55361 5 


OOH7 TOO rrrJ.^ 957781 


p 


IVXUUU3*f 370 / V^.x> IZ 


iJV^Z - V^UIOIlIVXCUL»XVOX 


3816 


555512 


2007.F10.gz43 257794 


F 


M00054984AA05 


UC2-(^lonMetLiver 


3817 


556598 


2007.110.gz43_257797 


F 


M00054987C:G04 


UC2-CoIonMetLiver 


3818 


552188 


2007J10.gz43_257798 


F 


M00054990A:F10 


UC2-ColonMetLiver 


3819 


558670 


2007.All.gz43_257805 


F 


M00054976C:A03 


UC2-ColoiiMetLiver | 


3820 


452682 


2007.Mll.gz43_257817 


] F 


M00054994B:D11 


UC2-ColoiiMetLiver 


3821 


553915 


2007.Pll.gz43_257820 


F 


M00055000A:H10 


UC2-ColoiiMetLiver 


3822 


550886 


2007.C12.gz43_257823 


F 


M00054979A:H10 


UC2-ColoixMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


3R93 


S^3787 

J J mJ / O / 


2007 H12 ez43 257828 


F 


M00054986CD08 


UC2-ColonMetLiver 


1R94 


475624 


2007 112 ez43 257829 


F 


M00054987DD0 1 


UC2-ColonMetLiver 


^R95 


554212 


2007 012 sz43 257835 


F 


M00054997A-C02 


UC2-ColonMetLiver 


3R96 


5558R3 


9007P12ff743 257836 


F 


M00055000BB04 

IT XV/ V V «^ Vr V/ V/ • w ■ 


UC2-ColonMetLiver 


^R97 


45R957 


2007 B13 ffz43 257838 


F 


M00054977CA09 


UC2-ColoiiMetLiver 


^R9R 


555197 


2007 D13 <^z43 257840 


F 


M00054980DH07 


UC2-ColonMetLiver 




553739 


2007 P13 ez43 257852 


F 


M00055000BF02 


UC2-ColonMetLiver 


3R30 


556416 


2007B14o-z43 257854 


F 


M00054977CB08 


UC2-ColonMetLiver 


^R31 
JO J X 


501056 


2007 B15 ez43 257870 


F 


M00054977C:D08 


UC2-ColoiiMetLiver 


**R39 


559437 

JJZ>*tJ / 


2007 E15 ffz43 257873 


F 


M00054982CA02 


UC2-ColoiiMetLiver 


JOJJ 


555658 

J JJUJO 


2007 LI 5 ez43 257880 


F 


M00054993BH06 

1TX \J \J V ■ ^ */A^«XXW 


UC2-ColoiiMetLiver 


3R34 


476739 

*T / vJ / JZ> 


2007 015 ffz43 257883 


F 


M00054997AG11 


UC2-ColonMetLiver 


jOjJ 


5566Q0 


2007 B16 ^743 257886 


F 


M00054977CD12 


UC2-ColoiiMetLiver 




55761 5 

J J /U1J 


9007 HI 6^743 257892 


F 


M00054986DD11 


UC2-ColonMetLiver 


7R^7 


54^490 


9007 017^743 257907 


F 


M00054985CB12 


UC2^olonMetLiver 


^r^r 


S547Q3 

JJT/7J 


9007 HI 7*7743 257908 


F 


M00054986D:E03 


UC2-ColoiiMetLiver 


JOJ7 


556^06 


9007 T17 0743 957909 


F 


M00054988CB08 


UC2-ColoiiMetLiver 


JO^fl/ 


jjOZoo 


9007 T17cr743 9^7910 


X 


M00054990C-A08 

1 VX v/ v/ V/ »J "T ✓ ^ V/ . jtx v/ \J 


UC2-ColonMetLiver 


^R41 


JJZvJJ 


9007 T 17 0743 957919 


F 


M00054993CC10 

XV XV/ V/ V/ ' "T ^> • V- / X V/ 


UC2-ColoiiMetLiver 




j'tyj 10 


9HH7 P17 rr-74^ 9570T6 

^\JU / .x X / .gZtJ ^^J / f XvJ 


F 

X 


M00055000CF08 

lVXv/ V/ V7— / Jv/uuv/ . x v/ vJ 


UC2-CoIoiiMetLiver 

V-/ VvV/XV/XXXTAVInJUX T vl 


^R4^ 


^fil 119 
JD1 1 1^ 


9007R1RO743 957918 

Z»vlVJ / . XJ X O .£jZHJ jLj / 710 


F 


M00054977C F09 


UC2-ColonMetLiver 


JO*rf 


557054 

J J / UJ*T 


9007T1Ra743 957919 


F 


M00054979BG12 


UC2-ColonMetLiver 


3R45 


4R3410 

to J*tXv/ 


2007D1Rfr 7 43 257920 


F 


M00054981BB09 


UC2-ColonMetLiver 




5QR5RQ 


2007 G18 ez43 257923 


F 


M00054985C:D04 


UC2-ColonMetLiver 


^R47 


55Q0Q6 


2007O18ez43 257931 


F 


M00054997B:F10 


UC2-CoIonMetLiver 


3R4R 


556759 


2007 119 ez43 257941 


F 


M00054988C:G02 


UC2-ColoiiMetLiver 


^R4Q 


555977 
jjjl / / 


2007K19az43 257943 


F 


M00054992AC08 


UC2-ColoiiMetLiver 


^R50 


551 604 

J J X 


2007 R20 pz43 257950 


F 


M00054977DB06 


UC2<k)loiiMetLiver 


^R^l 


569655 

JUZ.UJJ 


9007 F90 <7743 957953 


F 


M00054983AF08 


UC2-ColonMetLiver 


^R59 


559053 

JJ7UJJ 


2007 H20 az43 257956 


F 


M00054987AA09 


UC2-ColonMetLiver 


JOJJ 


556286 


2007 J20 2z43 257958 


F 


M00054990DA06 


UC2-ColoiiIV[etLiver 


^R54 


5517QR 
j j 1 / 70 


2007 K20 ffz43 257959 


F 


M00054992AD1 1 


UC2-ColonMetLiver 


JOJJ 


54Q9 1 4 


9007 T\T90 <7743 957962 


F 


M00054996AD 1 0 


UC2-ColonMetLiver 


Jo JO 


46790^ 


9007 D91 0743 957968 


F 

X 


M00054981BH12 


UC2<^oloiiMetLiver 


Joj / 


5034^9 

JUJ*tJZ 


9H07 T91 <7743 9^7974 


F 


M00054990DF04 


UC2-ColoiiMetLiver 


3858 


556019 


2007.L22.gz43_257992 


F 


M00054993C:G12 


UC2-ColonMetLiver 


3859 


560748 


2007.G23.gz43_258003 


F 


M00054985D:E03 


UC2-ColoiiMetLiver 


3860 


562323 


2007X23 .gz43_258008 


F 


M00054993C:H05 


UC2-ColoiiMetLiver 


3861 


2284 


2007.A24.gz43_258013 


F 


M00054976D:F06 


UC2-ColonMetLiver 


3862 


481243 


2007.D24.gz43_258016 


F 


M00054981C:A11 


UC2-ColonMetLiver 


3863 


555958 


2007.N24.gz43_258026 


F 


M00054996B:C11 


UC2-ColonMetLiver 


3864 


560984 


2007.O24.gz43_258027 


F 


M00054997C:H03 


UC2-ColoiiMetLiver 
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SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


OKIE 
NT 


CLONE ID 


LIBRARY 


3865 


555967 


2008.N0 Lgz43 258044 


F 


M00055016DC07 


UC2-ColonMetLiver 

V>Xj v> V^x Wixi YXw LX—r X V wl 


3866 


556488 


2008 P01 ez43 258046 


F 


M00055019BB11 


T TC?-r , oIonMei"T ,i ver 

' V^^u. V/UlUlliVXvLlJX V wl 


3867 


553064 


2008.A02.gz43 258047 


F 


M00055001AB10 


UC2-ColoijMetLiver 


3868 


552006 


2008D02ez43 258050 


F 


M00055005AC07 


T JC^-ColonA/fetT ,ivpr 


3869 


559676 


2008E02ez43 258051 


F 


M00055006AD01 


V_/ \_^^ V> vXVJJJLXVxw LX— f X V wX 


3870 


557308 


2008J02.gz43 258056 


F 


M00055011B F09 


UC2-ColOxxMetLiver 

\J V^/ VXVJXXX TXw LX— f X V vl 


3871 


549723 


2008.N02.gz43 258060 


F 


M00055016DC11 


UC2-ColonMetLiver 

\J ^ VUlwXLLVXv LJ — /X V vl 


3872 


549912 


2008.B03.gz43 258064 


F 


M00055002BE07 


UC2^olonMetLiver 

V VyUlUlUtlw UL-/J. V t/X 


3873 


551444 


2008.G03.gz43 258069 


F 


M0OO55008CFO9 


UC2-ColoiiMetLiver 


3874 


557240 


2008 P03 ez43 258078 


F 


M00055019BH10 

XTX W V/ X ^ XJ .X XX Vr 


UC2-CoIonMetLiver 


3875 


559011 


2008.C04.gz43 258081 


F 


MOOO550O3C:EO8 


UC2^oloixMetLiver 


3876 


555701 


2008.D04.gz43 258082 


F 


M00055005AE02 


UC2-ColonMetLiver 


3877 


556115 


2008.P04.gz43 258094 


F 


M00055019C:C01 


UC2-ColoiiMetLiver 


3878 


556042 


2008.B05.gz43 258096 


F 


MOO055002B:GO3 


UC2-ColonMetLiver 


3879 


558212 


2008.E05.gz43 258099 


F 


M00055006B:A09 


UC2-ColoiiMetLiver 


3880 


554932 


2008.G05.gz43 258101 


F 


M00055008C:H10 


UC2-ColonMetLiver 


3881 


548858 


2008.J05.gz43 258104 


F 


M0005501 1CE04 


UC2-ColonMetLiver 

V V^X>r ^/vlVUlTlVlXJX V VX 


3882 


605117 


2008 L05 ez43 258106 


F 


M00055014BE08 


UC2-CoIonMetLiver 

w v^x# v_/ yj x v/xxx vx v i.i ix v vi 


3883 


560525 


2008 M05 ez43 258107 


F 


M00055015DC09 


UC2-ColoiiMetLiver 

\J V^^. VyUlv/XUVXvlXJX V VJL 


3884 


597780 


2008 106 ez43 2581 19 


F 


M00055010C-C10 


KJ WUlUlUVXv 1.1 iX V vl 


3885 


411113 


2008 N06 ez43 258124 


F 


M00055017A* Al 1 


wV/A>^>/Ult'lllVXWlXJXVvX 


3886 


552674 


2008 E07 2z43 258131 


F 


M00055006BC08 


\J V/UlUXUVXiyLX^X V Vl 


3887 


495942 


2008 O07 ez43 258141 


F 


M00055018CC06 


UC2 -ColonMetLiver 

V/V^X> V>UlvXUTXVULvX V VX 


3888 


34381 


2008 F08 cz43 258148 


F 


M00055007CDO 1 

lil\/V/Ui/>/vu / \_/*X^W X 


UC2-ColonMetLiver 

\J V/UIvXIXtXwUJX V vl 


3889 


549195 


2008 H08 ez43 258150 


F 


M00055009CF06 


UC2-ColonMetLiver 

V>X« WlvlXLTXV U /X V wl 


3890 


554084 


2008 L08 ez43 258154 


F 


M00055014C-F05 


UC2-ColoiiMetiLiver 

\y Vv1w1UT1VU>>1 V VX 


3891 


557121 


2008 P08 ez43 258158 


F 


M00055019C-G06 


UC2-rolonMetLiver 

V' V^^r V/vlV/XllVXV 1.1 ^X V Vl 


3892 


554693 


2008 A09 cz43 258159 


F 


M00055001AG05 


UC2-CoIoTiiVretLiver 

\J VyVJlvlll'lw t>l ./I V Vl 


3893 


134237 


2008 C09 ez43 258161 


F 


M00055003DF09 

1TXV \/V*/a/w\la/X/.X \J *r 


UC2-ColcmMetLiver 


3894 


594994 


2008 H09 £z43 258166 


F 


M0005 5009C *F07 


W *^^<^-/W1W111tXWU1_/1 Vvl 


3895 


555371 


2008 109 ez43 258167 


F 


M00055010DA02 


UC2-ColoiiMetLiver 

V^Xi* VrVlvlllTAVUUl V VX 


3896 


347057 


2008 E10 ez43 258179 


F 


M00055006BE05 


T JC2-ColonMetLiver 

V-/ Z-r WlVlUTlVtAJl V VX < 


3897 


558378 


2008H10ez43 258182 


F 


M00055009DD12 


i UC2-ColonMetLiver 


3898 


497086 


2008K10az43 258185 


F 


M00055013BB07 


UC2-ColonMetLiver 


3899 


554379 


2008 Bll ez43 258192 

Xrf\/ \^ V/ - * # X X ~ r~j ~ X^«^ Vf X X^ 


F 


M00055002DA07 


UC2-ColonMetLiver 


3900 


451118 


2008X1 Lgz43_258202 


F 


M00055014D:A11 


UC2-CoIonMetLiver 


3901 


557059 


20O8.Mll.gz43_258203 


F 


M00055015D:G05 


UC2-ColonMetLiver 


3902 


604179 


2008.Oll.gz43_258205 


F 


M0O055018D:DOl 


UC2^olonMetLiver 


3903 


556487 


2008.Pll.gz43_258206 


F 


M00055019D:B02 


| UC2-ColonMetLiver 


3904 


562550 


2008.B12.gz43_258208 


F 


M00055002D.E04 


UC2-ColonMvtLiver 


3905 


556530 


2008.L12.gz43_258218 


F 


M00055015A:C08 


| UC2-ColonMetLiver 


3906 


524897 


2008.O12.gz43_258221 


F 


M0O055018D:EO5 


UC2<^loiiMvtLiver 
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TD 
NO 




<?FO NAMF 


ORIE 
NT 


CLONE ID 


LIBRARY 


3907 


480960 


ZUUo.r 13.gz43_z3oZZo 


r 




T Tr^9_Oo1r»riA7TfatT i\«=»r 

u ^z-^oionivioi j-fivcr 


3908 


560563 


2008 .Crl 3 .gz43_23 ©229 


T7 

r 


JVlUUu J J UU o J,/ . JT U J 


u^z-i^oioiuvi6iJ-/ivcr 


3909 


r f f CIO 

555512 


2008.113.gz43_23o231 


r 


JV1UUUj3U 1UJ-/.J-/U0 


TTr , 9-Pn1nnA/rptT iv^r 
U ^Z^l^OiOIHVlClljlVCr 


3910 


143218 


2008.L13.gz43_25o234 


T7 

r 


Tv>innn^ ^ n i ^ A '"pha 

JYIU UU J Z> U 1 D A . Ej\3 £ * 


U — OIUILLVIC LLfl V CI 


3911 


556616 


2008. N 13.gz43 25823o 


r 


JVIUU03DU1 /v^.l^lZ 


u i^/Z - \^ oi onivic lLt l v cr 


3912 


552613 


2008.A14.gz43_Z3 8239 


17 

r 


IV1UUU30UU 11j.Jlj.U D 




3913 


r r r SCO 

555658 


2008 .114 ,gz43_25 8247 


r 


JVIUUU3 3 U 1 \JU . JuiU o 


u ^Z"v>oioiuvicxi-fivcr 


3914 


516018 


2008.K14.gz43_23o249 


r 


JVLUUU3 D v 1 ^ Ij .nu 1 


TTf^9-i^nlrm1V/1>tT ivf»r ' 


3915 


604859 


2008.M14.gz43_25825 1 


r 


iVIUUU J 3 U I O A. liU4 


U V^Z-^OlOIliVLo U-flVCi 


3916 


553967 


2008.A15.gz43_258233 


T? 

r 


A/Tfinn^nn i A 1 1 




3917 


600021 


2008 .115 ,gz43_2582o3 


T? 

r 


lVlUUU33UlUJ-/.r i 1 




3918 


437580 


2008.Blo.gz43_23o2 /2 


r 


1VIUUU3 3UU3A.U 1 1 


U \^^-\^U1UIjLLV1CL1_»1 VOX 


3919 


556616 


2008.D16.gz43_25 8274 


T? 

r 


JV1UUU3 3UU3 . IjU4 




3920 


551268 


2008.Ulo.gz43_23o2o3 


TJ 

r j 


JYIUUU33U 1 o J-/. JT1U3 




3921 


335714 


2008. A17.gz43_25 8287 


T7 

r 


AjrnnfK^nni r**^! n 

jVIUUUD D UU 1 \^ .Vs 1 u 




3922 


460445 


/>aao TV 1 -"T A ? 1COOAA 

2008 .D 1 7.gz43_25 829 U 


17 

r 


JVlUUU33UU3U.J3Uo 


\J V^Z - V^UlUIliVlCLL/lVCr 


3923 


362109 


2008 .1 17 .gz4 J 25 8295 


T7 

r 


A/rnnn^^ni l a -til 1 

JVIUUUD3U1 1A.D1 1 


T Tr"* 9 -f*r\] nn A/f ofT t\/f»r 
U l^/Z *-V^OIOIllVlC LLfl VCI 


3924 


556320 


2008 J 1 7 .gz43_25 8296 


17 

r 


JVIUUU3 3 U I ZJt> .A1U 


U \^Z-\^010I11VICLL»1VCI 


3925 


931 


/>> AAO T IT _ >| *i TCOTf\0 

2008.L17.gz43_258298 


X? 

r 


JV10U03 3 U 1 3 1> . AU4 


U ^Z-I^OlOlliVlCtljlVcr 


3926 


494300 


200 8 .M 1 7.gz43_25 8299 


T7 

r 


1V1UUU3 3 U 1 ox$ . JiUZ 


U \^Z-V^OlOIllVICU-flVCr 


3927 


453908 


2008. N 17. gz43_23 8300 


T7 

r 


JVIUUU33U1 /U.JL/U3 


U V^^— V^UIUI11Y1CLX-/1VC1 


3928 


C A C\/T C\t\ 

549690 


2008.A18.gz43_23o3U3 


17 

r 


1V1UUU3 3 UU 1 . vJU / 


U V^UlUiUVlCLL/l V CI 


3929 


555949 


2008 .B 1 8 .gz43_23 83U4 


T7 

r 


XVlUUU33UUoiV.vJU*t 


TTP9-T , n1rmM>fT ivpr 

U ^^**V^UlwIMVlCtl-/I VCI 


3930 


555289 


2008.D18.gz43_2583Uo 


17 

r 


JVlUUU33UU3L/.v^l 1 . 


U ^^-V^UIUIliVlCLL^l VCI 


3931 


550497 


2008. r 18.gz43_23o3Uo 


17 

r 


JYIUU U 3 3 UU o X5 . xlU o 


TTP9-T , n1r»TilV/ff*tT ivpr 

^^~^01VJIUVJ.C ULrl V CI 


3932 


559464 


2008.11 8. gz43_25 83 1 1 


17 

r 


A>fAAA^<iA1 1 A*i^AA 
1VIUUU33U 1 1 A.LAK) 


U V_>U1UIU VICLLfl V CI 


3933 


601219 


2008.B19.gz43__258320 


17 

r 


A>f AAA^^AA"2 A *XJA1 

JVIUUU3 3 UU3 A.rlU 1 


U \^AJ1UIIIV1CLL/1 VCI 


3934 


522762 


2008 .F 1 9.gz43_25 8324 


17 

r 


A A A A A ^ < A A QT5 • ¥7 A Q 
JV1U UU3 3 U U ol3 . Jr U j 


ij V-'Z** v^uiuinvicLi^i v ci 


3935 


558530 


2008.F20.gz43_258340 


17 

r 


A>f AAA^<AAQl>-t7AA 

JVlUUU33UUox5.r U4 


T T/~* r J_f^r%\ nn A/f ^fT « rp»r 
U V^Z, ~ VxUlUIHVlC LLfl V CI 


3936 


551656 


2008 .H20 .gz43__25 8342 


17 

r 


A/fAAH^^AI A A *TJA^ 
JVIUUU33U lUA.rlU3 


U V^Z ~\_/ vJIUIHVIC UL/l V CI 


3937 


554245 


2008.N20.gz43258348 


T7 

! r 


AA"AAA<^A1 7A-Unfi 

JVIUUU33U1 lAJ.rXjo 


U V^Z.—^/UlLlIllVlC LLfl V CI 


3938 


562142 


2008.P20.gz43_258350 


T7 
i I 4 


A4AAA<K AOA A ./^.l 1 

JVLUUU33UZUA.<jrl 1 


U \^^-V^UIUIIIVl.CLL#l VCI 


3939 


555639 


r\r\a T^'X 1 - yi o ^>COOCO 

2008 .B2 1 .gz43_25 8352 


r 


A A"A AA< < A (VI U -TJ AT 

lVlUUU33UU3x5.1iU / 


TTi^9-i^n1rmA/fpTT ivpr 


3940 


388688 


2008.12 1 .gz43_258359 




A>fAAA<^A1 1 A *17AA 

MUUU33U1 lA.r Uo 


U "v^L/l L/UiVICLLfl VCI 


3941 


476438 


r\ rv AO TO 1 - >f> OCOZA 

2008 . J2 1 ,gz43_25 83 60 


r 


AyfAAA^^AI OTJ»XJ1 0 

MUUU33U 12r>.rll2 


L>Z, - V/UHJI1IVIC LL*1 V CI 






2008 L21 ez43 258362 


F 


M00055015B:E09 


UC2-ColonMetLiver 


3943 


557308 


2008.M21.gz43_258363 


F 


M00055016C:G04 


UC2-ColonMetLiver 


3944 


553523 


2008.D22.gz43_258370 


F 


M00055005D.G04 


UC2-ColonMetLiver 


3945 


460169 


2008.H22.gz43_258374 


F 


M00055010B:A06 


UC2-ColonMetLiver 


3946 


558744 


2008.B23.gz43 258384 


F 


M00055003B:G11 


UC2-ColoiiMctLiver 


3947 


495984 


2008.123.gz43_258391 


F 


M00055011A:G12 


UC2-ColonMetLiver 


3948 


497513 


2008.H24.gz43_258406 


F 


M00055010B:C11 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTEB 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


3949 


557910 


2017.B02.gz43_258434 


F 


M00055020D:G06 


UC2-ColonMetLiver 


3950 


557603 


2017.102.gz43_258441 


F 


M00055029C-.D05 


UC2-ColonMetLiver 


3951 


523495 


201 7.M03 .gz43_25 846 1 


F 


M00055034B:G04 


UC2-ColonMetLiver 

%/X/Xv/lJUlTXWiXJX V Wl 


3952 


390017 


2017.H04.gz43J258472 


F 


M00055028C:D05 


UC2-ColonMetLiver 


3953 


610986 


2017.B05.gz43_258482 


F 


M00055021A:F08 


UC2-ColonMetLiver 


3954 


551884 


20 1 7.E05 .gz43_25 8485 


F 


M00055024D:E09 


UC2-ColonMetLiver 


3955 


559447 


2017.E06.gz43_258501 


F 


M00055024D:F07 


UC2-ColonMetLiver 


3956 


322123 


2017.F06.gz43 258502 


F 


M00055026AG08 


UC2-ColonMetLiver 


3957 


556258 


2017.H06.gz43_258504 


F 


M00055028DA08 


UC2-ColonMetLiver 

Vy V^^rf V/Wl will T A V LXil ▼ Vl 


3958 


561279 


2017.K06.gz43_258507 


F 


M00055032B:D08 


UC2-ColonMetLiver 


3959 


550815 


2017.E07.gz43J258517 


F 


M00055024D:G09 


UC2-ColonMetLiver 


3960 


561935 


2017.D08.gz43 258532 


F ! 


M00055023CG11 


UC2-ColonMetLiver 


3961 


557733 


2017J08.gz43_258538 


F 


M00055031B:E05 


UC2-ColonMetLiver 

W w^rf V/ViviU"lvl'XVl V vl 


3962 


556852 


2017.N09.gz43 258558 


F 


M00055035DE07 


UC2-ColonMetLiver 

V' V^^w WIVUITXVUJI V Vl 


3963 


557049 


2017D10ez43 258564 


F 


M00055023DG02 


UC2-ColonMetLiver 

V_^X- vUlvillTi,VllJl V vl 


3964 


560250 


2017.E10.gz43 258565 


F 


M00055025A\D11 


UC2-ColonMetLiver 


3965 


560294 


2017.H10.gz43 258568 


F 


M00055028DE10 


UC2-ColonMetLiver 


3966 


481930 


2017 Dll ez43 258580 


F 


M00055024AB04 


KJ v^ vUll/JiiVXv 1 1 " V vl 


3967 


550986 


2017Ellez43 258581 


F 


M00055025BA03 


UC2-ColonMetLiver 

V>UlUlliTj.vUUl V vl 


3968 


560621 


2017Hllez43 258584 


F 


M00055029AA02 


T jr'2-ColonMetLiver 

\y vUlUlUTj,b V (I V vl 


3969 


549599 


2017111 sz43 258585 


F 


M00055030BE03 

ITXVVU*/ X/ V/ X-/ «XVVft/ 


UC2-ColonMetLiver 


3970 


558004 


2017 Jll az43 258586 


F 


M0005503 IB JI08 


UC2-ColonMetLiver 


3971 


562590 


2017.L12.gz43 258604 


F 


M00055033B:H01 


UC2~ColonMetLiver 


3972 


558755 


2017.N12.gz43 258606 


F 


M00055035DF09 


UC2-ColonMetLiver 


3973 


556458 


2017.M13.gz43 258621 


F 


M00055034DA10 


UC2-ColonMetLiver 


3974 


450059 


2017.B14.gz43 258626 


F 


M00055021B.G09 


UC2-ColonMetLiver 

V-/ w ■^— r VV1V111 "XV«X/1 ¥ vSl 


3975 


557353 


2017.C14.gz43 258627 


F 


M00055022CB12 


UC2-ColonMetLiver 


3976 


560898 


2017.D14.gz43 258628 


F 


M00055024AC07 


UC2-ColonMetLiver 

V-/ X— X^rf wV/lv/lXXTXvtXvl ▼ X/X 


3977 


461850 


2017 114 ez43 258633 


F 


M00055030BH07 


UC2-ColonMetLiver 


3978 


552848 


2017J14ez43 258634 


F 


M00055031CB11 


UC2-ColonMetLiver 

V-^ \>ViVlll"XVl*Xi/l T Wl 


3979 


552891 


2017F16ez43 258662 


F 


M00055026CE07 


UC2-ColonMetLiver 


3980 


454483 


2017.G16.gz43 258663 


F 


M00055027D:E08 


UC2-ColonMetLiver 


3981 


508671 


2017 D18 sz43 258692 


F 


M00055024BE02 


UC2-ColonMetLiver 


3982 


456183 


2017N18ez43 258702 


F 


M00055036AH10 

lTXVvv«/%/ V«/\/i X*XXX V/ 


UC2-ColonMetLiver 


3983 


557578 


2017 018 ez43 258703 


F 


M00055037CD01 


UC2-ColonMetLiver 

V*f/ X— ^Xrf W VXVXJLL TXVIXJX V VI 


3984 


561868 


2017.D19.gz43 258708 


F 


M00055024B:E03 


UC2-ColonMetLiver 


3985 


556856 


2017.F19.gz43_258710 


F 


M00055026D:E12 


UC2-ColonMetLiver 


3986 


557010 


2017.L19.gz43_258716 


F 


M00055033D:D03 


UC2-ColonMetLiver 


3987 


529356 


2017.D21.gz43_258740 


F 


M00055024B:E06 


! UC2-ColonMetLiver 


3988 


557883 


2017.L21.gz43_258748 


F 


M00055033D:G01 


UC2-ColonMetLiver 


3989 


549114 


2017.B22.gz43_258754 


F 


M00055021D:F06 


UC2-ColonMetLiver 


3990 


556933 


2017.D22.gz43_258756 


F 


M00055024B:F02 


UC2-ColonMetLiver 
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SEQ 
ID 
NO 


CLUSTEE 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


3991 


494314 


2017.F22.gz43_258758 


F j 


M00055027A:E04 


UC2-ColonMetLiver 


3992 


408586 


2017.M22.gz43_258765 


F 


M00055035B:D11 


UC2-ColonMetLiver 


3993 


460078 


2017.L23.gz43_258780 


F 


M00055034A:G01 


UC2-ColonMetLiver 


3994 


561836 


201 7.N23 .gz43_258782 


F 


M00055036B:H02 


UC2-ColonMetLiver 


3995 


558185 


2043 .D03 .gz43_259228 


F 


M00055222D:B11 


UC2-ColonMetLiver 


3996 


550632 


2043.F03.gz43_259230 


F 


M00055225C:E08 


UC2-ColonMetLiver 


3997 


551677 


2043 . J03 .gz43_25 9234 


F 


M00055230A:H01 


UC2-ColonMetLiver 


3998 


451720 


2043 .N03 .gz43_25923 8 


F 


M00055236C:E05 


UC2-ColonMetLiver 


3999 


559447 


2043 X05 .gz43_259268 


F 


M00055233C-.F09 


UC2-ColonMetLiver 


4000 


558230 


2043.P05.gz43_259272 


F 


M00055239B:C05 


• UC2-ColonMetLiver 


4001 


554627 


2043 .A06.gz43_259273 


F 


M00055219B:B04 


UC2-ColonMetLiver 


4002 


558422 


2043.N06.gz43_259286 


F 


M00055236D:E04 


UC2-ColonMetLiver 


4003 


558679 


2043 .G08 .gz43_2593 1 1 


F 


M00055226C:H05 


UC2-ColonMetLiver 


4004 


491240 


2043.F09.gz43_259326 


F 


M00055225D.G11 


UC2-ColonMetLiver 


4005 


550815 


2043 .H09 .gz43_259328 


F 


M00055227C-.D02 


UC2-ColonMetLiver 


4006 


481362 


2043.P10.gz43_259352 


F 


M00055239C:F08 


UC2-ColonMetLiver 


4007 


467710 


2043 .C 1 1 .gz43_259355 


F 


M00055221D:H08 


UC2-ColonMetLiver 


4008 


557965 


2043.B12.gz43 259370 


F 


M00055220D:G07 


UC2-ColonMetLiver 


4009 


380127 


2043.C12.gz43 259371 


F 


M00055222A:A06 


UC2-ColonMetLiver 


4010 


374281 


2043.M12.gz43_259381 


F 


M00055235B:C01 


UC2-ColonMetLiver 


4011 


464067 


2043 . C 13 .gz43_25 93 87 


F 


M00055222A:C08 


UC2-ColonMetLiver 


4012 


552521 


2043.D13.gz43__259388 


F 


M00055223B:C04 


UC2-ColonMetLiver 


4013 


552019 


2043.P 13 .gz43_259400 


F 


M00055239D:C01 


UC2-ColonMetLiver 


4014 


556790 


2043.A14.gz43_259401 


F 


M00055219C:C10 


UC2-ColonMetLiver 


4015 


557411 


2043.K14.gz43_25941 1 


F 


M00055232C.D04 


UC2-ColonMetLiver 


4016 


378459 


2043.L15.gz43_259428 


F 


M00055233D.G11 


UC2-ColonMetLiver 


4017 


550571 


2043.H16.gz43_259440 


! F 


M00055227D:E07 


UC2-ColonMetLiver 


4018 


557676 


2043 .P 1 6.gz43_25 9448 


F 


M00055239D:G04 


UC2-ColonMetLiver 


4019 


558652 


2043.A17.gz43_259449 


F 


M00055219C:H06 


UC2-ColonMetLiver 


4020 


559053 


2043J17.gz43_259457 


F 


M00055229C:D01 


UC2-ColonMetLiver 


4021 


552086 

*—r V/ \J \J 


2043.C18.gz43_259467 


F 


M00055222B:A01 


UC2-ColonMetLiver 


4022 


552430 


2043.L18.gz43_259476 


F 


M00055234A-.G12 


UC2-ColonMetLiver 


4023 


562317 


2043 .P 19.gz43_259496 


F 


M00055240A.B07 


UC2-ColonMetLiver 


4024 


543540 


2043 122 ez43 259537 


F 


M00055229D:D07 


UC2-ColonMetLiver 


4025 


489275 


2043.F23.gz43_259550 


F 


M00055226CA.il 


UC2-ColonMetLiver 


4026 


450193 


2043.M23.gz43_259557 


F 


M00055236B.C07 


UC2-ColonMetLiver 


4027 


557656 


2043.O23.gz43_259559 


F 


M00055238D:D10 


UC2-CoIonMetLiver 


4028 


559764 


2043.D24.gz43 259564 


F 


M00055223D:H03 


UC2-ColonMetLiver 


4029 


560717 


2043.E24.gz43 259565 


F 


M00055225B:H01 


UC2-ColonMetLiver 


4030 


552249 


2043.P24.gz43 259576 


F 


M00055240B:E12 


UC2-ColonMetLiver 


4031 


556829 


2044.G02.gz43 259603 


F 


M00055248D:B12 


UC2-ColonMetLiver 


4032 


558093 


2044.A03.gz43 259613 


F 


M00055240C:A03 


UC2-ColonMetLiver 
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SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


4033 


616134 


2044.B03.gz43 259614 


F 


M00055241DF10 


UC2-ColonMelLiver 


4034 


558355 


2044 E03 ez43 259617 


F 


M00055245DA04 


UC2-ColonMetLiver 

w Vy Vy VJ 1 vJU-L VX vy UI—/A V t/1 


4035 


558785 


2044 J03 ez43 259622 


F 


M00055252BD07 


V7 Vy -<0 Vy KJ 1 V/AAA VX vy l/Xy/A V vl 


4036 


607430 


2044 F04 ffz43 259634 


F 


M00055247BA1 1 


T TP?-Polnn'M'ptT .iver 

VJ Vy/- Vy vJIvJiAlVXvy LX-/1 V VvA 


4037 


558512 

' *y Vy J- A 


2044 L04 ez43 259640 


F 


M0005525 5BF05 


UC2-ColonMetLiver 

Vy V/A. Vy \JX V/LIX VX w ULyA V VyA 


4038 


488574 


2044 N04 ez43 259642 


F 


M00055257DA1 1 

J- ▼ vy vy vy »y / xy .iii. X 


UC2-ColonMetLiver 

W Vyy~* Vy V7AV/A AATXvy ULyA V W 


4039 


558098 

w» «y V7 <' \J 


2044 P04 ez43 259644 


F 


M0005 5260B ■ A05 

i"x vy vy vy y y vy \y J-y .1 i.v- ' 


UC2-ColonMetLiver 

W Vy At Vy V A V/ AAA » AVv Ulyl V vl 


4040 


557389 


2044 F05 ez43 259650 

A»V/~~.X V/.y 'f_t A«y y Vy^y \J 


F 


M0005 5247CB0 1 


T JC 2-Cnl on MetT A ver 

V/ Vy j1» Vy VJAUAAAVXvy LXy 1 V vl 


4041 


557928 


2044 K05 sz43 259655 


F 


M00055254AE02 

J.TJLV vy vy ^ i<y Jy vm 


UC2-CoIoiiMetLiver 

V VyZ» Vy V/AV7AXAVX VlXyl V VyA 


4042 


551305 


2044 E06 az43 259665 


F 


M00055245DD01 


UC2-ColoiiMetLiver 

Vy VyA Vy V A Vy AAA »AV Lx—f A V VyA 


4043 


504415 


2044 H06 sz43 259668 


F 


M00055250BA05 


UC2-ColonMetLiver 

vy \y«< vyuivuiixvvui v vi 


4044 


549052 


2044 106 ez43 259669 


F 


M00055251CD01 


UC2-C^lonMetLiver 

Vy Vyi-r VyvyiVVAUTXVIiAJl V Vl 


4045 


138927 


2044 006 ez43 259675 


F 


M00055259AG06 


UC2-ColoixMvtLiver 

\-y vy*< vyviviuTxwuyi t va 


4046 


554246 


2044 N07 ez43 259690 


F 


M00055257D*E02 


UC2-ColoiiMetLiver 

Vy vy*< VyVlvlllTJ.WUUl V vl 


4047 


159419 


2044 J08 ez43 259702 


F 


M00055252CE02 


UC2-ColcmMetLiver 

vy vy*< vy v/ivyiu.Txwuxyi v vx 


4048 


460049 


2044.P08.gz43 259708 


F 


M00055260B:H06 


UC2-ColoiiMetLiver 


4049 


560317 


2044 D09 ez43 259712 


F 


M00055244DC07 


UC2-ColoiiMetlyiver 

vy va vyvivuiTxvuyi v vx 


4050 


560317 


2044 E09 ez43 259713 


F 


M00055246AC01 


UC2-ColonMetLiver 


4051 


552357 


2044 H09 ffz43 259716 


F 


M00055250B-G09 


TJf t 2-ColonMetLiver 

Vy VyA Vy v/AVJAXLVXvULyx V vl 


4052 


560144 


2044 P09 ez43 259724 


F 


M00055260CA1 1 


Vy VyA Vy V/J. v/XIX *X v ULv A V vx 


4053 


560669 


2044 E10 ez43 259729 


F 


M0005 5246AE 1 0 

iij.v/v/vw' yz< i v/n.Ju x vy 


IJC2-ColoiiMetLiver 

Vy VyA Vy VyAV/AAA~XVyl 1 VX V VA 


4054 


423884 


2044N10ffz43 259738 


F 

X 


M0005 525 8 AC07 


UC2-ColonMetLiver 

Vy VyA Vy UiVlU YAVVUl V vl 


4055 


523732 


2044111 ez43 259749 


F 


M0005525 1CH07 

atx vy vy vy _y *_y *_y 1 vy.iAV/ / 


UC2-ColonMetLiver 

Vy VyA Vy VI V/1ALTA V IXyi V VA 


4056 


493359 


2044Kllffz43 259751 


F 


M00055254CA06 

1tAV/vv»/ «y j--^J i \»y ..iWy vy 


Vy VyA Vy VyAV/AJ-LVXyyt 1 /A V vl 


4057 


560204 


2044 E14 ffz43 259793 


F 


M00055246BC04 


UC2-ColonMetLiver 

Vy VyA Vy V7 1 VJ AAA t X Vv 4.1 jL V VI 


4058 


558461 


2044 F14 az43 259794 


F 

X 


M00055248AF10 

lVXvy vy vy w/ w/y_#*T OyTv.X x Vy 


T TC9-Po1nn Met! ,iver 

Vy Vy A Vy VyAVylXl VX v LX_/A V vl 


4059 


447380 


2044 J14 az43 259798 


X 


M00055252D-C10 

IVlv/uV/*' Jj6J^Ay.\y X V/ 


T 1(^2-^01011 MetTyi ver 

Vy VyA Vy V/Xv/Xil VX V LXy 1 V vl 


4060 


558559 


2044 01 4 az43 259803 


F 

X 


M000S5259R*Ct1 1 

AVAVy vy Vy^y ^ y_._y J7 . VJ X X 


TTC2-Co1onMetL,iver 

VV VyA Vy v/lv/lXJLVl v LXy A V vl 


4061 


450025 

*T-J V/V/A«y 


2044 A15 ffz43 75Q805 


F 

X 


M0005524-1 A*C06 

lVXV/V/V/»^<-/^*T 1/V.V/V/U 


T JC2-ColonMetIyAver 

Vy VyA Vy v/1 v/lXLVX V LXy 1 V Vl 


4062 


484086 

« u^rvy vy vy 


2044 F15 az43 259809 


F 

X 


M0005 5246BD 1 0 

J.VXvy V/ Vy ^ ~JjZ.—t vJX-/.X-^ X V/ 


T JC2-ColonMetLiver 

vy VyA Vy VyAV/AAAVXVLlyA V VA 


4063 


558861 

«y »/ O Uu X 


2044K15ez43 259815 


F 

X 


M00055254DB1 0 

XVXV/V/V/»/*7A.»7*Tl - /.X J> * X \J 


I JC2-Colon MetLiver 

V VyA Vy V/1V/1XL VX V Uuix V vl 


4064 


486787 


2044 C16 ffz43 259823 


F 


M00055243DC06 

1tAV/V/V/»/.J Z<^_/Ay . V_y V/ Vy 


T JC2-ColoiiMetLiver 

Vy VyA vv/iv'iALTXVUyi v vx 


4065 


562256 


2044 L16 ffz43 259832 


F 

X 


M00055256A-D12 


I JC2-ColonMetLiver 

Vy VyA Vy vlvlUYAvUyl V VI 


4066 


561579 


2044 El 7 &z43 259841 


F 

X 


M00055246CF08 

1tAv/V/v/«/»/ >I."TVy\»y «X V/O 


T JC2-ColonMetLiver 

Vy VyA Vy V/lv/lXLVXVU_/l V VI 


4067 


560205 

\J \J \J 


2044 J 17 ez43 259846 


F 


M0005 525 3 A- C07 


UC2-ColonMetLiver 

Vy V** vyviVUiTivuyi t v* 


4068 


558134 


2044.M17.gz43_259849 


F 


M00055257B:B10 


UC2-ColonMetLiver 


4069 


488143 


2044.H18.gz43_259860 


F 


M00055251A:B09 


UC2-ColonMetLiver 


4070 


500737 


2044.118.gz43„259861 


F 


M00055251D:H11 


UC2-ColonMetLiver 


4071 


238121 


2044.C20.gz43_259887 


F 


M00055244A:C06 


UC2-ColonMetLiver 


4072 


629002 


2044.D20.gz43_259888 


F 


M00055245B:H01 


UC2-ColonMetLiver 


4073 


562256 


2044X21.gz43_259912 


F 


M00055256B:C07 


UC2-ColonMetLiver 


4074 


541784 


2044.B22.gz43_259918 


F 


M00055242D:E02 


UC2-ColonMelLiver 
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SEQ 
ID 
NO 


XUSTEF 


SEQ NAME 


OKIE 
NT 


CLONE ID 


LIBRARY 


4075 


560507 


2044.D22.gz43 259920 


F 


M00055245C:D04 


UC2-ColonMetLiver 


4076 


555340 


2044 F22 cz43 259922 


F 


M00055248C:B11 


UC2-ColonMetLiver 


4.077 


560059 


2044 G22sz43 259923 


F 


M00055249C:B12 


UC2-ColoiiMetLiver 


407X 


404130 


2044 K24 £z43 259959 


F 


M00055255A:F11 


UC2-ColonMetLiver 


4070 


557028 


2044 L24 ffz43 259960 


F 


M00055256B:G05 


UC2-ColonMetLiver 


40X0 


550391 
~> *j y ~> x 


2053 K01 ez43 259984 


F 


M00055272D:E04 


UC2-ColonMetLiver 


4081 


343X1 

-7*T,JO X 


2053 E02 &z43 259994 


F 


M00055265A:G01 


UC2-ColonMetLiver 


4089 


556447 


2053 H02 ffz43 259997 


F 


M00055268B:D12 


UC2-ColonMetLiver 


4o$n 


555564 


2053 E03 ez43 260010 


F 


M00055265A:G07 


UC2-ColonMetLiver 


40X4 


558360 


2053 M03 ez43 260018 


F 


M00055275B:H06 


UC2-ColonMetLiver 


40X5 


55X170 

J JO lXiV 


2053 B05 sz43 260039 


F 


M00055262B :A1 1 


UC2-ColonMetLiver 


40X6 

HvOvJ 


511 276 


2053 D05 ez43 260041 


F 


M00055264A:F03 


UC2-ColonMetLiver 


40X7 


550074 


2053 F05 ez43 260043 


F 


M00055266B:C12 


UC2-ColonMetLiver 


40RX 


55XX67 

J JOOU / 


2053 L05 ffz43 260049 


F 


M00055274CC06 


UC2-ColonMetLiver 


40XO 


555564 


2053 N05 ffz43 260051 


F 


M00055277D:A02 


UC2-ColoiiMetLiver 


4000 


55170X 


2053 O05 ffz43 260052 


F 


M00055279AE03 


UC2-ColonMetLiver 


4001 


556010 


2053 DOX az43 260089 


F 


M00055264B:E06 


UC2-ColonMetLiver 


400? 


4X0060 


2053 O09 ez43 260116 


F 


M00055279B:D02 


UC2-ColonMetLiver 




47X0X7 

*T / ouo / 


2053 C10 ffz43 260120 


F 


M00055263C:D02 


UC2-ColonMetLiver 


4004 


450507 

*tJ \J J V/ / 


2053 D10 ez43 260121 


F 


M00055264B:H12 


UC2-ColonMetLiver 


4005 


SI 2721 


2053 E10 ez43 260122 


F 


M00055265C:A04 


UC2-ColonMetLiver 


4006 


55X052 


2053 Bll ez43 260135 


F 


M00055262C:B08 


UC2-ColonMetLiver 


4007 


550X63 


2053 A12 sz43 260150 


F 


M00055261B.G12 


UC2-ColoixMetLiver 


400X 


560057 


2053 L12 ez43 260161 


F 


M00055274D:A11 


UC2-ColonMetLiver 


4000 


560621 


2053 Ml 2 ez43 260162 


F 


M00055275D:E12 


UC2-ColonMetLiver 


4100 


450942 


2053 C13 ffz43 260168 


F 


M00055263C:F10 


UC2-ColoiiMetLiver 


4101 


446557 


2053 R14 sz43 260183 


F 


M00055262CF05 


UC2-ColonMetLxver 


A107 


56067X 


9053 F14 cr743 260186 


F 


M00055265CE01 


UC2-ColonMetLiver 


4103 


557561 


2053 F14 ffz43 260187 


F 


M00055266D:C09 


i UC2-ColonMetLiver 


4104 


505X5X 

JUJOJO 


9053 (^15^743 260200 


F 


M00055263C:G09 


UC2-ColonMetLiver 


4105 


557561 


2053 El 5 ez43 260202 


F 


M00055265C:F01 


UC2-ColonMetLiver 


4106 


44X450 


9053 Ml 5 t*743 260205 


F 


M00055268D.G09 


UC2-ColonMetLiver 


Al 07 


A7557X 
*r /*/«7 / O 


9053 T1 5 cr743 260206 


F 


M00055270DB05 


UC2-ColonMetLiver 


41 OX 


540059 


9053 T 15 a743 260209 


F 


M00055274D:B10 


UC2-ColoixMetLiver 


4100 


569X71 

«7U.£.0 / 1 


9053 Ml 5 *7z43 260210 


F 


M00055275DH08 


UC2-CoIonMetLiver 


4110 


558263 


2053X18.gz43_260257 


F 


M00055274D:C10 


UC2-ColonMetLiver 


4111 


560538 


2053.O18.gz43_260260 


F 


M00055280A:C09 


UC2-ColonMetLiver 


4112 


554000 


2053.C19.gz43J260264 


F 


M00055263D:C03 


UC2-ColonMetLiver 


4113 


128962 


2053.A20.gz43_260278 


F 


M00055261D:D09 


UC2-ColonMetLiver 


4114 


560278 


2053 ,M20.gz43_260290 


F 


M00055276B:C09 


UC2-ColonMetLiver 


4115 


420892 


2053.F21.gz43_260299 


F 


M00055267A:D01 


UC2-ColonMetLiver 


4116 


558513 


2053 .P2 1 .gz43_260309 


F 


M00055281AF08 


UC2-ColonMetLiver 
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SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORES 
NT 


CLONE ID 


LIBRARY 


4117 


511847 


2053 F22 ez43 260315 


p 


MO0OSS967A-fr1 1 

lVXV/WV/«/*/X>v / Zi.,VJ X X 


TTP^-PnlrniA/IWT i-i;pr 
w -v^uiujxiviclljI vcr 


4118 


481958 


2053 P22 ez43 260325 


F 


M0005598 1 BTj04 


TTr , 9-r*n1on1V4f»+T tx^r 

vJ \^x#*-V^OiOJxiVJ.CLx-/l VCI 


4119 


561666 


2053 F24 ez43 260347 


F 


M00055267BB06 


LJ V^Z,~V^U1UJULLVXC LLrl VCI 


4120 


553665 


2053 124 ez43 260350 


F 


M00055271R-rO1 


TTP9-Pn1nn1\/TptT ivf*r 

U v^Z» VvUlUlliVXC LL/X VCI 


4121 


643513 


1669 A01 ez43 260687 


F 


M 0005 5 65 1 C ■ F07 


T TP 9 .Norm Pnl nn 


4122 


641658 


1669 J01 sz43 260696 


F 


M00055663AH1 0 


T TP 9 -MnrmPrilrin 


4123 


639752 


1669 M01 ez43 260699 


F 


M00055667BC08 


T TP9-"MnrmPri1rMn 


4124 


518569 


1669P01ez43 260702 


F 


M00055671AH03 


T TP9-Mnrm Pnlnn 


4125 


642068 


1669N02ffz43 260716 


F 


M00055668CA04 


T TP9 -lSJnrm Polon 


4126 


639743 


1669 O02 ez43 260717 


F 


M 0005 5 669D B08 

j. vX V/ \s \/J \s\J ^ XS . X-# v/O 


T TP9 -Mrvrm Pnlrin 


4127 


642230 


1669 P03 az43 260734 


F 


M00055671BB02 


T TP9 -Mnrm Pnl nn 
\j vx i n \jx xxxy^/Xjixjxx 


4128 


447585 


1669 104 sz43 260743 


F 


M00055669BF1 1 


T TP9 -MnrmPrkl nn 


4129 


640078 


1669 J04 ez43 260744 


F 


M00055663BF09 


TTP9-."MnrmPn1nn 


4130 


647333 


1 669 B06 az43 260768 


X 


1^0005565^-^0^ 


T TP 9 -lSlnrm Pnl nti 
U ^Z.*"l>i Ul IIIV-HJIUII 


4131 


472226 


1 669 C06 ez43 260769 


F 

X 


M00055654B-GOO 


T TP 9 -KTnrm Pnl nn 
u v^x. i> uniiv^uiun 


4132 


455778 


1 669 H06 oz43 260774 


F 

X 


M00055661 BF07 


T TP9-TsJnrrnPn1nn 


4133 


523875 
~f x, ~j \j I -j 


1 669 T07 ^74^ 960791 


T7 


M00055669P-P1 1 


T TP9 -Mnrm P nl nn 
KJ KsZ. -IN Ul lllv^OlOIl 




VJ— ?Ox» 1 X 


1 669 T07 cr74^ 9607Q9 


r 


1^0005566^(^*1105 


T TP 7 JWVvrm P/>1 r*« 

u v^z,-rs ui iiiv^oioii 


41^5 


*TU-J 


1 66Q KiV7 crz4^ 9607Q3 


x: 


A/TOOn5^66^P-P 1 0 


u v^z-*xn onriv^oioii 


4136 


648352 


1 669 O07 frz43 260797 


F 

X 


M00055670A -R04 


T TP 9 -"Mnrm Pnl nn 


4137 


641469 


1669C08i?z43 260801 


F 

X 


M00055654PA05 


T TP9 -TsJnrm Pnl nn 
L/ v-/x.-l>Unil^Ulvjll 


4138 


450883 


1 669 F08 az43 960803 

x *jvjj7 .i^yo,H/(7j Z.UUOUJ 


F 

X 


M00055657 A B04 


T TP9 -Mnrm Pnlrvn 

ulz -in oi jtiii^oion 


4139 


638971 


1669P09crz43 260817 


F 


M00055654P-P1 0 


T TP9 -."Mnrm Pnl nn 
\J \^r£ -IN UllilV/OlOIl 


4140 


641379 


1669 109 ffz43 260823 


F 

X 


M0005 5 669 PD1? 


T TP9 -Mnrm Pnl nn 
U v>-x. IN Ul 1 11 V-/\J1U11 


4141 


618998 


1669 A 10 ffz43 260831 


F 

X 


M00055659BB1 1 

IVlUuuJ JUJX.JJ.lJl x 


T TP9 -Mnrm P nl nn 
yJ \w/x#-1NUH11\^U1U11 


4142 


644971 


1669 B10 ez43 260832 


F 

X 


M00055653BG01 

IVlvuV/J JUJ JJJ.VJu X 


T TP9 -Mnrm Pnl nn 

KJ v>x- "IN Ul 111V_/UIU11 


4143 


556488 


1669n0«r74^ 960833 


F 


M00055654PTjO^ 

IVIvvvJJUJ'tv.I/wJ 


T TP9JMnrmPn1nn 
U ^Z-*"IN UI IIIV/OIOII 


4144 


452687 


1669 G10 ez43 260837 


F 

X 


M00055660AA06 


T TP 9 -Mnrm P nl nn 

U IN Ul 1 11V_/U1U11 


4145 


643909 


1 669 Jl 0 0^43 260840 

X \J\J Z? .J X \J .fd i J-jr~T ~J i,U\/0*tv 


F 

X 


\f0005566^n*R05 


T TP 9 -Mnrm Pnl nn 
\J V/X -IN Ul XllV^UlUIl 


4146 


646894 


1669 M10 pz43 960843 

XyJKJZs ,XYX X \J .^XiT J X»vJY/0"J 


F 

X 


M 0005 5 667r - F07 


T TP 9 -Mnrm Pnl nn 
\J v>Z.-lN Ullllx^UlUll 


4147 


556750 


1669N10 ez43 260844 


F 

X 


M00055668DE1 1 


T TP 9 -Mnrm Pnlnn 
w \_^x. —IN UI lliV/Uluxl 


4148 


640259 


1669Bllez43 260848 

1 UV7.JU1 i .gtlTJ ArV/V0~O 


F 

X 


M00055653BG1 1 

IY1V/UI/JJ VJJ JjJ, VJ X X 


T TP9 -Mnrm Pnlnn 

\J w-x* "1 N \JX XX I V> Ul UJX 


4149 


643804 


1669Dllez43 260850 


F 

X 


M00055656AF0Q 


T TP 9 -Mnrm Pnl nn 
U L/X "IN UllllV^UlUll 


4150 

~ X J V/ 


97909 


1669F11 az43 960859 


T7 
V 


M00O5565OA- A08 


T TP9-MnrmPn1nn 

u -in ui m^oion 


4151 


503275 


1669 Gil oz43 260853 


F 


M0005566OA-P05 

IVluUU J J OOUA. vU J 


T TP9-MnrmPn1nn 
U -IN Ol lllV^OXOIl 


4152 


640678 


1669.L1 l.gz43_260858 


F 


M00055666B:E06 


UC2-NormColon 


4153 


406734 


1669.A12.gz43j260863 


F 


M00055652B:F12 


UC2-NormColon 


4154 


643563 


1669.G12.gz43__260869 


F 


M00055660A:C08 


UC2-NormColon 


4155 


658447 


1669.H12.gz43_260870 


F 


M00055661C:E11 


UC2-NormColon 


4156 


446188 


1669.112.gz43_260871 


F 


M00055662C:H06 


UC2-NormColon 


4157 


603857 


l669.L12.gz43__260874 


F 


M00055666C:Cll 


UC2-NormCoIon 


4158 


561894 


1669.M12.gz43_260875 


F 


M00055667D:B01 


UC2-NormColon 
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Table 2 



SEQ 
ID 
NO 


:lusteb 


SEQ NAME 


OWE 
NT 


CLONE ID 


LIBRARY 


4159 


639596 


1669.11 5 .gz43_2609 19 


F 


M00055662D:A09 


UC2-NormCoIon 


4160 


589483 


1 669.G 1 7.gz43_260949 


F 


M00055660B-.H02 


UC2-NormColon 


4161 


639665 


1669.117.gz43_26095 1 


F 


M00055662D-.B07 


UC2-NonnColon 


4162 


641683 


1669 017.ez43 260957 


F 


M00055670D:F02 


UC2-NormColon 


4163 


642111 


1 669 .C 1 8 .gz43_26096 1 


F 


M00055655A:A09 


UC2-NormColon 


4164 


640179 


1 669.K1 8 .gz43_260969 


F 


M00055665D:F11 


UC2-NonnColon 


4165 


648352 


1669.N18.gz43_260972 


F 


M00055669B-.G02 


UC2-NonnColon 


4166 


640018 


1669.I20.gz43J260999 


F 


M00055662D:E05 


UC2-NormColon 


4167 


642332 


1669.L20.gz43 261002 


F 


M00055667AB12 


UC2-NonnColon 


4168 


639706 


1669.F21.gz43_261012 


F 


M00055659C:B10 


UC2-NonnColon 


4169 


642411 


1 669.C22.gz43_261 025 


F 


M00055655B:B08 


UC2-NormColon 


4170 


641091 


1669.122.gz43_261031 


F 


M00055663A.B02 


UC2-NonnColQn 


4171 


463312 


1669.B23.gz43 261040 


F 


M00055653D:H02 


UC2-NormColon 


4172 


456561 


1669.L23.gz43_261050 


F 


M00055667A.H10 


UC2-Norn»Colon 


4173 


649631 


1669.A24.gz43_261055 


F 


M00055653AG08 


UC2-NormColon 


4174 


639674 


1669.F24.gz43_261060 


F 


M00055659C.D06 


UC2-NormColon 


4175 


640261 


1669 I24.gz43 261063 


F 


M00055663A-.G04 


UC2-NormColon 


4176 


640889 


1670.N03.gz43 261116 


F 


M00055691B:E07 


UC2-NormColon 


4177 


426366 


1670 DOS az43 261138 


F 


M00055676D:H07 


UC2-NormColon 


4178 


643594 


1670.L05.gz43_261 146 


F 


M00055687C:F01 


UC2-NormColon 


4179 


446675 


1670.M05.gz43_261 147 


F 


M00055689B:F04 


UC2-NonnColon 1 


4180 


640645 


1670.H06.gz43__261 158 


F 


M00055683B:D10 


UC2-NormColon 


4181 


645909 


1670. J06.gz43_261 160 


F 


M00055685B:E12 


UC2-NormColon 


4182 


551287 


1670K06sz43 261161 


F 


M00055686B:D03 


UC2-NormColon 


4183 


646638 


1670.N06.gz43__261 164 


F 


M00055691C:E02 


UC2-NormColon 


4184 


639805 


1670.B07.gz43 261168 


F 


M00055673D:C01 


UC2-NonnColon 


4185 


640092 


1670.A08.gz43_261 183 


F 


M00055672D:F06 


UC2-NormColon 


4186 


634122 


1670 M08 gz43 261195 


F 


M00055689C:B03 


UC2-NonnColon 


4187 


641618 


1670.G09.gz43_261205 


F 


M00055682C.D06 


UC2-NormColon 


4188 


641287 


1670.L09.gz43 261210 


F 


M00055688AA02 


UC2-NormColon 


4189 


557625 


1670.H10.gz43_261222 


F 


M00055683CA03 


UC2-NormColon 


4190 


599838 


1670.1 1 0.gz43_26 1 223 


F 


M00055684B:D06 


UC2-NormColon 


4191 


634122 


1670.M10.gz43_261227 


F 


MO0055689C:C03 


UC2-NormColon 


4192 


641726 


1670.K12.gz43_261257 


F 


M00055686D-.E04 


UC2-NoraiColon 


4193 


649921 


1670.L12.gz43_261258 


F 


M00055688A:E04 


UC2-NormColon 


4194 


645431 


1670.P12.gz43_261262 


F 


M00055695D:E01 


UC2-NormColon 


4195 


641728 


1670.K13.gz43_261273 


F 


M00055686D:E10 


UC2-NormColon 


4196 


502609 


1670.N13.gz43 261276 


F 


M00055691D:E04 


UC2-NormColon 


4197 


650673 


1670.C14.gz43 261281 


! F 


M00055675D:E04 


UC2-NormColon 


4198 


644699 


1670.H14.gz43 261286 


F 


M00055683C:D11 


UC2-NormColon 


4199 


645146 


1670.A16.gz43 261311 


F 


M00055673BA04 


I UC2-NonnColon 


4200 


647427 


1670.E17.gz43 261331 


F 


M00055680B:H04 


UC2-NormColon 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORBE 
NT 


CLONE ID 


LIBRARY 


4901 


553Q91 


1670 HI 7 <rr43 961334 


i? 

r 


M0n0^^683r 1 *H'1 1 

iVlUUVJJ JOo JV^.JTIA a 


u vyZ"iNoi xiiv^oion 


4909 


630460 


1 670 A 1 8 ay43 961 343 


Xr 


A/TOOO^*; 6731^-1^0 8 


Ul^Z-xNOlxllv^OlOn ! 


t-Z/DJ 


350500 


1670 1718 0743 961348 

AO / V/.J7 lO.gZtj ZOlJtO 


r 


lV>f000^^6811i-P1 1 


u v^z-xn oirn^oion 


4904. 


640558 


1670 118 a-743 9613*59 

AO /U.J XO.gZ**^ ZiUlJJZ 


J7 
r 


Ayrooo^ ^ 68 -"R06 

1VAUUU J <J OO J A-/ .DUO 


u Vw/Z-iN oimv^ oion 


490S 


561090 


1 670 1SJ1 8 a-743 961356 


p 

r 


1V/fO0O^6O9 A pn^ 


T Tr"7JMArmPrtlAn 

u v^z**in Oiiui^oion 


4906 


446949 


1670P90Q7-43 961377 


V 


A/fOOO^ ^676Tl«riO<; 


U I^Z**1N OITIlv>010Il 


49 07 


645068 


1670R91 tr-743 961309 


P 

x^ 


M000^ ^674^^0^ 


1 IC* 9 -TSJnrrw r*1 on 


4908 


637066 


1670 F99 a-743 961419 

lO / U.J7ZZ.gZ*t.J? ZVO A*+1Z 


p 


A/f000^689 A -R07 


1 Tr 1 9 -NJnrm f 1 r*l r\n 

u v^z-rNOiinv^oioix 


4900 


640896 

0*tOOZD 


1 670 A/T99 ar43 961410 

Jl O 1 \J .1VAZZ, .gZ.HO JL>\) A*r A2/ 


p 

X? 


TV/T000^6Q1 A*Ti08 


1 Tr^9 -Islnrm l^rkl nti 

u v^Z"iN oiiiiv>oiori 


4910 


649391 


1670D93 »7'43 961496 

1U /U.J-/Z,.>.gZ*TJ .Z#UX*T.Z»0 


P 

X7 


M00OS^67OR*PO3 




491 1 

HZl 1 


601 137 

DUI A^> / 


1670 193 oy43 961431 

1U / V/,XZ^»gZ*rO ZD I HO 1 


P 


AA000S^68S A -P10 


1 TP 9 -TNJnrm C r\\ nn 
U V^Z-lNDHll\^01t>lx 


4919 

HZ IZ, 




1670 F94 crr43 961443 


P 


M00OS^681 A-P09 




4913 
*+z l.? 


647856 

OH / OjO 


1670 194 crz43 961447 


p 


A/T000^S68^ A «P09 
IVADDU J D OO JirV. A? uZ 


1 TP9 -TsJnrm PaI nn 
U v^Z-iN OI lALl^OlOTl 


4914 

A*r 


554564 


1 670 "K"94 a^43 961440 


P 
x^ 


MO0OS S687P -"R04 


1 TP 9 -KFr»rm P r*1 nn 


491 S 


561499 


1671 107 0743 961550 

ID / 1 .AD / .gZ*hJ ZOJIJJ7 


p 


M000SS71 1R-"R08 


1 TP9 JSJnrm Prdnr 3 
uv^Z""i>oxiii^O'iori, i 


4916 


649988 

O^+ZZOO 


1671 P07 a^43 961 566 


P 


MO0OSS79 1 R»D08 


T TP9-TSJnrmPn1m- 


4917 


649069 
OtZOOZ 


1671 P08 cr-743 961571 


A? 


mooo s ^ 704 A • noo 

lVXUUD-J-> /D^/V.X^DI/ 


1 TP9 -Mnrm Pnlntt 


491 8 

*tZr AO 


463368 


1671 1^08 ar43 961 577 

1 D / 1 . JVUO .gZHO ZDU/ / 


P 


MO0OSS71SAT)in 


1 TP 9 -"Mnrm P r*1 nn 


4910 

*frZ Ai? 


646098 
OHOUZO 


167T FIOO o^43 961586 


r 


^000^^709^ -no 1 

J\1\J\J\JDD /UZV^.X/U1 


1 TP9»iMnrmPn1nn 
U v/Z 1 1 Ol lIlwUlUAX 


4990 


478109 


1671 MOO 0743 961505 


P 

X7 


M000S S7 1 8 A P0 1 


1 TP9 JNJnrm P n1 nn 

U VvZ J.N DI 111 V^OIOXI 


4991 

H-ZZ 1 


640 Q47 


1671 niO ot43 961609 

AO / 1 .J-r AU.gZnO ZDADDZ 


P 


M000^^709P»P1 1 
lVAUUUJ D /UZl^.A^ A I 


1 TP 9 -NTnrmPnl nn 

u v_^z-in oi in v^oion 


4999 


463548 


1671 PM0 <r*43 961 605 
AO/ A .VjriU.gZHO_Z010U.> 


P 

r 


TV/f00O^*i7O6TVFn 9 
1V1UUUJ J /DOI3.A71AZ 


1 TP 9 -"NJnrm P nl nn 
U ^Z-rNOllllv^OlOIl 


4993 


6409 89 
OHUZoZ 


1 671 "MIO cyy43 961611 
AO / A.lVAlU.gZ*fj__ZOiOA A 


P 1 
r 


A>T000^^71 8 A RO^ 


1 TP 9 -"Mnrm Pnl nn 
U ^/Z-1N OI I IlV^OlUIl 


4994 


41 141 

*+l A*t 1 


1671 "R1 1 ar43 961616 

XO / A .OA A .gZHO ZD AO AD 


p 


M00OS S600P -Ti 09 

1VXUUD J D K>27 y . U DZ 


1 TP 9 -Mnrm P nl nn 

U V^Zr"-i>lOIIIlV-r010xX 


4995 


463368 


1671 111 o-t43 961693 
lO / I.Jill ,gZHD_ZO I OZJ 


X* 


"M"000^^71 ir*-AA7 


1 TP 9 -"Mnrm Pnl nn 


4996 
H-ZZO 


558086 


1671 111 rrz43 961694 
IO / 1J 11 .gZH-3_ZO A OZH 


r 


lV/r000^^71 3/^^19 
1VAUUU3 D 1 IjL/.F 1Z 


T TP 9 JMnrm Pnl nn 


4997 
*tZZ / 


560546 


1671 Pi 9 cttAI 7£l/;33 
AO / 1 .^lZ.gZH\3__ZO 10.33 


r 


A>f000^^701 A-R04 


T TP 9 JMn-rm Pnl nn 


4998 


551 41 5 


1671 PI 9 nvAl 7/;i*;35 
IO/ l.JilZ.gZ*f3_Z010^-5 


i p 


1V/TOOO ^ ^70415 '1^04 


U Z ~iN Olllll^ OxOxl 


499Q 


55771 0 
JJ / / 1U 


1671 TsJ19 rr-7/13 961644 
AO / A .IN AZ.gZ^f J_ZO 1D4h 


r 


Aif 000^^71 O A -Til 1 


1 TP 9 -Mnrm P nl nn 

u v>z-xN oi lii^oion 


4930 


640490 


1671 014 »v43 961660 
IO / l.VJAT , .gZ £ iO_Z0100i' 


Jl 


1V/T0OO S ^707 A • P07 

1VAUUU*)*? / D /A.J5D / 


1 TP 9 -Mnrm Pnl nn 
\J v^Z"l > UXIX1\^010I1 


4931 


451401 
HJ AHU 1 


1671 1 14 0^43 961674 
IO / A.A-flH-.gZHJ_Z010/*+ 


r 


\A OOO ^ ^ 7 1 7"R • A 03 


T TP 9 JKTnrm Pnl nn 
U V^Z-xN Ol IH\-/OIOH 


4939 


649030 


1A71 ^17^43 9A1713 


X* 


IVyf OOO ^ ^ 70 1 T* • r*03 


TTP9 MormpAlnn 


4933 


471 9651 
*f / AZOo 


1671 1717 rrr43 961716 
IO / l.X* 1 / .g^'f^_ZOA / AO 


1 p 


ivyf ono^ ^7o<r* -p 1 9 


1 TP 9 -"Mnrm Pnl nn 
U \-/Z-lN Ol IIIV^OIOD 


4934 


41 7950 


1671 T-T1 7 rr*7-43 9^1 71 » 
AO /l. XT. A / .gZ^f j ZOI / AO 


P 


TV/fnon^^norvrri n 


T TP9_MnrmPnlnn 
U V^Z-xN OIIHI^OIOII 


4935 


*f Z 1 oZO 


1^71 T 17rrr43 9^1779 
A O / 1 ..Li 1 / .gZ^O_ZO 1 / ZZ 


P 


A/f00n<I^71 7T*'F04 


T TP 9 .Mrtrm P />1 nn 
U V-/Z-IN Ol IHV_/010Il 


4236 


465589 


1671 M17 ez43 261723 


F 


M00055718BH11 


UC2-NormColon 


4237 


645505 


1671.N17.gz43_261724 


F 


M00055719A:G04 


UC2-NormColon 


4238 


510195 


1671.N18.gz43_261740 


F 


M00055719A-.G12. 


UC2-NoimColon 


4239 


492094 


1671.P18.gz43_261742 


F 


M00055722A:B05 


! UC2-Noi*mColon 


4240 


453006 


1671.G20.gz43_261765 


F 


M00055707A:F11 


UC2-NormColon 


4241 


600115 


1671.P21.gz43_261790 


F 


M00055722AC04 


UC2-NonnColon 


4242 


641700 


1671.H22.gz43_261798 


F 


M00055710B:E04 


UC2-NonnColon 
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Table 2 



SEQ 
ID 
NO 


3LUSTEB 


SEQ NAME 


OKIE 
NT 


CLONE ID 


LIBRARY 


4243 


466265 


1671. 022.gz43_26 1805 


F 


M00055721A.A07 


UC2-NormColon 


4244 


553985 


1 67 1 .B23 .gz43_26 1 808 


F 


M00055700B:E10 


UC2-J\formColon [ 


4245 


536415 


1 671 .L23 .gz43_26 1818 


F 


M00055717C:B07 


UC2«NonnColon 


4246 


640282 


1671 .N23 .gz43_26 1820 


F 


M00055719C:A09 


UC2-NormColon \ 


4247 


498194 


1672A01cz43 261841 


F 


M00055722B:A01 


UC2-NorxnColon 


4248 


641305 


1672E02ez43 261861 


F 


M00055725A:F12 


UC2-NonnColon 


4249 

*TXV^.7 


641070 


1 672 .H02 .gz43_26 1 864 


F 


M00055727D:G01 


UC2»NormColon 


4250 


556326 


1672.B03.gz43 261874 


F 


M00055723A:B08 


UC2-NormColon | 


4251 


522497 


1672.C03.gz43 261875 


F 


M00055723D:C04 


UC2-NormColon 


4252 


557833 


1672.E03.gz43 261877 


F 


M00055725A:G07 


UC2-NormColon 


4253 


647688 


1672.B04.gz43 261890 


F 


M00055723A:C04 


UC2-NormColon ^ 


4254 


416377 


1672 M04 ez43 261901 


F 


M00055733D:H01 


UC2-NormColon 


4755 


643897 


1672 O04 2z43 261903 


F 


M00055736B:C08 


UC2-NormColon 


4956 


645171 


1672A06ez43 261921 

X » XV.XXWY/.f^f iT*J X< V X ^ X 


F 


M00055722B:G12 


UC2-NormColon 


4257 

*TXV-J / 


649106 


1672 C06 az43 261923 


F 


M00055723D:E05 


UC2-NormColon 


4758 

*TxvJO 


454383 


1672K06ez43 261931 


F 


M00055731C:F09 


UC2-NormColoii 


4950 


649360 


1672.B07.gz43 261938 


F 


M00055723A:F04 


UC2-NonTiColon | 


4760 


639458 


1672 K08 ez43 261963 


F 


M00055731C:H07 


UC2-NormColon 




642078 


1672Bllsz43 262002 


F 


M00055723B:A09 


UC2-NormColon 


4762 


642631 


1672Lllez43 262012 


F 


M00055733A.G04 


UC2-NonnColon 


4263 


650303 


1672A12 ez43 262017 


F 


M00055722C:E11 


UC2-NormColon 


4264 


284586 


1672.C12.gz43 262019 


F 


M00055724A:C12 


UC2-NormColon 


4265 


642535 


1672.G12.gz43 262023 


F 


M00055727B.F10 


UC2«NormColon j 


4266 


642491 


1 672. Al 3 .gz43_26203 3 


F 


M00055722C:F11 


UC2>NormColon 


4267 


100821 


1672P13gz43 262048 


F 


M00055738B.H11 


UC2-NonnColon . 


4268 


641056 


1672A14ez43 262049 


F 


M00055722D:A07 


UC2-NoimColon 


4269 


642197 


1672B14ez43 262050 


F 


M00055723B:C03 


UC2-NormColon 


4270 


801 


1672.E14.gz43 262053 


F 


M00055725D.B02 


UC2-NormColon 


4271 


449956 


1672.N14.gz43 262062 


F 


M00055735C:G08 


UC2-NormColon 


4272 


645018 


1672 C15 ez43 262067 


F 


M00055724A:E03 


UC2-NormColon | 


427^ 
"xv / — > 


640879 


1672 K15 sz43 262075 


F 


M00055731D:F09 


UC2-NormColon 


4274 

^XV> / *T 


642263 


1672 015 ffz43 262079 


F 


M00055736D:D11 


UC2-NormColon 


4775 


647906 


1672P15ffz43 262080 


F 


M00055738C.C02 


UC2-NormColon 


4776 


559776 


1672 G16 ez43 262087 

iv / xv.vj xvj.ft^x«T»J xvv/xv\/vy / 


F 


M00055727C:B02 


| UC2-NormColon 


4277 


647060 


1672H16ez43 262088 

X yJ / XV.X IlVtfttiTJ XVx7XV\/<J<J 


F 


M00055728B:G1 1 


UC2-NormColon 


4278 


642315 


1672.116.gz43_262089 


F 


M00055729C:D11 


UC2-NormColon 


4279 


504501 


1672.L16.gz43_262092 


F 


M00055733B:F11 


UC2-NormColon 


4280 


460967 


1672.A17.gz43_262097 


F 


M00055722D.B10 


UC2-NormColon 


4281 


504568 


1672.C17.gz43_262099 


F 


M00055724A:G08 


UC2-NormColon 


4282 


559828 


1672.D17.gz43_262100 


F 


M00055724D:A02 


UC2-NonnColon 


4283 


455113 


1672.H17.gz43__262104 


F 


M00055728C:B08 


UC2-NormColon 


4284 


477064 


1672.117.gz43_262105 


F 


M00055729C.E02 


UC2-NonnColon 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


4285 


1558 


1672J17ez43 262106 


F 


M0005573 1 AG05 


TJr'2-Normr'olon ' 

KJ \-*jLi 1 > \JX XX IV/UIvlxl 


4286 


419443 


1672 LI 7 ez43 262108 


F 


M00055733B-H05 


X y KJXXliK^XJXKJLX 


4287 


546705 


1672 118 ez43 262121 


F 


M00055729DA06 


I JC 2-Norm Colo n 

KJ \^rJLt 1 > KJX XI lv KJX KJ 1 1 


4288 


235397 


1672 J18 ez43 262122 


F 


M0005573 1AH07 


UC2-NormColoii 

VJ V/ ^« J- ^ V»X XXXV^vXVXX 


4289 


651085 


1672 K18 sz43 262123 


F 


M00055732AB04 


UC2-NormColon 


4290 


502826 


1672 N18 ez43 262126 


F 


M00055735DC01 


TJC2-NorniColon 


4291 


544797 


1672 A19 ez43 262129 


F 


M00055722DE05 


Ur , 2-NormColon 


4292 


489426 


1672B19ez43 262130 


F 


M00055723CB02 


UC2-NormColon 

' llVlX XX V-/ Ul VJXX 


4293 


452212 


1672E19ez43 262133 


F 


M00055725DF05 


UC2-NormColon 


4294 


646609 


1672.M19.gz43 262141 


F 


M00055734DD01 


UC2-NormColon 


4295 


556336 


1672N19ez43 262142 


F 


M00055735DE10 

Iflvvvh/ J / */%/X/«XiX V 


UC2-NormColon 


4296 


467822 


1672 C20 erz43 262147 


F 


M00055724BD04 

XTXV / ' X^ • X^ V/ 1 


UC2-NormColon 


4297 


650397 


1672 L20 ez43 262156 


F 


M00055733CH12 

1TJLV V-f \J i V-' .X X X A* 


UC2-NormColoii 


4298 


37862 


1672 P20 2z43 262160 


F 


M0005573 8D-G08 


I JC2-NormColon 


4299 


644609 


1672 121 ez43 262169 


F 


M00055729D-F05 


I JC 2-NormColon 

V_/^-« X 1 \JX ILL \~/\JX\JXX 


4300 


561968 


\612 L21 ez43 262172 


F 


M00055733DB1 1 


I JC2-NormColon 


4301 


550049 


1672 P21 ez43 262176 


F 


M0005 573 9AB02 


UC2-NonnColoTi 


4302 


554708 


1 672 H22 ez43 262 184 


F 


M00055728DF02 


TJr , 2-Normr , oIon 


4303 


608540 


1672 P22 ez43 262192 


F 


M00055739AB04 


T 1^2^01-111(^01011 

KJ X ^ VX XXXV> VXVXX 


4304 


470769 


1672 A23 ez43 262193 


X 


M00055722DG1 2 


T TP9-NoiiiiPo1on 


4305 


642722 


1672 C23 ez43 262195 


F 


M00055724BG03 


UC2-NormColon 

V^* l > UllllV vxvxx 


4306 


645151 


1672 L23 ffz43 262204 


F 


M00055733DD04 


UC2-Normr i oloTi 

V^_sjZ* 1 1 VX XXI KJ X VXI 


4307 


648265 


1672 N23 ez43 262206 


F 


M00055736AB1 1 


T TC2-NorTnColon 

VJ V//- X. ^1 VI XI IV VI VAX 


4308 


489249 


1672 F24 ez43 262214 


F 


M00055726DH12 


UC 2-NormColon 

V-* Vp' <w x ~ VX XXXV_/ vxvxx 


4309 


644173 


1682 H02 &z43 262248 


F 

X 


M00055770AG08 


TJC2-NOTTT1C01011 

V»* V^X> X 1 VX XI IvUlUil 


4310 


642417 


1682 L02 ez43 262252 


F 


M00055774DA05 


UC2-NormColon 

V*» V*sX* X ^ VX XXX v^ vxvxx 


4311 


553877 


1682 M02 bz43 262253 


F 


M00055775CB10 


UC2-NormColon 

\— ' V_^^-— X ™ VX XXIV/V/XMXI 


4312 


530238 


1682 H03 ez43 262264 

x .x xv/ — ^ »r , 


F 


M00055770AH1 1 


UP2-NormColon 


4313 


553380 


1682 L03 ez43 262268 


F 


M0005 5 774D B07 


UC2-NormColon 

\—r ^ X ^ VX XXX VXVXX 


4314 


637387 


1682E04ez43 262277 


F 


M00055766BE11 

1UV Wmf *J 1 WX^.XWX X 


UC2-NormColon 


4315 


634660 


1682F04sz43 262278 


F 


M00055767AD10 

XTXW 1 V / X X.X^ X V/ 


UC2-NormColon 


4316 


642791 


1682J04sz43 262282 


F 


M00055772AH08 


UC2-NormColon 


4317 


446163 


1682 K04 &z43 262283 


F 


M0005 5 773 C * C09 


UC2-NormColon 

V> A# i. ^1 VX XI iv^vxvxx 


4318 


641056 


1682 M04 ez43 262285 


F 


M00055775CE10 


UC2-NormColon 

W X ^ VI XXX vxvxx 


4319 


550562 


1682 B05 &z43 262290 


F 

X 


M00055763BD07 


UC2-NorrnColon 


4320 


648320 


1 682 . A06.gz43_262305 


F 


M00055761D:C03 


UC2-NonnColon 


4321 


446757 


1682.H06.gz43_2623 12 


F 


M00055770B:D06 


UC2-NormColon 


4322 


454540 


1682.K06.gz43_262315 


F 


M00055773C.H12 


UC2-NonnColon 


4323 


458979 


1682.L06.gz43_262316 


F 


M00055774D:E02 


UC2-NormColon 


4324 


641174 


1682.P06.gz43_262320 


F 


M00055780C:E02 


I UC2-NormColon 


4325 


642852 


1682.E08.gz43_262341 


F 


M00055766B:H10 


UC2-NormColon ! 


4326 


488680 


1682.A09.gz43_262353 


F 


M00055761D:H08 


UC2-NonnColoii 
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SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


4327 


27083 


1682K09ez43 262363 


F 


M00055773D:G11 


UC2-NormColon 


4328 


524470 


1682 C10 ez43 262371 


F 


M00055765A:A04 


UC2-NormColon 


4329 


46395 1 


1682J10.gz43 262378 


F 


M00055772C:E11 


UC2-NormColon | 


4330 


584179 


1682 Cll ez43 262387 


F 


M00055765A:B02 


UC2-NoimColon 


4331 


642460 


1682 Dll ez43 262388 


F 


M00055765D:E06 


UC2-NormColon 




448944 


1682 Ell ez43 262389 


F 


M00055766C:C11 


UC2-NormColon 




642082 


1682 Fll sz43 262390 


F 


M00055767B:D02 


UC2-NormColon | 


4334 


642095 


168?Nllsz43 262398 


F 


M00055778D:A03 


UC2-NormColon 


4335 


562876 


1682.H12.gz43 262408 


F 


M00055770C:H11 


UC2-NormColon 


4336 


453079 


1682 112 <*z43 262409 


F 


M00055771C:A11 


UC2-NormColon i 


4337 


101499 


168^ LP ez43 262412 


F 


M00055775A:G09 


UC2-NormColon 


4338 


639829 


1682.N12.gz43 262414 


F 


M00055778D:C03 


UC2-NormColon 


4339 


642332 


1682P12gz43 262416 


F 


M00055780D:D09 


UC2-NormColon 


4340 


541499 


1682 G13 hz43 262423 


F 


M00055768B:H12 


UC2-NormColon 


4341 


509410 


1682.I13.gz43 262425 


F 


M00055771C:D09 


UC2-NormColon 


4342 


89082 


1682.013.gz43 262431 


F 


M00055780A:C04 


UC2-NormColon 


4343 


453091 


1682 P13 ez43 262432 


F 


M00055780D:F08 


UC2-NormColon 


4344 


639981 


1682 A14 ez43 262433 


F 


M00055762B:B11 


UC2-NormColon 


4345 


642564 


1682 B14 ez43 262434 


F 


M00055763C:G08 


UC2-NoimColon 


4346 


561558 


1682 014 ez43 262447 


F 


M00055780A:C06 


UC2-NormColon 


4347 


562870 


1682 B15 ez43 262450 


F 


M00055763D:B11 


UC2-NormColon 


4348 


462742 


1682 D15 sz43 262452 


F 


M00055766AA01 


UC2-NormColon 


4349 


451037 


1682 El 5 ez43 262453 


F 


M00055766C:G12 


UC2-NonnColon 


4350 


641563 


1682 J 15 ez43 262458 


F 


M00055772D:C10 


UC2-NormColon 


4351 


643940 


1682A16az43 262465 


F 


M00055762C:B04 


UC2-NormColon ! 


4352 


642204 


1682 C16 sz43 262467 


! F 


'M00055765A:C11 


UC2-NormColon 


4353 


517912 

Jl / y X fa 


1682 J 16 sz43 262474 


F 


M00055772D:D03 


UC2-NonnColon 


4354 


641 576 


1682 D17 ez43 262484 


F 


M00055766A:D08 


UC2-NonnColon 


4355 


642082 


1682 F17 ez43 262486 


F 


M00055767D:A12 


UC2-NormColon 


4356 


450211 


1682 017 sz43 262495 


F 


M00055780A:E11 


UC2-NonnColon 


4357 


621636 


1682 B18 ez43 262498 


F 


M00055764A:F02 


UC2-NormColon 


4358 


639232 


1682 D18 ez43 262500 


F 


M00055766A:E08 


UC2-NormColon 


4359 


450553 


1682T18ffz43 262506 


F 


M00055772D:F10 


UC2-NormColon 


4360 


647940 


1682 K18 ez43 262507 


F 


M00055774B:F07 


UC2-NormColon 


4361 


461835 


1682 Ml 8 ez43 262509 


F 


M00055778A:F09 


UC2-NormColon 


4362 


641496 


1682.018.gz43_262511 


F 


M00055780A:F07 


UC2-NormColon 


4363 


464510 


1682.H19.gz43 262520 


F 


M00055770D:E10 


UC2-NormColon 


4364 


642564 


1682.M19.gz43 262525 


F 


M00055778A:F12 


UC2-NormColon 


4365 


629002 


1682.C20.gz43 262531 


- F 


M00055765A:G02 


UC2-NormColoii ! 


4366 


1093 


1682.D20.gz43 262532 


F 


M00055766A:H03 


UC2-NormColon 


4367 


50351 


1682.E20.gz43_262533 


F 


M00055766D:D05 


UC2-NormColon 


4368 


642417 


1682.F20.gz43 262534 


F 


M00055767D:E07 


1 UC2-NormColon 
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SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


4369 


466971 


1682.J20.gz43_262538 


F 


M00055772DH04 


U^-NormCnlfiii 

W 1 ^ V/X JLXlV^wJlvJxJL 


4370 


641185 


1682.K20.gz43_262539 


F 


M00055774BH01 


T IC 2-Nnrm CrA on 

w VyXi X >l VJX X llVyVjXVJXX 


4371 


644240 


1682.L20.gz43_262540 


F 


M00055775BG02 


T TP?-NnTTnC'r»1nn 

1 iVJXXllVyVJXvyll 


4372 


509027 


1682.M20.gz43_262541 


F 


M00055778AG02 


T TP?-NnrmCr»1rvn 

\J VyZ- X > VyX X llVyVJXvyXX 


4373 


546740 


1682.F21.gz43_262550 


F 


M00055767DF04 


\~J Vy^- X > XXI VyUXVyXX 


4374 


559662 


1682X21. gz43__262556 


F 


M00055775BG04 

xtau \y \y .y «y i i *yju.vj\/~ 


T TP 9 -Norm Pol nn 

W Vy X. 11 VyX X HVyVyXVJXX 


4375 


388688 


1 682.B22.gz43_262562 


F 


M00055764BF05 


IJP2-NormPo]on 

V/VyZ> X^vyXXXXVyvyli/ix 


4376 


555140 


1682.G22.gz43 262567 


F 


M00055768DG02 


\J Vy X l VyX X XlVyVylvJXX 


4377 


640617 


1682.K22.gz43 262571 


F 


M0005 5 774C • D09 


T Tr , ?-Notmr , o1oTi 

KJ Vy*< X 1 VjFX XXX Vy\JX\JXX 


4378 


447224 


1682.D23.gz43 262580 


F 


M00055766BB01 


T TP ? -"Morm Pnlnn 

U\yi- X > \JX XXlVy VyXUXX 


4379 


644765 


1682.B24.gz43 262594 


F 


M00055764CC07 

Xflw w \/.y */ / v T Vy • Xy V/ / 


I TC2-NormPolon 

Vy ■£« X > VJ I XX 1 Vy VJXVJXX 


4380 


648996 


1682.F24.gz43_262598 


F 


M00055768AB05 


T TC2-"MormPo1oTi 

UvA X 1 V<FX XXL V^UXvyXX 


4381 


642036 


1682.I24.gz43 262601 


F 


M00055772AC10 

J- t A \y vy \y — / — ' / 1 Zjiii\y 1 W 


T TP^-TMormPnloTi 

VyJ Vy X» XN vXXXllVyUXUXX 


4382 


644914 


1682J24.gz43_262602 


F 


M00055773BA07 


V v/*»"i 1 VyX x 1 1 Vy vyXV^XX 


4383 


458425 


1682.M24.gz43_262605 


F 


M00055778BE01 


UC2-NormCo1 on 

\J V/i< X t VyX XXX Vy VyXV^XX 


4384 


556476 


1682.N24.gz43_262606 


F 


M00055779BF06 


UC2-Normr , oloTi 

W VyA( x ivyxxxiVyvyivyxx 


4385 


672601 


1683.E01.gz43 262613 


F 


M00055785DC06 


T TC^-NormColon 

V VyX. X^l\^XXXXVyVJX\JXX 


4386 


463290 


1683.H01.gz43 262616 


F 


M00055789CF 1 0 

i»i*y w«y»/ # vyy Vy.X x v/ 


T TC2-NormCo1on 

V/ VyA™ X^( Ullll VyVJXVyXX 


4387 


675768 


1683.D03.gz43_262644 


F 


M00055784C:H02 


UC2-Nortxir , olon 

Vy Vy*« X i Vy i XXlV^VyXVyXX 


4388 


642146 


1683.H03.gz43 262648 


F 


M00055789DB04 


UC2-Norrtir , o1on 

V/ VyX X 1 VyXXIlVyV^XVyXX 


4389 


648905 


1 683 .F04.gz43_262662 


F 


M00055787BF10 


UC2-Normr , olori 

V VyX X VyX XXX Vy VXUXX 


4390 


467306 


1683.I04.gz43 262665 


F 


M00055790DG10 


T TP?-NormPo1on 

W VyX. X 1 VyXXllV^VJJlVJXX 


4391 


497971 


1683.C05.gz43 262675 


F 


M00055783DB02 

XTivwVi/^' / vy xv .iy v/Lf 


T TC2-Norm Pol on 

V VyX. X 1 UX XX IVy VJXUXX 


4392 


645508 


1683.N05.gz43 262686 


F 


M00055795DE09 


T TP2-NormPo1on 

V VyX. X 1 VyX X X 1 Vy Vy X Vy XX 


4393 


644047 


1683.O05.gz43 262687 


F 


M00055796BG05 


UP2-NormPo1on 

V/ VyX. X > Ul XXX V^VJXVJXX 


4394 


554117 


1683.K06.gz43 262699 


F 


M00055793AH09 

liivvuy^ # y ^xk,x xv^ 


T TP2-NormPo1on 

V/VyX. 1 > Vyl lllVyUlUll 


4395 


641919 


1683.K08.gz43 262731 


F 


M00055793BB06 


UP 2-NormPo Ion 

Vy 1 VyX. x> VJXXXIVy vyXVJXX 


4396 


605761 


1683.L08.gz43 262732 


F 


M0005 5794AD08 


UP2-NormPo1on 

Vy VyX. X ~ VyX XIX VyVJXVyXX 


4397 


549434 


1683.D09.gz43 262740 


F 


M00055785AH08 


UC2-NormPo1on 

Vy VyX. X 1 VX XXX Vy VJXVJXX 


4398 


446595 


1683.M09.gz43 262749 


F 


M00055795AF09 


TJC2-NormPolon 

VVVyX X N \Jl XXI Vy V/XVyXX 


4399 


643350 


1683 F10 ez43 262758 


F 


M00055787CD09 

XYXVy Vy Vy -y 1 \J 1 Vy .Xy V/y 


T TP ? -TSlnrm Pnl nn 

V VyX. IN VJX XllVy VJXVy'XX 


4400 


454485 


1683.M10.gz43 262765 


F 


M00055795AF1 2 

liivu \y.y «y / yjxi,.! XX 


T TP9-T\TormPo1on 

V VyX.— IN Ul XXIV^UXVJXX 


4401 


647639 


1683 O10 ez43 262767 


F 


M00055796CE1 1 

lYXvuv/^*^ / Vy\— /.X— < X x 


TTP2-TMorniPo1on 

V VyX— IN VJX XXtVy VJXVJXX 


4402 


510545 


1683 11 1 sz43 262777 


F 


M0005579 1 B-E02 


T TP ? - TVnrrn Pn ? r»n 

VJ VyX 1NU1 ill Vy UlVyVIX 


4403 


77144 


1683Nllez43 262782 


F 


M00055795DH08 


T TP 7 - Norm Pnl nn 

Vy VyX— XNlJlXJlVyVjiU-ll 


4404 


140909 


1683.C12.gz43_262787 


F 


M00055784A:B07 


UC2-NormColon 


4405 


650076 


1683.D12.gz43_262788 


F 


M00055785B:B06 


UC2-NormColon 


4406 


449836 


1683.F12.gz43_262790 


F 


M00055787C:E12 


UC2-NormColon 


4407 


642184 


1683.B14.gz43_262818 


F 


M00055783A:C06 


UC2-NonnColon 


4408 


638962 


1683.H14.gz43_262824 


F 


M00055790B:A08 


^ UC2-NormColon 


4409 


641700 


1683.M15.gz43_262845 


F 


M00055795B:F09 


UC2-NormColon 


4410 


437580 


1683.N15.gz43_262846 


F 


M00055796A:A08 


UC2-NormColon 
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SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


4411 


456626 


1683.C16.gz43_262851 


F 


M00055784A:D05 


UC2-NormColon 


4412 


553979 


1683.016.gz43_262863 


F 


M00055796D:E06 


UC2-NonuColon 


4413 


640525 


1683.017.gz43_262879 


F 


M00055796D:E10 


UC2-NormColon 


4414 


553546 


1683.H18.gz43„262888 


F 


M00055790B:D05 


UC2-NormColon 


4415 


643230 


1683.A19.gz43_262897 


F 


M00055782B:C08 


UC2-NormColon 


4416 


523590 


1683 .B20.gz43_2629 14 


F 


M00055783B:A04 


UC2-NormColon 


4417 


388085 


1683.M20.gz43_262925 


F 


M00055795C-.B10 


UC2-NonnColon 


4418 


457396 


1683.L21 .gz43_262940 


F 


M00055794C:D10 


UC2-NoimColon 


4419 


417617 


1683. 023 .gz43;_262975 


F 


M00055797B:A11 


UC2-NonnColon 


4420 


463821 


1684.G01.gz43_262999 


F 


M00055804D:F02 


UC2-NormColon \ 


4421 


448787 


1684.M01.gz43_263005 


F 


M0O055811C-.AO1 


UC2-NonnColon 


4422 


32812 


1 684 .103 .gz43_263 03 3 k 


F 


M00055807AB10 


UC2-NonnColon 


4423 


418340 


1684.A04.gz43_263041 


F 


M00055798B:D12 


UC2-NoimColon 


4424 


462249 


1684.C04.gz43_263043 


F 


M00055800D:B03 


UC2-NonnColon 


4425 


642099 


1684.P04.gz43_263056 


F 


M00055815C:B03 


UC2-NormColon 


4426 


644915 


1684.A05.gz43 263057 


F 


M00055798B:F02 


UC2-NormColon 


4427 


640222 


1684.G05.gz43 263063 


F 


M00055805AA02 


UC2-NonnColon 


4428 


550370 


1684.K05.gz43_263067 


F 


M00055809AB10 


UC2-NormColon 


4429 


645560 


1684.A06.gz43_263073 


F 


M00055798B:G04 


UC2-NormColon 


4430 


513238 


1 684.E06.gz43_263077 


F 


M00055803A.C06 


UC2-NormColon 


4431 


650231 


1684.M07.gz43_263 10 1 


F 


M00055811D:C02 


UC2-NonnColon 


4432 


644314 


1684.H08.gz43 263112 


F 


M00055806B:B10 


UC2-NormColon 


4433 


650782 


1684J08.gz43_263 1 14 


F 


M00055808B:A04 


UC2-NormColon 


4434 


648221 


1684.P08.gz43_263 120 


F 


M00055815C:G05 


UC2-NonnColon 


4435 


554742 


1 684.M09 .gz43_263 1 3 3 


F 


M00055811D:C12 


UC2-NormColon 


4436 


642962 


1684.P09.gz43_263 136 


F 


M00055815DA11 


UC2-NormColon ! 


4437 


455028 


1684.A10.gz43_263 137 


F 


M00055798DA10 


UC2-NormColon 


4438 


447015 


1684.P 10.gz43_263 1 52 


F 


M00055815D:B02 


UC2-NormColon | 


4439 


642653 


1684.E1 1 .gz43_263 157 


F 


M00055803A:G08 


UC2-NonnColon | 


4440 


647336 


1684.011.gz43_263167 


F 


M00055814C:C07 


UC2-NormColon j 


4441 


549640 


1 684 .P 1 1 .gz43_263 1 68 


F 


M00055815D:B03 


UC2-NormColon 


4442 


642198 


1684.G12.gz43 263175 


F 


M00055805B:C08 


UC2-NormColon 


4443 


520320 


1 684 .1 1 2 .gz43_263 1 77 


F 


M00055807B:F05 


UC2-NonnColon 


4444 


644147 


1684.N12.gz43 263182 


F 


M00055813AD10 


UC2-NormColon 


4445 


530774 


1684 E13 ez43 263189 


F 


M00055803BA11 


UC2-NonnColon 


4446 


375814 


1684.H14.gz43 263208 


i F 


M00055806C:E09 


UC2-NonnColon 


4447 


446789 


1684.114.gz43_263209 


F 


M00055807B:G10 


UC2-NonnColon 


4448 


210839 


1684.J14.gz43_263210 


F 


M00055808B:H07 


UC2-NormColon 


4449 


490414 


1684.B15.gz43 263218 


F 


M00055800A:F02 


UC2-NormColon 


4450 


503491 


1684.M15.gz43_263229 


F 


M00055812AE01 


UC2-NormColon 


4451 


486134 


1684.D17.gz43_263252 


1 F 


M00055802B:H03 


UC2-NormColoh 


4452 


639441 


1684.I17.gz43 _263257 


F 


M00055807C:F05 


i UC2-NormColon 
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SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORE 
NT 


CLONE ID 


LIBRARY 


4453 


551683 


1684.N17.gz43_263262 


F 


M00055813C:E03 


IJC2-NnrmPn1nn 


4454 


649035 


1684.P17.gz43_263264 


F 


M00055816AC04 


UC2-NormCo1nn 

w V_/^« 1 ^ \JX 1 IlV^WI Wll 


4455 


639194 


1684.F18.gz43_263270 


F 


M00055804BC02 


UC2-Norn.f , o1on 

V^^> — 1 ^ VJ1 111 V_yUlUAl j 


4456 


454772 


1684.K19.gz43_263291 


F 


M00055809C:E11 


UC2-NormColoTi 

w X > vl 111V~sVX1 vMX 


4457 


649995 


1684.L19.gz43_263292 


F 


M00055810DG04 

X T JL V/ W VF JL VJh/ « ^JJ Vr I 


UC2-NorrnCo1ot. 

1 \JX XXX V^V/lLMJL 


4458 


466016 


1684.M19.gz43_263293 


F 


M00055812BF03 


UC2-Morm Colon 


4459 


494271 


1 684.N1 9.gz43_263294 


F 


M00055813DB12 


UC2-NorrnC , o1oTi 

W V^Z» X^IUXXXXV>*JXUll 


4460 


460517 


1684J20.gz43_263306 


F 


M00055808D:C12 


UC2-NormColon 

v v^r x i \jx xx i\_> ui ui x 


4461 


470602 


1684.12 l.gz43_263321 


F 


M00055807D:C04 


UC2-NormColon 


4462 


640537 


1684.B22.gz43_263330 


F 


M00055800C:E11 


UC2-NormCoIoTi 


4463 


643991 


1684.D22.gz43_263332 


F 


M00055802C:E12 


UC2-NormColon 


4464 


564382 


1684.C23.gz43_263347 


F 


M00055801D:E06 


UC2-NormColon 


4465 


529325 


1684.G23.gz43_26335 1 


F 


M00055805D:H01 


UC2-NormColon 


4466 


455665 


1684.L23.gz43_263356 


F 


M00055811B:A03 


UC2-NonriColon ! 

>w^^- x i yjx xxx\^\jx\jix 


4467 


645700 


1684.A24.gz43_263361 


F 


M00055799C:D09 


UC2-NormColon 


4468 


520320 


1684.J24.gz43_263370 


F 


M00055808D:F09 


UC2-NormColon 

/ X# 11 \JX X LX V/X UU 


4469 


648752 


1684.M24.gz43 263373 


F 


M00055812CB07 


UC2-Normro1on 

I > kJX X 11 V^UILUJ. 


4470 


642850 


1684.024.gz43 263375 


F 


M00055815AH12 

1T1V V/ \S «y *— ' VJ' X <-/ JTX,X X X^* 


UC2-NormroloTi 

\J Vyx* XlUl XXXV^ V/1L/11 


4471 


556896 


1970.H01.gz43_263397 


F 


M00054702A:C06 


UC2-CoIonMetLiver 

w *-^X» V^VJ iUUiTXWUul V vl 


4472 


549238 


1 970.10 l.gz43 263398 


F 


M00054703DE07 


UC2-ColonMefT iver 

^ V/^^AU Villi VlULLfl V Ol 


4473 


551941 


1970.M01.gz43 263402 


F 


M00054710DA02 


UC2-Co1onlVfpfT iver 

w wZ»^^v/1LI111Y1,CIJL/1 V CI 


4474 


552055 


1970.G02.gz43 263412 


F 


M00054699DA12 


UC2-Co1onMefT ivpr 

VJV^^""Y^tJllJJJlYAwlJL4lVCx 


4475 


483141 


1970.H02.gz43 263413 


F 


M00054702AF03 


W v_^x- Vv'VFXvUll VJ. W U-rl V vl 


4476 


561434 


1970.I02.gz43 263414 


F 


M00054703D F04 


V_/Z- V-r*JiUlXiVlwLl-/l V wl 


4477 


453708 


1970.K02.gz43 263416 


F 


M00054707B B08 


UC2-Colon M etT A ver 

v V/A( V>4JXUXXi VXt/ UL^l V VI 


4478 


557935 


1970.A03.gz43 263422 


F 


M00054689CB11 


UC2-Co1onMefT iver 

v-/ V^X. V/Xv/lllT J.&iXul V vl 


4479 


509202 


1970.H03.gz43 263429 


F 


M00054702A-H03 

iflv v Vk/ i / Vf J>iJi«X XV %/ 


UC2-ColonMetT J ver 

V V/a» V> VJ JAJ.LX1 VI t> I.I /l Vvl 


4480 


549388 


1970.B04.gz43 263439 


F 


M00054690DG03 


UC2-ColonMetLiver 


4481 


551243 


1 970 .C05 .gz43_263456 


F 


M00054693A C09 


UC2^olonMetLiver i 


4482 


549557 


1 970.N05 .gz43_263467 


F 


M00054712CC07 


UC2^olonMetLiver ! 


4483 


551527 


1970A06.gz43_263470 


F 


M00054689DE12 


UC2-ColoiiMetLiver 


4484 


553128 


1970.E06.gz43 263474 


F 


M00054696BH1 1 

iTAvvV/ft/ rv> VXV«X XX X 


UC2-ColonMetT , i ver 

v vx« vUlvlUTlvlUl V vl 


4485 


549151 


1970.K06.gz43 263480 


F 


M00054707CD02 


UC2-ColoiiMetLiver 

w v./ a# v^\/x uui vx v i-i— fi v vi 


4486 


32021 


1970.O06.gz43 263484 


F 


M00054714BF05 


UC2-ColoiiMetLiver 

W v./^ vi/iV/111 VX vtJyl V vl 


4487 


552598 


1970P07ez43 263501 


F 


M00054715DE11 


UC2-ColonMetT.iver 


4488 


552051 


1970JD08.gz43J263505 


F 


M00054695AA12 


UC2-ColoixMetLiver 


4489 


491827 


1970J09.gz43_263527 


F 


M00054706A:G10 


UC2-ColonMetLiver 


4490 


552625 


1970.O09.gz43_263532 


F 


M00054714C:E01 


UC2-ColonMvtLiver 


4491 


552073 


1970.Mll.gz43_263562 


F 


M00054711CA08 


UC2^olonMetLiver 


4492 


552649 


1970.E12.gz43_263570 


F 


M00054697AE03 


UC2-ColoiiMetLiver 


4493 


389201 


1970.H12.gz43_263573 


F 


M00054702D:E07 


UC2-ColonMetLiver 


4494 


552682 


1970.112.gz43_263574 


F 


M00054704D:F02 


UC2-ColonMetLiver 
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ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


4495 


552686 


1970 K12 ez43 263576 


F 


M00054708A:F11 


UC2-ColonMetLiver | 


4496 


559389 


1970A14az43 263598 


F 


M00054690B:B02 


UC2-ColonMetLiver 


4497 


562216 


1970 E14 ez43 263602 


F 


M00054697A:G06 


UC2-ColonMetLiver 


4498 


452392 


1970.H14.gz43 263605 


F 


M00054702D:F05 


UC2-ColonMetLiver 


4499 


552314 


1970 114 ez43 263606 


F 


M00054705B:C08 


UC2-ColonMetLiver 


4500 


584499 


1970 F15 ez43 263619 


F 


M00054699A:G01 


UC2-ColonMetLiver 


4501 


552418 


1970.A16.gz43 263630 


F 


M00054690B:D10 


UC2-ColonMetLiver 


4502 


551996 


1970F16ez43 263635 


F 


M00054699B:A05 


UC2<;olOxiMetLiver 


4503 


549945 


1970.H16.gz43 263637 


F 


M00054702D:H10 


UC2-ColonMetLiver 


4504 


527679 


1970.116.gz43_263638 


F 


M00054705B:D02 


UC2-ColonMetLiver 


4505 


391511 


1970.K16.gz43 263640 


F 


M00054708B:F04 


UC2-ColonMetLiver 


4506 


490393 


1970 P16 ez43 263645 


F 


M00054716B:D06 


UC2-ColonMetLiver 


4507 


552197 


1970L17gz43 263657 


F 


M00054710B:B10 


UC2-ColonMetLiver 


4508 


550975 


1970.0 17.gz43_263660 


F 


M00054715A:G02 


UC2-ColonMetLiver 


4509 


498454 


1 970 .1 1 8 .gz43_263 670 


F 


M00054705B:E04 


UC2-ColonMetLiver 


4510 


553938 


1970.N18.gz43 263675 


F 


M00054713C:D07 


UC2-ColonMetLiver 


4511 


448332 


1970.C19.gz43 263680 


F 


M00054693D:A08 


UC2-ColonMetLiver 


4512 


550053 


1970.J19.gz43 263687 


F 


M00054706C:B12 


UC2-ColonMetLiver 


4513 


550330 


1970B20az43 263695 


F 


M00054692B:C06 


UC2-ColonMetLiver 


4514 


450755 


1970.C20.gz43 263696 


F 


M00054693D:C04 


UC2-ColonMetLiver 


4515 


420686 


1970B21ez43 263711 


F 


M00054692B:D01 


UC2-ColonMetLiver 


4516 


552581 


1970 E21 ez43 263714 


F 


M00054697C:E1 1 


UC2-ColonMetLiver 


4517 


551995 


1970 F21 ez43 263715 


F 


M00054699C:A04 


UC2-ColonMetLiver 


4518 


521552 


1970 H22 ez43 263733 


F 


M00054703C:F01 


UC2-ColonMetLiver 


4519 


553358 


1970 J22 ez43 263735 


, F 


M00054706D:B03 


UC2-ColonMetLiver 


4520 


558768 


1970 L22 ez43 263737 


F 


M00054710C:A12 


UC2-ColonMetLiver 


4521 


42994 


1970 B23 ez43 263743 


F 


M00054692C:B02 


UC2-ColonMetLiver 


4522 


555255 


1970.G23.gz43 263748 


F 


M00054701D:H05 


UC2-ColonMetLiver [ 


4523 


451032 


1970 023 ez43 263756 


F 


M00054715C:D05 


UC2-ColonMetLiver 


4524 


554477 


1970 124 ez43 263766 


F 


M00054705C:D11 


UC2-ColonMetLiver 


4525 


560317 


1970 K24 ez43 263768 


F 


M00054709A:A10 


UC2-ColonMetLiver 


4526 


S 54900 


1993 J01 ez43 263783 


F 


M00054863D:H05 


UC2-ColonMetLiver 


4527 


560254 


1993 L01 ez43 263785 


F 


M00054866BC01 


UC2-ColonMetLiver 


4528 


555740 


1993 E02 ffz43 263794 


F 


M00054857CG09 


UC2-ColonMetLiver 


4529 


551212 


1993 F02 ffz43 263795 


F 


M00054858DC04 


UC2-ColonMetLiver 


4530 


554456 


1993.G03.gz43_263812 


F 


M00054859D:E09 


UC2-ColonMetLiver 


4531 


493303 


1993.M03.gz43_263818 


F 


M00054867B:D02 


UC2-ColonMetLiver 


4532 


522648 


1993.P03.gz43_263821 


F 


M00054870C:C05 


UC2-ColonMetLiver 


4533 


559885 


1993.A04.gz43_263822 


F 


M00054852C:G03 


UC2-ColonMetLiver 


4534 


554257 


1993.H04.gz43_263829 


F 


M00054860D:D01 


UC2-ColonMetLiver 


4535 


554524 


1993.B05.gz43_263839 


F 


M00054854A:H06 


UC2-ColonMetLiver 


4536 


554672 


1993.C05.gz43_263840 


F 


M00054855B:E04 


UC2-ColonMetLiver 



180 



INSDOCID: <WO 0214500A2J_> 



WO 02/14500 



PCT/US01/25840 



Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 

NT 


CLONE ID 


LIBRARY 


4537 


560714 


1993.D05.gz43 263841 


F 


M00054856D-E11 


V/ V/i V_/ VXV/XIX VXV/ LX-r X v ox 


4538 


557852 


1993.G05.gz43 263844 


F ; 


M00054859DG04 


TJr*2-Pnlnnl\/rptr ivf»r 

\J \s*-r V/ V/Xv/XXXVX w LXjX V vi 


4539 


553295 


1993 JOS .gz43 263847 


F 


M00054864BB02 


v-/V_/^r VyVIAV/XXXVXv>IJL/X Vwl 


4540 


559027 


1993N06ez43 263867 


F 


M00054868C-G1 1 


V/ V_* Z. V*/ VJ X V/XXL VXt> LXv 1 V V/X 


4541 


548861 


1993 B07 ez43 263871 


F 


M00054854B-C04 


vJ V-^ -<i. V_y UXVJXXLYXt/ UL/l V CI 


4542 


554500 


1993 E07 cz43 263874 


F 


M00054857DF1 2 


VJ V/Z, ~ V_^*JHjXxlVXC ULrl VCx 


4543 


522507 


1993 107 ffz43 263878 


F 


M00054862CD05 


U V_>^. v^U lUJxiVxC LL/1 V CX 


4544 


501534 


1993 J08 ez43 263895 


F 


M00054864R-D08 


T TP9 -PnlnnA/Tf»tT ivf»r 

VJ V^Z. V^VIIVJXXXVXCUL/l V CI 


4545 


556715 


1993 A09 ffz43 263902 


F 


M00054852DD09 

x vx v/ v/ v/ "T o ^/ xy . xy v ✓ 


VJ V/Z. Vw'V/XUIxXVXCIXjI V CI 


4546 


496897 


1993 C09 ez43 263904 


F 


M00054855C-B06 


VJ V/Z, "~V_/ V/lV/XlXVXCLJL/1 VCX 


4547 


554084 


1993 D09 ez43 263905 


F 


M00054857AB1 1 


TTP9-PnlnnMptT ivpr 

VJ \_^-<i. V^UXV/lXlVXCU-fl VCX 


4548 


554497 


1993 E10 ez43 263922 


F 


M0005485 8 A- B05 


VJ V-/Z. ~V^«JXUXXXtXCIJL/1 VCX 


4549 


555968 


1993 F10 ez43 263923 


F 


M00054859A-D10 


T JC2-Po1on M etT .iver 

VJ V_/ -i- V/X V/XXX VXC LLfl V CX 


4550 


43642 


1993 K10 ez43 263928 


F 


M00054865B-H04 

IVlVvVJ " O UJ XJ . X 1V/T 


VJ V_^x* V_^VJXvJXxiVXC UL/X V CI 


4551 


595506 


1993 L10 ez43 263929 


F 


M00054866CG07 


VJ \~f^ Vs V/XUlXLVXVv LJ—/X V CX 


4552 


594434 


1993 O10 ez43 263932 


F 


M00054869D-F02 


VJ Vy ^ V-'V/XVIXxXvXClX-fl V CX 


4553 


555702 


1993 P10 2z43 263933 


F 


M00054870DG08 


VJ V/^< V/ VJXVXXXXVXC LX-rX V CI 


4554 


556132 


1993 All ez43 263934 


F 


M00054853AA10 


VJ V_^Z* V-/VHV/XXXVXCLX-/X VC1 


4555 


555667 


1993Fllffz43 263939 


F 


M00054859A-E02 


VJ V^Z. *V UXUXXIVXC LXjI VCX 


4556 


519109 


1993Jllez43 263943 


F 


M00054864C* AO 1 

lVJLvvv VTT V/ .xxv X 


T TP 9 -Pnl nnlVf ptT ivpr 

VJ V//- V-/ VJXVJXXi VXt/ LX-jX v CI 


4557 


450829 


1993 B12 ez43 263951 


F 

X 


M00054854r*H02 

iYlUUUJ*TtJJtV/.l XV/ X# 


TTP9-Pn1on"M"f*tT iv^r 

VJ V> V-' V/XUXXL VXC IX-fl V CI 


4558 


463513 


1993 M13 sz43 263978 


F 


M00054867DH03 


TJP2-Pnlr»nMf k tT iver 

VJ V^^» V^VIXV/XXXVXCU-fX VCX 


4559 


552396 


1993 N13 ez43 263979 


F 

X 


M00054869AA09 


TTP9-Pr»lonlVff*fT iwr 

VJ -i- V-^ VfXVJXXXVXC IXjX V CX 


4560 


557077 


1993 C14 &z43 263984 


F 

X 


M00054855D-G12 

1VX V7 Vy vy " O ^ X-/ . VJ Xw 


T Tp 7 -Pnl nn TVTetT ivpr 

VJ V/Z V_^JX\JX1XVXCLXjX V CI 


4561 


554756 


1993 D14 ez43 263985 


F 

X 


M00054857AG09 


TIP9-PnlnnMpfT ivpr 

VJ V^^^V^UXVUXXVXCLXwl VCX 


4562 


482306 


1993 E14 sz43 263986 


F 


M00054858A.D06 


VJ V^r^r V/X V/XXXVXC UL-/1 V CX 


4563 


560259 


1993 H14 ez43 263989 


F 

X 


M00054861 B-H08 

lVXV/V/V/*/^OVI XXv.X XV/ o 


TTP9-Pn1nnlV/rptT n/f»r 

VJ V_y ~ v_y V/lUXXiVXC ULyl V CX 


4564 


402742 


1993 L14 &z43 263993 


F 


M00054866DC03 

1 VX V/ X/ V/ ««/ O VI ux/ . V> Vf «/ 


T TP9-Pr>1nn"M<»tT .i ver 

VJ V-/Z, V^ V/X VIXXXVXC UL/X VCX 


4565 


390017 


1993 P14 ffz43 263997 


F 

X 


M0005487 1 A-C04 

IVXV/Vf V/JtO / lzx.\/vt 


TTP9-Po1nnA/fptT ivpr 

VJ VyZ^y VJ1UXXXVXC tXyl V CX 


4566 


554989 


1993 D15 ez43 264001 


F 

X 


M000S48S7 A Gl 2 

jlvxv/v/\/»/*to«/ / yrv.vJ lz* 


T TP9-Pr>1nn"MpfT ivpr 

VJ V-yZ, V^ V/XVjXxiVACLLvl V CI 


4567 


550714 


1993 F15 az43 264002 


X 


M0005485XAT)0X 

lVXVy V/W^*tO_J 0/l.i/uO 


TTP9-Pn1nnlv/Tp+T iv/pr 

VJ V/Z," V^UXUXlIVlCUL/l V CI 


4568 


562962 


1993 115 oz43 264006 

1^7«/.1JL J.gZiTJ ^»V"T\/\7\J 


T7 

X 


ivxv/V/ V/*/*tO VJ Jyrx..yrV X X 


TTP9-Pn1nn"lv/fptI ivf»r 

VJ V^Z." V^UlUXxlVxCUL/l VCI 


4569 


394567 


1993 K16 ffz43 264024 


F 

X 


M0005486STVF05 

iviUvV/J^rOUJxi.xvJ 


TTP9-Pn1nnA/TpfT ivpr 

VJ V^Z,~A^VJXCXXXVXClx-(X V CX 


4570 


457825 


1993 N17 ffz43 264043 


F 

X 


XVXV/ \J \J J *T O VI 7/1, !»/ \Jis 


TIP9-Pn1nnlVfptT ivpr 

VJ Vy Z. V_x VJXVJXXXVXC LX-/1 V W 


4571 


553072 


1993 017 az43 264044 


F 

X 


M00054870B- A 1 ? 

XVXV/V/V/-/*tO / UD.iT.lZ. 


TTP9-Pn1onA/TptT iver 

VJ V_xZr Vy VJXVjXXiVXC LX-vI VCX 


4572 


522703 


1 993 .A 1 8 .gz43_264046 


F 


M00054853B:E07 


UC2-ColoixMetLiver 


4573 


451491 


1993.E18.gz43_264050 


F 


M00054858B:A10 


UC2-ColonMetLiver 


4574 


551811 


1993.H18.gz43_264053 


F 


M00054861C:E10 


UC2-ColonMetLiver 


4575 


599433 


1993.A19.gz43_264062 


F 


M00054853B:E10 


| UC2-ColoiiMetLiver 


4576 


593343 


1 993 .D 1 9.gz43_264065 


! F 


M00054857B:D08 


UC2-ColoixMetLiver 


4577 


554613 


1993.C20.gz43_264080 


F 


M00054856A:F08 


! UC2-ColoiiMctLiver 


4578 


497365 


1993.G2Lgz43_264100 


F 


M00054860C:E04 


UC2<:olOxxMetLiver 
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Table 2 



SEQ 
ID 
NO 


:lustef 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


45 79 
*+j / 7 


555553 
jj j j j j 


1993 021 ez43 264108 


F 


M00054870B:G12 


UC2-ColonMetLiver 


4580 


481641 


1993 P21 ez43 264109 


F 


M00054871A:H07 


UC2-ColoiiMetLiver 


4581 


557974 


1993 M23 ez43 264138 


F 


M00054868B:A05 


UC2-ColoiiMetLiver 


4582 


557719 


1993 F24 ez43 264147 


F 


M00054859D:D02 


UC2-CoIonMetLiver 


4583 


551591 

J — ' X— ' ,7 X 


1993 L24 ez43 264153 


F 


M00054867A:G01 


UC2-ColoiiMetLiver 


4584 


553586 


1993 P24 ez43 264157 


F 


M00054871B:E10 


UC2-(^loiiMetLiver 


4585 


553983 

J J J 70J 


2018 J01 ez43 264167 


F 


M00055049B:F05 


UC2<;oloiiMetLiver 


4586 


557939 

J J / 7 J7 


2018N01ez43 264171 

^bUXUillV Jl>gw^«' X*Xy^^ X # X 


F 


M00055053D:G04 


UC2-ColonMetLiver 


4587 


549858 


2018 P01 ez43 264173 

x*Vy X U .X \y X - gi>~«-' r x / — ' 


F 


M00055056C:A05 


UC2-ColoiiMetLiver 


4588 


561838 

Jul O O 


2018 F02 ez43 264179 


F 


M00055045D:F10 


UC2-ColonMetLiver | 


4589 


561438 

JVJ 1TJO 


2018 A03 sz43 264190 


F 


M00055039A:G06 


UC2-ColonMetLiver 


45Q0 


447555 

"1*1 / J J J 


901 8 N03 ez43 264203 


F 


M00055054A:A10 


UC2-ColonMetLiver 


4591 


556490 


2018 104 ez43 264214 


F 


M00055048C:C06 


UC2-ColonMetLiver 


45Q9 


554377 

J J*T J / ^ 


20 1 8 L04 ez43 2642 1 7 


F 


M00055051C:F10 


UC2-ColonMetLiver 


45Q3 


557504 


2018P04ffz43 264221 


F 


M00055056C:C05 


UC2-ColonMetLiver 


45Q4 


555340 


9018 R05 <7743 264223 

lO. JJ vj J , A»vrTA»A«j 


F 


M00055041A:C02 


UC2-ColoiiMetLiver 


4505 


540000 


9018 1.05 ez43 264233 


F 


M00055051D:D12 


UC2-ColoiiMetLiver 


*»• j^o 


598775 

JZO / / J 


9018 005^743 264236 


F 


M00055055B:H1 1 


UC2--ColoxiMetLiver 


45Q7 


55301? 


9018P05ffz43 264237 

Z*\J X O .XT V/ J .batt J x«< vy i Xrf«y / 


F 


M00055056C:D07 


UC2-ColonMetLiver 


4508 


386940 


9018 H06 ffz43 264245 


F 


M00055047C:D11 


UC2-CoIonMetLiver 




55Q775 
jj" / / j 


901 8 TC06 ^743 264248 


F 


M00055050D:C01 


UC2-ColonMetLiver 


4600 


4Q3961 


901 8 M06 ff743 264250 

£*\J X O ..LYXV/V7. e£x*T J £-t\J~ \J 


F 


M00055052D:G12 


UC2-ColonMetLiver 


4601 


555660 

JJJUUv 


2018A07ez43 264254 


F 


M00055039B:E02 


UC2-ColoiiMetLiver 




561901 

JVI X -7 \J X 


9018 C07 ez43 264256 


F 


M00055042A:D09 


UC2<:olonMetLiver 


4603 


554936 


2018H07ez43 264261 


F 


M00055047C:F07 


UC2-ColoxxMetLiver 


4604 


555010 


2018J07ez43 264263 


F 


M00055049C:H12 


UC2-ColoiiMetLiver 


4605 


560801 


2018L07ez43 264265 


F 


M00055051D:F01 


UC2<:oloiiMetLiver 


4606 


555993 


2018A08ez43 264270 


F 


M00055039B:G11 


UC2<:olonMetLiver 


4607 


465907 


2018 H08 2z43 264277 


F 


M00055047C:F08 


UC2-ColonMetLiver 


4608 


4191 53 

7171 Jj 


2018 108 ffz43 264278 


F 


M00055048D:D08 


UC2-ColonMetLiver 


460Q 


569808 


9018P08a743 264285 


F 


M00055056C:F04 


UC2-ColoiiMetLiver 


461 0 


601365 

VJV* x JU J 


2018 A09 ez43 264286 


F 


M00055039B:H10 


UC2-ColoiiMetLiver 


461 1 
HOI 1 


^75380 

J / J JOv 


9018 F09 &743 264291 


F 


M00055046B:C07 


UC2-ColonMetLiver 


'fOIZ 


56171 6 


9018T09a743 964995 


F 


M00055049DD09 


UC2-ColonMetLiver 


zt61 1 




9018 000 ot43 964300 


F 

X 


M00055055CC09 


UC2-ColonMetLiver 


4614 


555524 


2018.B10.gz43_264303 


F 


M00055041B:D11 


UC2-ColonMetLiver 


4615 


549008 


2018.C10.gz43 264304 


F 


MOOO55042B:BO9 


UC2-ColonMetLiver 


4616 


555771 


2018.E10.gz43 264306 


F 


M00055045A:F03 


UC2-ColoixMetLiver 


4617 


515115 


2018.O10.gz43_264316 


F 


M00055055C:D02 


UC2-ColonlVl^iver ! 


4618 


529733 


2018.All.gz43 264318 


F 


M00055039C:B05 


UC2-ColonMetLiver 


4619 


553850 


2018.Ell.gz43 264322 


F 


M00055045A:F12 


UC2<:olonIVIefLiver 


4620 


557735 


20 1 8 .Gl 1 .gz43 _264324 


F 


M00055047A:E07 


UC2<:oloiiMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTEB 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


4621 


555798 


2018.Hll.gz43 264325 


F 


M00055047DC12 


TJC2-ColonMetLiver 

VJ v^^« UlUlliVlv L J— '1 V vl 


4622 


553108 


2018.Mll.gz43 264330 


F 


M00055053BA02 

X TX> Vr V_/ \y «S \J^J <+s X^r a XXV*« 


UC2-ColonMetLiver 


4623 


555708 


2018 C12 ez43 264336 


F 


M00055042BE02 


T JC 2-Co1 onMetLi ver 


4624 


559699 


2018 E 12 ez43 264338 


F 


M00055045BA04 


T Tp 9 -Pnl on lVfptT iver 


4625 


551068 


2018 K12 ez43 264344 


F 


M00055051AC09 


T TP?-Cn1nnlVfptT .ivf^r 

V^V/lWXXlVXCLXwl Vtl 


4626 


559246 


2018L12ez43 264345 


F 


M00055052AF07 


UC2-ColonMetLiver 

VJ V-y^ V> WIVJIXI VXL/LXuX V L>1 


4627 


557401 


2018P12gz43 264349 


F 


M00055056DB06 


UC2-ColonMetLiver 


4628 


557525 


2018 M13 sz43 264362 


F 


M00055053BC02 


UC2-ColotxMetLiver 


4629 


8997 


2018N13ez43 264363 


F 


M00055054BE10 


UC2-ColonMetLiver 

VJ \_y ^ V-/ V/1V/1UtXw L.I .il VW 


4630 


560720 


2018.E14.gz43 264370 


F 


M00055045B:C08 


UC2-ColonMetLiver 


4631 


558120 


2018.G14.gz43 264372 


F 


M00055047A:H05 


UC2-ColoiiMetLiver 


4632 


349744 


2018.I14.gz43 264374 


F 


M00055048D:H04 


UC2-ColonMetLiver 


4633 


561259 


2018.A15.gz43_264382 


F 


M00055039D:D07 


UC2-ColoiiMetLiver 


4634 


557326 


2018 G15 ez43 264388 


F 


M00055047BA10 


IJC2-ColonMetLiver 

V/ V^4h» \— ^>J1\-/1X1TXV/1. 1 <X V wX 


4635 


449437 


2018 J15 ez43 264391 


F 


M00055050AD1 1 


UC2-ColoiiMetLiver 


4636 


555340 


2018 K15 ez43 264392 


F 


M0005505 1 AG09 

lTlVUV^^V/^ 1Z1.VJV/ 


UC2-r^lorxMetLiver 

W x^X« V/vlvillTXvlXJl V vl 


4637 


555172 


2018 C16 ez43 264400 


F 


M00055042CB05 


T JC2-ColoTiMetLi ver 


4638 


289316 


2018 G16 ez43 264404 


F 


M00055047BB03 

iVlV/V/UJ JUT / JJ.JJUJ 


l-J v_^^—' vIXV^XXLVXwLXwX V vl 


4639 


557279 


2018 H16 ez43 264405 


F 


M00055048AA04 


W V^^r vVlUJXIVXv Uul V vl 


4640 


556169 


2018 264407 


F 

X 


M00055050AH08 

ItIV/vV/ J JUJ v/^»..X Xv/O 


TTr*9-r*nlnnM^fT iver 

C v^UlvxlxVXvU-fX V vl 


4641 


551897 


2018 K16 e?43 264408 


F 


M0005505 1 A HI 0 

lVlV/v/V/^/JV/J lxv,l 11V/ 


T TP9 -Polnn A/TpfT ,ivf»r 

KJ V/Z, *"V_>^JHJ1XLVXC LX-/1 V Vl 


4642 


556391 


2018 P16 ez43 264413 

Z*v/lU.l lv.g^'TJ AUTTU 


F 

X 


M00055056DH12 

IVlVUUJJWJUL/.lHi 


T TP9-Pn1nri"MetT .iver 

\w/^« WXVMXIYX vLXjI V Vl 


4643 


562949 


2018 D17 ez43 264417 


F 


M00055044AA08 


\J \~s.£* V/UJlVJLLL¥lv I I /l V Vl 


4644 


551371 


2018 L17 ez43 264425 


F 


M00055052BE03 


T Tr t 7-r , o1on"M'ef T ,iver 

V^Z. V/VJlUlliVlvLl-/l V vl 


4645 


562292 


2018 M17 cz43 264426 


F 


M00055053CB03 

XYlvV/UJ JVJJv.UvJ 


T IC^-PolonA/fet T jver 

VJ V-s \— ' vlUlUYXv t.l fX V VX 


4646 


555082 


2018 N17 ez43 264427 


F 


M00055054BG12 

irivv/v/ j jvjtjj.vj x^ 


I JC 2-ColonMetLiver 

\J V/A v>vl vlll'XvUJl V VX 


4647 


562272 


2018 P 17 ez43 264429 


F 


M00055057A* A04 


UC2-ColoiiMelLiver 

V-/ ^ VV/lvUlTXVLL/1 V VX 


4648 


555883 


2018 A18 sz43 264430 


F 

X 


M00055040AF01 


T TC?-Pn1nnMetT ,iver 

\^sjL. vUlUlllYXvlXJl V vl 


4649 


558389 


2018 F18 sz43 264435 


F 


M00055046CC05 


UC2^olonMetLiver 


4650 


463341 


2018 018 ez43 264444 


F 


M00055056AH12 

lVlUVU J J V J VA.lllZ; 


T TP2-Pn1nn"MetT .iver 

\J V-'\J'1W1X1 VXV U-jX V VI 


4651 


553002 


2018 P18 ez43 264445 


F 


M0005 5 057A- A05 

Xiiuv/v/jjvj / iv.nvj 


T JC2-ColonMetLiver 

\J \^sJ~* VlVlllVXV LXJX V VX 


4652 


224812 


2018 E19 ez43 264450 


F 


M00055045CH05 


UC2^olonMetLiver 

\_J WlvlLLTXVUJl V Wl 


4653 


206098 


2018 J19 ez43 264455 


F 


M00055050B-E1 1 


UC2-ColoiiMetLiver 

V-/ V>^< Vvlv/lUVXVU-11 V Wl 


4654 


453893 


2018 H20 sz43 264469 


F 


M00055048AF04 


UC2^oloiiMetLiver 

V-/ V^^^ VVlUlUTlWUUl V VX 


4655 


557308 


2018N21ez43 264491 


F 

X 


M00055054DA02 


I TC2 -C ol oTiMetLi ver 

VJ 1 V>Xi V> UlUlliVXV IL/l V Vl 


4656 


470617 


20 1 8 .B22 .gz43_264495 


F 


M00055041C:H12 


UC2"ColonMetLiver 


4657 


555057 


2018.N22.gz43_264507 


F 


M00055054D:E12 


UC2-ColoiiMetLiver 


4658 


447815 


2018.A23.gz43_264510 


F 


M00055040C:G08 


UC2-ColoiiMetLiver 


4659 


561963 


2018.N24.gz43_264539 


F 


M00055055A:B03 


UC2-ColoixMetLiver 


4660 


551544 


2018.P24.gz43„264541 


F 


M00055057B:B01 


UC2-Q)lonMetLiver 


4661 


555213 


2018.D07.gz43_264641 


F 


M00055043B:B01 


UC2-ColonMetLiver 


4662 


555010 


2018J07.gz43_264647 


F 


M00055049C:H12 


UC2-ColonMetLiver 
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SEQ 
ID 
NO 


CLUSTEF 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


4663 


505451 


2018.P07.gz43 264653 


F 


M00055056C.E10 


UC2-ColonMetLiver 


4664 


555993 


2018.A08.gz43 264654 


F 


M00055039B-G11 

l"Xu Vf V/ w/ W**r ^ 1 ■# • V^P X X 


UC2-ColonMetLiver 

V_ /^w V/VIV/XXLtXVULJX t vv* 


4665 


465207 


2018.H08.gz43 264661 


F 


M00055047C-F08 

*r*v v V»»/*/ VT / V_^ ♦ X V Vr 


UC2-ColoiiMetLiver 


4666 


557834 


2018 K08 ez43 264664 


F 


M00055050DF04 


UC2-ColonMetLiver 

V- / v_^-x^ \y\yi\yiiiTXvvXyi V wi 


4667 


557882 

tJ^J f \J OX* 


2018 L08 ez43 264665 


F 


M00055051DG01 


UC2-ColonMetLiver 

v_y x_> vywivui'xvixyi v vvx 


4668 


551711 


2018B09gz43 264671 


F 


M00055041BC10 

j. rx v vy \y«— 7 — ^ r x»«» • x v 


UC2-ColoiiMetLiver 


4669 


555524 


2018 BIO ez43 264687 


F 


M00055041BD11 


UC2-ColonMetLiver 

V-/ v/ v/xy/i 1 1 v x vy ixjx v va 


4670 


446739 


2018 J10 az43 264695 

X>* Vr X VP • »r X V/ I 1 »y **V# » **r 


F 


M00055049D:D10 


UC2-ColoiiMetLiver 


4671 


385531 


2018P10gz43 264701 

X-r V* X VP ♦ X X V tfjl > r Xrf V 1^ # vr x 


F 


M00055056CH07 


UC2-ColonMetLiver 


4672 


559052 


2018 Fll az43 264707 

v X *-T * X X X - J ^ r f * — ' VT * * V # 


F 


M00055046B:D02 


UC2-ColonMetLiver 


4673 


553108 

+*J X \J \J 


2018.Mll.gz43 264714 


F 


M00055053B.A02 


UC2-ColonMetLiver 


4674 


397338 


2018A12az43 264718 

A* Vr x vp *x x x * t'y* ' ■ """^ ^* T # X VP 


F 


M00055039CD11 

x t j. vy v/ vr ^ \-r ^ v • X-^ X x 


UC2-ColoiiMetLiver 


4675 


559699 


2018.E12.gz43 264722 


F 


M00055045B:A04 


UC2-ColonMetLiver 


4676 


553975 


2018.O12.gz43 264732 


F ; 


M00055055C:E08 


UC2-ColonMetLiver 


4677 


561994 


2018 B13 ez43 264735 


F 


M00055041BF04 


UC2^oloiiMetLiver 


4678 


448098 


2018.C13.gz43 264736 


F 


M00055042B.E05 


UC2-ColoiiMetLiver 


4679 


558254 


2018 D 13 ez43 264737 


F 


M00055043D:D10 


UC2-ColonMetLiver 


4680 


557420 


2018.F13.gz43 264739 


F 


M00055046B:E08 


UC2^olonMetLiver 


4681 


558007 

—s S vy vy vy i 


2018 Gl 3 sz43 264740 

x^* vr x vp • x «y • fi^j r^y ■ * ■ vy 


F 


M00055047AH04 

ITXv v v*/ Vr ■ « X X * X XV ■ 


UC2^olonMetLiver 


4682 


557525 


2018 M13 ez43 264746 

X* Vr X VP tXTJ, X mfZ^Z+Tm* Xrf VP 1 T # IV 


F 


M00055053B:C02 


UC2-ColoiiMetLiver 


4683 


450765 


2018.C14:gz43 264752 


F 


M00055042B:E08 


UC2^olonMetLiver 


4684 


247 


2018 D14 ez43 264753 

«W Vr X Vr t X-^ X T .C.Zj i *^ X^ VP P / «y 


F 


M00055043DF07 


UC2^olonMetLiver 


4685 


560720 


2018 E14 ez43 264754 

X Vf « -1— ' X T . t-^X^ T X># Vr T # I 


F 


M00055045B.C08 


UC2-ColonMetLiver 


4686 


491799 


2018 F14 ez43 264755 


F 


M00055046BF06 


UC2-ColonMetLiver 


4687 


558120 


2018 G14 ez43 264756 

si*\t x vy . vj x ■ » }i*.m 9 — y x*vy » # v 


F 


M00055047AH05 


UC2-ColonMetLiver 


4688 


432970 


2018 K14 ez43 264760 

Vr x Vr «X m*. X ■ • * J ^ X* vr w r Vr Vr 


F 


M00055051AE11 


UC2-ColonMetLiver 


4689 


561259 


2018 A15 ez43 264766 


F 


M00055039DD07 


UC2-ColonMetLiver 


4690 


551630 


2018O15ez43 264780 


F 


M00055055DD11 


UC2-CoIonMetLiver 


4691 


461 


2018 M16 ez43 264794 


F 


M00055053CA12 


UC2-ColonMetLiver 


4692 


555399 


2018A17ez43 264798 


F 


M00055040AC02 


UC2-CoIonMetLiver 


4693 


554149 


2018 B 17 ez43 264799 

X.V/XO.XJX / .f~ % t,t~~s *L~\J~ i ^ ^ 


F 


M00055041CC10 

ITXv Vr Vf w Vr T X V_^ ■ V—-' X V»r 


UC2-ColonMetLiver 


4694 


553603 


2018G17ez43 264804 


F 


M00055047BB10 


UC2-ColonMetLiver 


4695 


552641 


2018 Kl 7 ez43 264808 


F 


M00055051BB08 


UC2-ColonMetLiver 


4696 


562292 


2018 M17 ez43 264810 


F 


M00055053CB03 


UC2-ColonMetLiver 


4697 


562272 


2018P17ez43 264813 


F 


M00055057AA04 

XTXv/v V*/*r Vk/ # X V * X XVr I 


UC2-ColoiiMetLiver 


4698 


450755 


2018.D18.gz43_264817 


F 


M00055044A.C02 


UC2-ColonMetLiver 


4699 


555892 


2018.E18.gz43_264818 


F 


M00055045C:F09 


UC2-ColonMetLiver 


4700 


551793 


2018.G18.gz43_264820 


F 


M00055047B:C03 


UC2-ColonMetLiver 


4701 


553002 


2018.P18.gz43_264829 


F 


M00055057A:A05 


UC2-ColonMetLiver 


4702 


487182 


2018.119.gz43_264838 


F 


M00055049A.F10 


UC2-ColonMetLiver 


4703 


206098 


2018J19.gz43_264839 


F 


M00055050B:E11 


UC2-ColoiiMetLiver 


4704 


557572 


2018.K19.gz43_264840 


F 


M00055051B:D07 


1 UC2-ColonMetLiver 



184 



INSDOCID: <WO 0214S00A2J_> 



WO 02/14500 



PCT/US01/25840 



Table 2 



SEQ 
ID 
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SEQ NAME 


0RIE 
NT 


CLONE ID 


LIBRARY 


*T / V J 


551334 

J J X J J*T 


2018M19ez43 264842 


F 


M00055053C:F06 


UC2-ColonMetLiver 


4706 


561046 


9018T90O743 264854 


F 


M00055049A-G03 


UC2-ColonMetLiver 


4707 




901 8 T 90 ot43 264857 


F 


M00055052CB12 


UC2-ColonMetLiver 


47H£ 


S^l 61 6 
j JlOxO 


901 ft TSJ90 0*43 96485Q 

^UXO.xNZv/.gZ*rJ ZlrrOJ~ 


F 

X 


M0O0 5 5 054C *G1 0 


UC2-ColonMetLiver 


47HO 




9018P90or43 264861 


F 


M00055057A*D10 


UC2-ColonMetLiver 


471 f\ 


548Q35 

J*rOJ7J J 


9018 C91 oy43 264864 


F 


M0005 5042D G03 


UC2-ColonMetLiver 


471 1 


4££Q70 
'foo? /U 


901RF91 ct-74^ 964867 


F 


M00055046CE07 


UC2-ColoiiMetLiver 


471 9 
/ 1Z 


JJJ / Jl 


9018 A99 0743 964878 

1 0 . AZ>^> .gZi^J Z,VJ*rO / O 


F 


M00055040BF02 


UC2-ColonMetLiver 


AT\ 1 




9018 P99 a^43 964880 


F 


M00055042DH02 


UC2-ColoixMetLiver 


471 4 


5^1 ^7R 


9018 F99 ot43 964889 


F 


M00055045DA07 


UC2-ColonMetLiver 


471 

*f / 1 j 


556171 


9018 ft99 ot43 264884 


F 


M00055047BG06 


UC2-ColonMetLiver 


*t / It) 


4^1 1 77 
*f J 1 1 / Z 


901 X P99 ot43 9648Q3 

Zl/lO.riZ.gZ'+J ZrV/tO J 


F 

JL 


M00055057AF10 


UC2-ColonMetLiver 


471 7 


jjjZjU 


901 X R9^ ot43 9648Q5 

-ZU 1 0 .XjZ*D .gZ.t J ^Ut07J 


F 

X 


M00055041DB07 


UC2^oloiiMetLiver 


*f / lo 


476968 
4 /OZOo 


9018 fr93 or43 264Q00 


F 


M00055047BG10 


UC2<3oloxiMetLiver 


471 Q 
/ lit 




9018 K9^ a^43 964Q04 


F 


M0005505 1CB05 


UC2^oloiiMetLiver 


477 H 
*\l ZU 


<;^i <;6i 

JJ IjOI 


9018 09^ 0743 964Q08 


F 

X 


MO 0 05 5 05 6B * GO 1 

XVXV/V/V/J Jvr-^ \JU . VJ V/ x 


UC2^olonMetLiver 


All 1 


JJJ 1 / J 


901 » P94 a^4^ 96401 9 


F 

X 


M00055043AB06 


UC2<^olonMetLiver 


4T99 
4 /ZZ 


j j / joZ 


901 X T 94 fr^4^ 96409 1 


F 

X 


M 0005 5 05 2D B05 


UC2^olonMetLiver 


477*2 


jjjOIO 


9090 W01 ot4^ 9640^^ 

ZUZU ,xx\J X *&£rr D Z.KJ'^zfD J 


F 

X 


M00055081A:E08 

XVXV/V/V/J JV/u X-tx . J t\J\J 


UC2^oloiiMetLiver 


ATI A 


^40607 
j*f you / 


9090 T01 ov43 964Q35 


F 


M00055083BE05 


UC2-ColonMetLiver 


477^ 


40780^ 

H7Z07J 


9090 T 01 0743 264037 


F 

X 


M00055086AB10 


UC2-ColonMetLiver 


477 


^6471 


9090 F0? ot43 964046 


F 

X 


M00055077DB01 

ATX \J\J \J *J \J 1 f .X» ' V J- 


UC2-ColoiiMetLiver 


4777 
*r /Z / 




9090 T 09 CT^43 964053 


F 

X 


M00055086AC09 

ivxv/v/v/ j jvov/rv. v^v/^ 


UC2-ColonMetLiver 


479 Q 
4 /Zo 


jj IZoj 


9090 "KT09 o-z4^ 964055 


V 

X 


M00055 088D • AO 1 

Xvxv/v/v/j j v/ 1> ui/ liiu x 


UC2-ColonMetLiver 


477 Q 

*r /zy 


^9.790 
J JO /zu 


9090 P03 o-t-43 964060 


F 

X 


M00055076AC06 

XVXV/V/V/ J Jv / V/xX.. V»/V/ w 


UC2-ColonMetLiver 




JJJJ 1Z 


9090 A 04 o^43 964074 


F 

X 


M0005 5 073D F06 

XVXV/V/V/ J J V/ / JJLS.X \/V/ 


UC2-ColonMetLiver 


47^1 
*f / j 1 


^401 9Q 

j^y izy 


9090 FI04 o-t-43 964077 


\ V 

X 


M0005 5 077AB07 

XVXV/V/V/ J—J v/ # / jtX..X_#V/ / 


UC2-ColonMetLiver 


AH11 


«£4Q'7 

jjo4y / 


7090 T04 o-t4^ 9640R^ 
Z UZ U . J U4 . gZ^f j _X O^t y o j 


17 


MOOO55O83C-C05 

XVXV/vrV/ J J UOJVv.vwJ 


UC2-ColonMetLiver 


AH11 
H / 36 


^£1 497 
J014ZZ 


9070 "M04 mA'X 964QR7 
ZUZU.±NU4.gZ4J_^ZrU**i/o / 


17 


MOOO55O88D*B08 

IVXvv V J J V/O OX/ .X/u U 


UC2-ColonMetLiver 




CCZI/I AO 

JJD4UO 


7H9H OH4 eirrA'X 964088 
ZUZU.LIU4.gZ < l-j ZrOHhi^oo 


F 


MOOO55090CB03 


UC2-C^lonMetLiver 


*r / J J 


jjOOjj 


707H "RO^ cr-rA^ 964001 
ZUZU .DvJ .gZ^f J XDfi^lf 1 


T7 
r 


MOOO55O75AC09 

XVXV/V/V/ J JV/ / — /xX. \_/ V7 


UC2-ColonMetLiver 


*t / J0 


jjjjOo 


7070 f*H^ o-y4^ 964QQ9 
ZUZU . vA) J .gZH* j Z#U*ti7 y^ 


17 

IT 


MOO055076AD1 1 

XVXV/V/v/ J JV/ / V//»..JJ x X 


UC2-ColonMetLiver 


AH1H 
*\ 1 5 1 


4i04Uo 


9H9H T7H6 rr^A^ 965H1 1 
ZUZU.rUO.gZ4j__ZQJUI x 


T7 
F 


M0005 5078D-G04 

XVXV/V/V/J J V/ / O JJ.VJ v/"T 


UC2-ColonMetLiver 




jjZjOi 


9H9H Un£ rr-y4'5 96^01 ^ 
ZUZU .riUO.gZ4j ZOj U 1 j 


•c 
r 


MOOO55O81B-F10 

XVXViWv/J JV/O XJJ.J_/X V/ 


IJC2^oloijMetLiver 

V/ V_y^» V/ViV/lUTXvlxvi V v/x 




JJ0440 


9H9A l?*fi6 rt-rAI 96^01 6 
ZUZU.Jx.UO.gZ4j ZOjUIO 


F 
r 


M00055084DR01 

IVIvvUJJvOt-X/.JJvI 


UC2-ColoiiMetLiver 


4740 


557476 


2020.L06.gz43_265017 


F 


M00055086B-.D10 


UC2-ColonMetLiver 


4741 


556559 


2020.O06.gz43_265020 


1 F 


M00055090C-.C12 


UC2-ColoiiMetLiver 


4742 


548943 


2020.K07.gz43_265032 


F 


M00055084D:C09 


UC2-ColonMetLiver 


4743 


555126 


2020.P07.gz43 265037 


F 


M00055091B-.A07 


UC2-ColoiiMetLiver 


4744 


i 142614 


2020.E08.gz43_265042 


F 


M00055078A:C05 


UC2-ColoiiMetLiver 


4745 


453756 


2020.M08.gz43_265050 


F 


M00055087D:D08 


UC2-ColonMetLiver 


4746 


556040 


2020.G09.gz43_265060 


F 


M00055080B.G10 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


A1A1 
4 /4 1 


^7A^A 
J J /H04 


9090 "MfiQ a743 265067 


F 

X 


M00055089AH06 


UC2-ColonMetLiver 


4 /4o 


A^9R99 
4JZoZZ 


9090 PI 1 o-7&V\ 965088 


p 

X 


M0005 5 076B E08 


UC2-ColonMetLiver 


4/4V 


A^AOA^ 


9090 PI 1 9650Q0 

ZUZU.X-rX x.jjjCrrD 


p 

X 


M0005 5078 AE1 0 


UC2-ColonMetLiver 


4 / JU 


JUyJUJ 


909 A PIT cr-ril^ 9650Q1 


p 

X 


M0005 5 079 AH05 


UC2-ColonMetLiver 


AH^ 1 


^^A6i 

jjjUOI 


9090 PT1 1 cr^zi^ 965093 

.XX 1 1 Z.UJI/7J 


p 

X 


M0005508 1CA12 


UC2-ColonMetLiver ! 


A7^9 
4 / JZ 


55666ft 


9090 T1 1 or43 265095 


F 


M00055083DD08 


UC2-ColonMetLiver 


4 / Jj 


J*t70 JLU 


9090 PI 9 ar4^ 965104 


F 


M00055076BF04 


UC2-ColoiiMetLiver 


A7^A 


^5776A 

J J / / Ov 


9090 F1 9 az43 265106 


F 


M00055078AF01 


UC2-ColonMetLiver 




J J*tUO*t 


9090 TSM2 **z43 265115 


F 


M00055089BC01 


UC2-ColonMetLiver 


4 / jo 


1 6AQ9 1 


9090 P19 ^7-43 265117 


F 


M00055091CB04 


UC2-ColonMetLiver 


4 / J / 


5^1^A9 

JJ U't-Z 


9090 T 11 c*743 265129 


F 


M00055086DC07 


UC2-ColonMetLiver 






9090 A14erz43 265134 


F 


M00055074B:E05 


UC2-ColoiiMetLiver \ 


4 / JV 


5^571 a 

JJJ / 1U 


9090 T14 a741 965142 


F 


M00055082DE08 


UC2-ColonMetLiver 


A7£A 


AQ7Aft6 
4^ /UoO 


9090 H14 a741 965148 


p 

X 


M00055090DE03 


UC2-ColonMetLiver 


4 /ol 


c/;i A ftQ 
DOl4oV 


9n.7n PI 5 o^AI 9^S154 


p 

X 


M0005 5 07 8B *F05 


UC2-ColonMetLiver 


4/oZ 


JJOOlO 


9A7rt PI cf7/3l\ 96S155 


F 

X 


M00055079CG06 


UC2-ColonMetLiver 


4/OJ 


£<QA7 1 

j jVU 1 1 


ZUZU.Jr 1 J.gZH j ZOJ IDJ 


p 

x^ 


M00055091CD1 1 

lVJLvl/V/J^V^ X V^.X^ X X 


UC2-ColonMetLiver 


4/o4 


^A7A 1 
JjU/Ul 


ZUZU.V^ lD.gZ 4 rJ> ZOJlOO 


F 

X7 


M0005 5076B H06 

1VX V/ \J \J ~J -J \J I vJJ .X XV/ W 


UC2-CoIonMetLiver 


4/OJ 


J JlV /O 


7A7n Ptl 6 <r^41 96S 1 79 


P 
X 


M00055080DA01 


UC2-ColonMetLiver 


4/00 


AAA9AA 
40UZ44 


9fl9A T1 A otA** 965174 


P 
X 


M00055082DG01 


UC2-ColoiiMetLiver 


4 /D/ 


jjOoUZ 


9A9A T16 ctt41 965175 
ZUZU.J JLO.gZ't.D ZOJ X / O 


p 

X 


MG0055084A*E 1 0 


UC2-ColonMetLiver 


4/Oo 


A^AOI A 


9A9A T<T 1 6 a^4*3 965 1 76 
ZUZ U. IV XO.gZ*f J ZOJ1 /U 


p 

X 


M00055085BD02 


UC2^olonMetLiver 


4 /oy 


JJ /Z14 


9A9A T 16 ctt-A^ 965177 
ZUZU . iv 1 0 .5ZH J ZOJ1 / / 


p 

X 


M00055086DH07 


UC2-ColonMetLiver 


4/ /U 


£19Q£1 


9A9A PI 7 a^A^K 9651517 


F 

X 


M00055079DA03 


UC2-ColoiiMetLiver 


4/ / 1 


A£^AAA 
40J44O 


9A9A T17 otAI 965100 

ZV/ZU.1X / .g2»*T-5 ^UJ17V 


p 

X 


M00055082DH02 


UC2-ColonMetLiver 


AT 79 
4/ /Z 




9090 T17a741 965191 

ZUZU. J 1 / .gZHO ZUJ X ^ X 


F 

X 


M00055084AF10 


UC2-ColonMetLiver 


4 / /J 


700O7 


9090 017 a741 965196 


F 

X 


M00055090DF03 


UC2-<^lonMetLiver 


AH HA 
4 / /4 


jjjj /Z 


9090 Pfl R <T741 965905 

ZUZtl.X7XXO.gZ.*TJ 


F 


M00055081CG01 


UC2-ColonMetLiver 


4 / / J 




7A90 Tift cr?A1 965907 


F 


M00055084BA04 


! UC2-ColonIVletLiver 


4 / /O 


JJJ J lo 


9090 PrlO ot41 965220 

ZUZv.VJl7.ftZi t rJ j6.UJaZ.\/ 


F 

X 


M00055080DE07 

lYlv v vJ»/ WV/vJ-/ ,XJv ■ 


UC2-ColonMetLiver 


4 / / / 


AAOA1 O 


9090 Pf 1 0 (TT741 96599 1 
zuzu.niy.gz*rj ^ujz,z,i 


F 

X 


M00055081CH04 


UC2-ColonMetLiver 


/1770 
4 / /o 


^£A^A7 
jOUjU / 


9A9A i\T10 mA'X 965997 
ZUZU.xN xy.gZ*T^> Z,OJZ»Z, / 


F 

X 


M00055089C-D06 


! UC2-ColonMetLiver 


4 / /y 


jouuoy 


7A7A T*7A mA'X 965911 
ZUZU.JDZU.gZ4J ZOjZjI 


p 


M00055075DD05 

1VX\/ wV/JJU / — ' X-X .X-f vy«y 


UC2-ColonMetLiver 


/I TO A 


jj /yj4 


7A7A rOH rr-rAI 965979 
ZUZU.V^ZU.gZ4J ZOJZJZ. 


p 


M00055076C-H07 


UC2-ColonMetLiver 


4/ol 


jjjyyo 


7A7AP71 «^A1 965950 
ZUZU.jQZl.gZ4J ZOJZJU 


P 
X 


M00055078DA07 


UC2-ColonMetLiver 


4782 


494625 


2020.G21.gz43 265252 


F 


M00055080D:F01 


UC2-ColonMetLiver 


4783 


554828 


2020.P21.gz43 265261 


F 


M00055091D:A03 


UC2-ColonMetLiver 


4784 


562229 


2020.A22.gz43 265262 


F 


M00055074D:B04 


UC2-ColonMetLiver 


4785 


561975 


2020.M22.gz43 265274 


F 


M00055088C-.D01 


UC2-ColoiiMetLiver 


4786 


560080 


2020.N22.gz43 265275 


F 


M00055090A:F02 


UC2-ColonMetLiver 


4787 


557783 


2020.C23.gz43 265280 


F 


M00055076D:F11 


UC2-ColonMetLiver 


4788 


556881 


2020.K23.gz43 265288 


F 


M00055085D:F03 


UC2-ColoixMetLiver 
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SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORE 
NT 


CLONE ID 


LIBRARY 


4780 


SS77A7 
33 1 /*+ / 


7090 A/T9^ mA*\ 96^900 

ZUZU.lvlZ,.3 .gZH3__Z03Zi7U 




ivyrooo*; *; o ft ft -no9 

IV1U UU3 3 U 5 5 .L/UZ 


u v^z-^oioiuviexjjrver 


A7QO 
4 /7U 




9090 09^ rr-yA7 9£^9Q9 
ZUZU.V-FZ D .gZ43_Z03ZyZ 


r 


AAO0O<I*IOO1 A'fYMl 
XVIUUU3 3 Uy 1 A. L/UV 


u^z-i^oioiiMeujiver 


A701 


43 1 1 5 J 


9A90 P9A mAI 9£<\9QQ 
ZUZU.J/Z4.gZ'+3_Z03Zyi7 


JT 


"\yfooo*;*;oftn A »Pn«; 

1YIUU U3 3 U 5 U/\. X 4 U3 


u i^z^oionMetLjiver 


A709 


^701 0 


9090 PT9A rr-7-A^. 9ft^'*01 
ZUZU. XlZ4.gZ43 Z033U1 




TVAOOO** i \Oft9 A • A 1 9 
1V1UUU33U5ZA.A1Z 


u ^z~\^oioiiivieii-fiver 


A70^ 


JJJ /Z3 


9090 "fcT9A crvA'X IG^IflA 
ZUZ U . ISZh- .gZ4 j _Z 03 3 U4 


JT 


\a oon n ft <iri ^no 

IV1UUU3 3 U 5 3 .LJ . OUi/ 


u i^z-i^oioiHvieubiver 


A70A 

4 / y4 


^£9 7 Aft 
3DZ /4o 


9090 POI cttA'} 9£^9^ 
ZUZy . JrU 1 .gZ43_Z03 3Z3 


r 


A/rooo*;*; ill a »Pi o 

1V1UUU33 1 1 iA.r 1U 


TTP9 P^xlr\**AAxi+T -Itj-ch- 

u i^z-\^oioiiivieu-fiver 




*\£7QftO 


9090 T09 mA'X 9#W*'*d 


p 


tv/tooo*;^ i nnr«Hn^ 

1V1UUU33 lUU'O.Xl.UO 


u \^z-^oioniviciJ--rivcr 


A70£ 


S£99A^ 


9090 TO^ w^A^ 9^S^ i S0 


p 


aaooo^*; 1 00n-"R09 

IVIUUU^?^ 1 1/UX-/.X5UZ 


u i^z-*ooioiiivicXJL/i ver 


A707 


*;A97£ft 

jOZ / Oo 


9090 "R0£ arAI 0f^^Q^ 
ZA)Z.Z7 .I3UO .gZH j_ZOj J7 1 


p 


iv>rooo*;*;oQ^ a -po7 

lVlUUU33Uyo/V.X'U / 


u ^z , -^oioiiiviciJ->iver 


A7QR 


zlOl 1 77 
Hzrx 1Z / 


9090 T\(\£* mA'X 9A**^0^ 
ZUZ57.J-JUO.gZH3 Z03350 


P 


AAOOO** *JOQATl-POQ 


u \^z-\^oioiuviciijiver 


470Q 


S^S^O 
DJjjjDKJ 


9090 "R07 o-zA^ 96SA1^ 


p 


ivaooo*;^ i oo A po*; 


%j i^z - v^uioiuvxc iXjiver 


*f ouu 


S^970A 


9090 T 07 o^Al 9A5ZU 7 


P 


JLV1UUU33 1 U3 JL/.X3UO 


TTP9 Pi-*1rt-riAyr*»+T nwr 

U , oZ-l^OIOIllV16LL-fIV6r 


AROI 


A7£ 


709Q riOft crr/AL 9^*;a^^ 

ZUZ^7.WUo.gZ*t^ ZSjD^Oyj 


P 


iv>iooo*;^i ioa-po^ 

1V1UUU33 1 1U/\..\^U3 


TTP9 Pr»1/x«AA«*fT iirdf 

u \^z-i^oioiHvi6LJ-fiver 


40 UZ 


33 xyo / 


9090 "P0Q rr^A^ 9^^AA9 
ZUZi7.HU i/. gZHO ZD3'f*rZ 


P 
X* 


TVAOOO^^OQA A 'fins 
JV1U UU3 3 l/7uA. vJU 5 


u \^z-^oioiuvieLLfi ver 


Afto^ 


43J> j 


909Q POO mAI 0MAA\ 
ZUZ^.r 1 Ui/.gZ*f3 ^ZD34 i +3 


p 

X 1 


\>TOOO < \ < ;OQ7T*-'R 1 9 
!YlUUU33Uii' /X>.X>1Z 


TTP9 Prt1^nAAc»fT «7iaT- 


Aft OA 


^ QA 1 ftO 
O ^4 1 07 


909O nOO mAI 9A*;A^9 
ZUZy . \J\jy . gZ43_Z03 4 3 Z 


p 

X* 


\aooo*;^i ioA'Po*; 

1V1UUU33 1 1UA. , CU3 


u ^z-i^oioiiLviciXjiver 


Afto^ 

45U3 


30Z/ iy 


909O TT1 O rrrrAI 9^<;A^O 
ZUZi/.r lU.gZ4^_Z0343y 


p 

X 1 


\>TOOO ^ OQ9"R *POft 
lVLUUU33Uy /X3.rU5 


u i^z -^oioiiivieti^iver 


AftO£ 

45UO 


^£9^07 


9090 n 1 crrAI 9^*;A77 
2A)jL.y .Ksl 1 .gZ43_Z034 / Z 


p 
X* 


IVAOOO^^OOAR-T^I 1 
lVlUUU33Uy4I3.Jjl 1 


u v^z-i^oioiuvicXXjivcr 


Aft07 
45U / 


OftAftA 
y5454 


9090 Fill rr«y/11 9^<IA7^ 
ZUZy.L/ 1 1 .gZ4J__Zu34 / o 


p 


TVAOOO****OQ*l A '"HOft 
IVLUUU33 Ui^3 A.UU5 


u v^z-v^oioiiiviexj-fiver 


AftOft 


1 1 ft 


9090 rJI 1 /r-^A^ 9^<;A7< 
ZUZy.vJ 1 1 .gZ43__Z034 fO 


p 

X* 


< aoo a -an* 


U \^Z-V^OlOIllVie Uul Vol 


asoq 


*IA7ftft 1 
30Z551 


ZUZy.l514.gZ4J_ZD33 iy 


p 


XYIUUU33 Uy dd .vruo 


u i^z-i^oioiiJVicxijiver 


Aft i n 
4oiU 


33.534o 


9090 T?1A rr»yA1 9A<*l9ft 
ZUZy.lS.14.gZ43_Zo33Zo 


p 


JVluUu3 3 1 U4U .r uy 


u i^z-i^oioiiivietLfiver 


Aft 1 1 
451 1 


03 /yoo 


ZUZV.A13 .gZ43__Z03334 


p 

x* ; 


JVIUUU3 3 uyzxJ . ijuy 


ui^z-LxOioiiiYietxvivcr 


Aft 1 9 
4<S1Z 


3oU3 / J 


9H9Q T K n^AI 0«^/K 
ZUZy.l>13.gZ43_Zo3343 


p 
r 


iiyfnon<K 1 A£ A »PHA 
JVLUUU33 1UOA.XSU4 


u l^z -i^oioiimcti-fiver 


Aft 1 "2 
45 13 


3344yt> 


Z UZV . XZr 1 0 . gZ43 _Z03 3 3 4 


p 

X* 


X^OOO^^OO^P'OO^ 
J\lUUU33Uyol^.V/U-j 


u U/Z -v^oioiuvieuLriver 


Aft 1 A 
4514 


3ol 144 


ZUZy.J 10.gZ43_Z0333V 


p 

X 4 


1YLUUU33 1U^A.X11 1 


u i^z-i^oioiuvieiJLfiver 


Aft 1 < 


^£0/£9ft 


0090 T90 rr^AQ 9A*I/i90 
ZUZy .1ZU .gZ4^_Z03 ozz 


p 

X* 


A/fflOn^^ 1 A9 A -P1 1 
JYIUUU33 1UZA.X11 1 


i u^z-^oioiii\aeii-fiver 


Aft 1 A 
45lO 


3 JOJ 1 1 


ZUZy.JZU.gZ4j_Z030Zj 


■p 

X* 


JV1UUU33 1 U j \^ .X>U / 


ui^zH^oioiuYieiiwiver 


Aft 1 1 
451 / 


/i no Aft ft 

4UZ455 


9090 #^91 rrTA^ 9/;«;^Q9 
ZUZy.^Z 1 .gZ43_Z03O jZ 


p 
r 


A/TOOO^^OOAP-P 1 O 
lYlUUU33Uy4^.^1U 


u i^z-i^oioiiiviciijiver 


Aftl ft 


330O13 


9090 T91 crvAl 9A*;^0 
ZUZV .JZ 1 .gZ4«3_Z030^y 


p 


JV1UUU33 1Uj\^/.UU3 


u \^z -L^oioiuvicu-fiver 


Aft 1 Q 
45 iy 


^^nftoft 

3DU5y5 


909Q T99 cr-rAI 1 
ZUZy.JZZ.gZ43_ZD3033 


p 

X 1 


iv^f ooo ^ ^ i o^ r* • cu\i. 

1VIUUU33 1 U3*o.VJTU3 


T TPO P/^1/-fct-iAA*»4-T tiror 

u v^z -i^oi oniviexXyi ver 


Aft9H 
45ZU 


A^AAOO 

4344yy 


9090 T 99 rrrrA^ 9/;<;^***7 
ZUZ!7.JuZZ.gZ40_Z0303 / 


p 

X* 


\viooo^^ i n^P'RO/% 

JV1UUU33 IUO^.JjUO 


U l^Z-V^OlODIVlcTJ-AVCr 


Aft'? T 
4oZl 


33yy33 


9H9Q n-7A1 ItKATI 
ZUZy .JSZ3.gZ4j_Z030 /Z 


p 


JVIUUU33 1U3A.AU3 


u ^z-L^oioiiJVicLLiiver 


Aft99 
45ZZ 


j34Zjj 


90^0 TOI crvAl 9^<;90 , 3 
ZU 3 U. J Ul.gZ4j_Z03 /Uo 


p 

X 1 


Ayroon^^; i oft a »p i o 

iVLUUu33 1Z5A.L/ 1U 


u v^z-v^oioiuvicLi^iver 


Aft9^ 
45Z3 


^^9A1 ** 
33 /Ol3 


90^0 'K'OI cttAI 9A<;90A 
ZU^U.lVUl .gZ4o_Z03 /U4 


p 
r 


"\^oon^ ^ 1 9 ftTvriOA 

JVIUUU33 1Z5L/.JL/U4 


u v^z -i^oioiiJwicLLfiver 


4824 


449035 


2030 C02 £z43 265712 


p 

JL 


M00055 1 ] 5CG09 


T TP^-PnlnnMefF ,i ver 


4825 


452525 


2030.E03.gz43_265730 


F 


M00055118C:B03 


UC2-ColonMetLiver 


4826 


452775 


2030.L03.gz43_265737 


F 


M00055129D:C02 


UC2-ColoriMetLiver 


4827 


560868 


2030.E05.gz43„265762 


F 


M00055118D:B04 


UC2-ColonMetLiver ; 


4828 


538830 


2030.H05.gz43 265765 


F 


M00055125A:A02 


UC2-ColonMetLiver 


4829 


486683 


2030.K05.gz43_265768 


F 


M00055129A:B03 


UC2-ColonMetLiver 


4830 


488030 


2030.L06.gz43 265785 


F 


M00055129D:F11 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTEF 


SEQ NAME 


ORES 
NT 


CLONE ID 


LIBRARY 


4831 


66678 


2030 M06 ez43 265786 


F 


M00055130D:F08 


UC2-ColonMetLiver 


483? 


528369 


?030 J07 ffz43 265799 


F 


M00055128BB12 


UC2-ColonMetLiver 


4833 


47973? 


2030 G08 £z43 265812 


F 


M00055124AF01 


UC2-ColonMetLiver 


4834 


557747 


2030 O08 or43 265820 


F 


M00055 133BE08 


UC2-ColonMetLiver 

V^^« V/V/XV/XXXTXVtXVX T vl 


4835 


558534 


2030 H09 ez43 265829 


F 


M00055125BF01 


UC2-ColonMetLiver 




312036 


2030 D10 ez43 265841 


F 


M00055117A:G08 


UC2-ColonMetLiver 


4837 


461653 


2030 J10 ffz43 265847 


F 


M00055128B:E12 


UC2-ColonMetLiver 


4838 


558060 


2030Gllez43 265860 


F 


M00055124BA11 


UC2-ColoiiMetLiver 


4839 

TOJ7 


557610 


2030 Lll ffz43 265865 


F 


M00055130A:D10 


UC2-ColonMetLiver 


4840 


557284 


2030 Pll ffz43 265869 


F 


M00055134C:A01 


UC2-ColonMetLiver 


4841 


562714 


2030 A12 Q743 265870 


F 


M00055113BF02 


UC2-ColonMetLiver 


484? 


453274 

tJJir / t 


7030C12a743 265872 


F : 


M000551 16AC07 


UC2-ColonMetLiver 


4843 


551068 


2030P12oz43 265885 

ZiV/JV/.X lL>g£itJ lujuuj 


F 


M00055134CB01 


UC2-ColonMetLiver 


4844 

to 1 ' 1 


125543 


2030 J13 ez43 265895 


F 


M00055128B:G01 


UC2-ColonMetLiver 


4845 


449454 


2030M13az43 265898 


F 


M00055131A:D07 


UC2-ColonMetLiver 


4846 


562760 


2030 013 ez43 265900 


F 


M00055133C:C06 


UC2-ColonMetLiver 


4847 

*to*t / 


557710 

J— J / / XV/ 


2030 P13 ez43 265901 


F 


M00055134C:E09 


UC2-ColonMetLiver 


4848 


526334 


2030 F14 ez43 265906 

Z.UjU,JUlt.gLtJ LUJ7V/U 


F 


M00055119D:F08 


UC2-ColonMetLiver 


484Q 


451392 


2030 015 ez43 265932 


F 


M00055133C:G07 


UC2-ColonMetLiver 


4850 


549591 


2030 D16 ez43 265937 


F 


M00055117C:C03 


UC2-ColonMetLiver 


4851 


557895 

J J / 027J 


2030H16ez43 265941 

LuJu.lllU.gitJ 


F 


M00055125C:H03 


UC2-ColonMetLiver 


485? 


554989 


2030J16oz43 265943 


F 


M00055128C:E03 


UC2-ColonMetLiver 


4853 


561265 


?030r>17o^43 265953 


F 


M00055117CF02 


UC2-ColoiiMetLiver 


4854 


55801 5 


?030O17az43 265964 


F 


M00055133CH11 


UC2-ColonMetLiver 


4855 


561741 


2030 A18 trz43 265966 


F 


M00055114AE02 


UC2-ColonMetLiver 


4856 


477295 


2030 018 2z43 265980 


F 


M00055133D:A02 


UC2-ColonMetLiver 


4857 
to J / 


556310 


2030E19az43 265986 

LV7«7 W .X-# X 7.^tJ ZiUJ7 0U 


F 


M00055120BF12 

iTj.\/V\/fc/*/ X*^WX^»X X 


UC2-ColonMetLiver 


4858 
to JO 


55845? 


2030 H20 *yz43 266005 


F 


M00055125DE02 


UC2-ColoiiMetLiver 


485Q 


610269 


2030 N21 P743 266027 


F 


M00055132DE07 


UC2-ColonMetLiver 


4860 


561279 


2030 J22 ez43 266039 


F 


M00055128D:C11 


UC2-Q)lonMetLiver 


4861 
tovJ i 


557719 

J J / / 17 


2030 N22 ffz43 266043 


F 


M00055132DE10 


UC2-ColonMetLiver 


4867 


451391 

t J 1J71 


2030 022 ^743 266044 


F 


M00055133DF02 

XTXvv V^/w' X m X X#X# 


UC2-ColonMetLiver 


4863 


5547^7 


?030 D?4 o743 266065 


F 

X 


M00055118BA09 

1tXv/v Vv'*' x x uxy*i ivy y 


UC2-ColonMetLiver 


4864 
toot 


452750 

tJZi / J7 


2030 K?4 CT743 266072 


F 

X 


M00055129CH08 


UC2-ColonMetLiver 


4865 


562683 


2031 A01 ^743 266078 


p 


M00055135A-E07 


UC2-ColonMetLiver 


4866 


558463 


2031.C04.gz43_266128 


F 


M00055138A:E08 


UC2-ColonMetLiver 


4867 


211273 


203 U04.gz43_266 134 


F 


M00055146A:B12 


UC2-ColonMetLiver 


4868 


468257 


203LN04.gz43_266139 


F 


M00055152D:C03 


UC2-ColonMetLiver 


4869 


558768 


203 1 .B05 .gz43_266143 


F 


M00055136D:D09 


UC2-ColonMetLiver 


4870 


561245 


2031.G05.gz43_266148 


F 


M00055144A:A11 


UC2-CoIonMetLiver 


4871 


557783 


2031.K05.gz43_266152 


F 


M00055149B:F09 


UC2-ColonMetLiver | 


4872 


549930 


2031.O05.gz43_266156 


F 


M00055154D:F06 


UC2-ColoiiMetLiver \ 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


OKIE 
NT 


CLONE ID 


LIBRARY 


4873 


558619 


2031.C06.gz43_266160 


F 


M00055138AG08 


T 1C2.-Cn\ onMpfT iver 

\J V_/*J xVJJXlVXW LX-fl V CI 


4874 


450630 


2031.G06.gz43 266164 


F 


M00055 144AE09 


\J V^Z> w \^VIlV^lXiVXvvlArl VCI 


4875 


524721 


2031.O06.gz43 266172 


F 


M00055154DG12 


TTC?-Cn1nnlVrrtT ivpr 

w V^UlV/lXiVXwU-rl Vwl 


4876 


561624 


2031.P06.gz43 266173 


F ! 


M00055 156AD02 


TTP^-PnlnnMptT ivpr 

w V_/^- X^VXlUlXlVXwULfl V wl 


4877 


497493 


2031.N07.gz43 266187 


F 


M00055 152DH09 


w V^^> %fUl UIXLVXwLul VCI 


4878 


490846 


2031 M08 ez43 266202 


F 


M00055 152A-B05 


V_xZ. - V^v/IOxJUVxwLxjI V CI 


4879 


558645 


2031.P08.gz43 266205 


F 


M00055156BC11 


T TP 9 -Pol nn MpfT ivpr 


4880 


553969 


2031.D09.gz43 266209 


F 


M00055139BE10 


TTP?-Po1nnMp<-T ivpr 


4881 


558007 


2031.P09.gz43 266221 


F 


M00055 156C- A08 


TTP9-Pnlnn\fptT ivpr 

wVXlVJXlXYXCU-si VCI 


4882 


550047 


203 1 .B10.gz43_266223 


F 


M00055 137AE05 


UC2-PolonMetT iver 

Y^VXV/lXLVXVvLXwl VCI 


4883 


562000 


2031.I10.gz43 266230 


F 


M00055 146B-E09 


UP2-PoloriiVrefT iver 


4884 


160289 


2031.L10.gz43 266233 


F 


M00055 150DC06 


T TpO _Pn1 nn A/TpfT ivf»r 

wa» V^V/lVJlXlVXC LXj 1 VGA 


4885 


27586 


2031.Cll.gz43 266240 


F 


M00055138BH12 


T TP?«Pnlnn IViptT mpr 

U V^Z> "*V/VXXvJXLLVXCLL/i VCi 


4886 


558230 


2031.Dll.gz43 266241 


F 


M00055 13QBG03 


TTP9-PnlrmMptT ivpr 

VJ V> V^UHJlxlVlC LL/1 V CI 


4887 


549739 


2031.M12.gz43 266266 


F 


M0005 5 1 52 A-E09 


TTP^-PolnnA/TptT ivpr 

VJ \_/V/AvJlXiVXC LXwl V Ci 


4888 


561558 


2031.P12.gz43 266269 


F 


M00055 156C-D06 


TTP9-Pn1nnlV/r£*tT ivpr 

VJ V^^i wl VlixJ. VXC 11 ■»! V CI 


4889 


556630 


2031.B14.gz43 266287 


F 


M00055 1 37BB 1 1 

lTXV/V/V/>/»/ A *J 1 XJ ( XJ X X 


VJ V^VJlvJlXlVXCLX_/l V ci 


4890 


561178 


2031 P15 ez43 266317 


F 


M00055 156DA02 


TTP9-Pn1nn"MV*tT ivpr 1 

(JV^A~°V^vXlUIJlVxCIJ_rlVCI | 


4891 


558477 


2031.B16.gz43 266319 


F 


M00055 137B-F 1 2 


TTP9-Pn1rm1\/fptT ivpr 

Uv/Ai V_AJAVJ±x±VXCLX-/l VCI 


4892 


557250 


2031 M16 ez43 266330 


F 


M00055 152BA03 


TTP9-PfilftTiiVrf*i-T ivpr 

VJ V> V_/VJlVJxliVXC LXwi V CI 


4893 


556288 


2031P17ez43 266349 


F 


M00055 157AB04 


TTP9-Pn1onA/TptT ivpr 

VJ V^X>~ wlJlUlii YXCLLfl VCJ 


4894 


610893 


2031.L18.gz43 266361 


F 


M00055151AF10 

ITlWV/k't/ X^ 1/XtX XV/ 


TTP9-Pn1nnMpl"T ivpr 

VJv^Z, V^VJl\JxXiVXCl.X_/I VCI 


4895 


557708 


2031 024 ez43 266460 


F 


M00055155DB09 


VJ V_/^6 - V^OlVJliiVXC LJL/1 V CI 


4896 


562881 


2032 E01 sz43 266466 


F 


M00055 162A-B03 


TTP9-Pn1nnA/TptT ivpr 

VJ w ^ V^VJlUxxlVlC IXwi VCI 


4897 


494198 


2032.L01.gz43 266473 


F 


M00055 170D-B09 


TTP9-Pn1nnMptT ivpr 

VJ V>*J1 VJIHV J.C LX-/1 V CI 


4898 


551475 


2032 O01 ez43 266476 


F 


M00055 177AF05 


TTP9-PnlnnMptT .ivpr 

VJ ^, V^ VJlVJlXiVXC LX-/1 V CI 


4899 


558720 


2032.K02.gz43 266488 


F 


M00055169DA11 


T IP9-Pn1nn1\/fetT -iver 

VJ V-'-c. V_/\JlV/xxlVIVvLX-/IVCI 


4900 


2435 


2032 E03 ez43 266498 


F 


M0005 5 1 62 A* C 1 2 


TTP9-Pnlnn'MptT ivpr 

VJ VJIUlxl VICIxjJ VCI 


4901 


559464 


2032.I04.gz43 266518 


F 


M00055 166DF02 


TTP9-Po1riTiMptT .ivpr 

VJ V_> ^ V-/*Jl\JxXi VXC IX-ri V CI 


4902 


559389 


2032.A05.gz43 266526 


F 


M00055157CC11 


IJP2-PnlnnMetT iver 

VJ Vfii V-^VJ1 V/lIiVXC LX-f 1 V CI 


4903 


559562 


2032 H05 sz43 266533 


F 


M00055 1 65BO09 


' TTP9-Prk1onA/fpfT iver 

VJ V_/^ V^V7lUxXlVxCtX_*l VCI 


4904 


402799 


2032 M05 ez43 266538 


F 


M0005S179 A POO 


TTP9-Pn1nriiV^'ptT iver 
\J Z, ~ V^UIUlliVlC LLrl VCI 


4905 


557316 


2032 J06 ffz43 266551 


F 


M000SS 1 1 


TTP9-Prt1r»nlV/fptT ivpi- 


4906 


557209 


2032 K06 ez43 266552 


F 




TTP9-Pr»1rwilVApfT it/pr 
U Z, ** v^UlOIliVlO LL/1VC1 


4907 


554604 


2032 M06 £z43 266554 


F 

X 


M 000S S 1 79 A -Ffn 


TTP9-Pr»1nnTV>fptT ii/pr 
VJ - v^OiOxxiVlClvLfl vcr 


4908 


558917 


2032.H07.gz43_266565 


F 


M00055165C:H08 


UC2-ColonMetLiver 


4909 


493261 


2032.107.gz43_266566 


F 


M00055167A:A02 


UC2-ColoiiMetLiver 


4910 


558755 


2032.K07.gz43_266568 


F 


M00055170A:A11 


UC2-ColonMetLiver 


4911 


558900 


2032.M09.gz43_266602 


F 


M00055172B:B04 


UC2-ColonMctLiver 


4912 


553537 


2032.C10.gz43_266608 


F 


M00055160C:D02 


UC2-ColoiiMctLiver 


4913 


561413 


2032 .E 1 0.gz43_2666 1 0 


F 


M00055162B.B04 


UC2-ColonMetLiver 


4914 


561876 


2032.F10.gz43_266611 


F 


M00055163B:F07 


! UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTEE 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


4915 


553797 


2032.H10.gz43 266613 


F 


M00055165D:C03 


UC2-ColonMetLiver 


4916 


558965 


2032 Gl 1 ez43 266628 


F 


M00055164C:C10 


UC2-ColonMetLiver 


4917 


561911 


2032.B12.gz43 266639 


F 


M00055159B:G09 


UC2-ColonMetLiver 


4918 


557961 


2032 112 gz43 266646 


F 


M00055167B:A08 


UC2-ColonMetLiver 


4919 


558022 


2032.K12.gz43_266648 


F 


M00055170B:B06 


UC2-ColonMetLiver 


4920 


409262 


2032.M12.gz43_266650 


F 


M00055172B:H07 


UC2-ColonMetLiver 


4921 


415538 


2032.I15.gz43J266694 


F 


M00055167B:H07 


UC2-ColonMetLiver 


4922 


559057 


2032 .M 1 5 .gz43_266698 


F 


M00055172D:D04 


UC2-ColonMetLiver 


4923 


215005 


2032.B16.gz43_266703 


F 


M00055159C:B02 


UC2-ColonMetLiver 


4924 


556542 


2032.M16.gz43_2667 14 


F 


M00055172D:D07 


UC2-ColonMetLiver 


4925 


562989 


2032.0 17.gz43_266732 


F 


M00055177D:F07 


UC2-ColonMetLiver 


4926 


394772 


2032.A19.gz43 266750 


F 


M00055158D:C01 


UC2-ColonMetLiver 


4927 


553877 


2032E19gz43 266754 


F 


M00055162C:E12 


UC2-ColonMetLiver 


4928 


558858 

+j *p/ \-* vj 


2032.I19.gz43 266758 


F 


M00055167D:B05 


UC2-ColonMetLiver 


4929 


559355 


2032.JI9.gz43 266759 


F 


M00055169B:F04 


UC2-ColonMetLiver 


4930 


551693 


2032.M19.gz43 266762 


F 


M00055172D:F12 


UC2-ColonMetI*iver 


4931 


323165 


2032.E20.gz43 266770 


F 


M00055162C:G03 


UC2-ColonMetLiver 


4932 


558981 


2032.O20.gz43 266780 


F 


M00055178A:C07 


UC2-ColonMetLiver 


4933 


561507 


2032 C21 ez43 266784 


F 


M00055161A:C02 


UC2-ColonMetLiver 


4934 


555883 

w/ \J \J 


2032E22ez43 266802 


F 


M00055162D:B01 


UC2-ColonMetLiver 


4935 


559088 


2032 022 ez43 266812 


F 


M00055178A:D03 


UC2-ColonMetLiver 


4936 


558395 


2032 C23 ez43 266816 


F 


M00055161A:E05 


UC2-ColonMetLiver 


4937 


447386 


2032 D23 ez43 266817 


F 


M00055161D:H03 


UC2-ColonMetLiver 


4938 


450566 

~mJ \J -J \J\J 


2032 L23 ez43 266825 


F 


M00055171C:C01 


UC2-ColonMetLiver 


4939 


561918 


2032 E24 sz43 266834 


F 


M00055162D:G04 


UC2-ColonMetLiver 


4940 


559389 


2032 G24 ez43 266836 


F 


M00055165A:F05 


UC2-ColonMetLiver 


4941 


555202 


2032.M24.gz43 266842 


F 


M00055174A:H12 


UC2-ColonMetLiver 


4942 


559752 


2032.N24.gz43 266843 


F 


M00055176D:H01 


UC2-ColonMetLiver 


4943 


478511 


2041 .B01 .gz43_266847 


F 


MOO055181B:A10 


UC2-ColonMetLiver 


4944 


456517 


204 1 .DO 1 .gz43_266 849 


F 


M00055183A:C06 


UC2-ColonMetLiver 


4945 


557895 


2041.H01.gz43 266853 


F 


M00055187D:G11 


UC2-ColonMetLiver 


4946 


561975 


2041 A02 ez43 266862 


F 


M00055179A:G08 


UC2-ColonMetLiver 


4947 


557714 


2041 D02 ez43 266865 


F 


M00055183A:E10 


UC2-ColonMetLiver 


4948 


550874 


2041 K02 gz43 266872 


F 


M00055192C:E04 


UC2-ColonMetLiver 


4949 


539142 


2041 L02 ez43 266873 


F 


M00055193C:C11 


UC2-ColonMetLiver 


4950 


549911 


2041.M02.gz43_266874 


F 


M00055194D:C05 


UC2-ColonMetLiver 


4951 


561830 


204 1 .N02 .gz43_266875 


F 


M00055195CJH05 


UC2-ColonMetLiver 


4952 


479851 


2041.A03.gz43_266878 


F 


M00055179A:H11 


UC2-ColonMetLiver 


4953 


549472 


2041.B03.gz43_266879 


F 


M00055181B:E06 


UC2-ColonMetLiver 


4954 


561687 


2041.F03.gz43_266883 


F 


M00055185D:A02 


UC2-CoIonMetLiver 


4955 


504880 


2041.L03.gz43J266889 


F 


M00055193C:E10 


UC2-ColonMetLiver 


4956 


558212 


2041.E04.gz43_266898 


F 


M00055184C:C07 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 

NO ( 


DLUSTEF 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


A957 


559050 


9041 A 05 ct7 43 966910 


X 


M0Q055 179BD05 


UC2-ColonMetLiver 




55A590 


9041 fii5 c*743 966919 


F 

X 


M0005 5 1 82 A *D07 


UC2-ColonMetLiver 


A05G 


AOQA1 9 
^HJ^OXZ 


9041 F05 a-743 96691 A 


F 


M00055 1 84CD02 


T JC 2-ColoiiMetLiver 


AQ6n 

*t7DU 


ARR61 ^ 
tOOOIj 


90A1 T05 cr-743 9660 1Q 


F 


M00055191B-A10 


UC2-ColoiiMetLiver 




558396 


9041 KD5 <*?43 966990 


F 


M00055 192CH06 


UC2^oloiiMetLiver 


AQ69 


5553QA 

JJJJ7T' 


904 1 F06 cnr43 96693 1 


F 

X 


M00055185DD11 


UC2-ColonMetLiver 




s5Q6io 

JJ7U1U 


9041 T406 <*743 966933 


F 

X 


M00055 188AG1 1 


UC2^oloiiMetLiver 

V^-*-r V/V/XV/XXXtXVUI V V/J- 


AQ6A 


S600R0 


9041 T06 az43 266934 


F 


M00055189BB12 


UC2<^loriMetLiver 


A965 


5557A9 


9041 7\T06 <T743 966Q39 


F 


M00055195DB10 


UC2-ColoiiMetLiver 




460797 


9041 P06 asr4^ 966Q41 


F 


M00055 198A-E05 


UC2-ColoiiMetLiver 


AQ67 


A76199 


9041 T?07 crz43 966943 


F 


M00055181CB07 


UC2-ColoiiMetLiver 


AQ6& 
H7O0 


S 1 6A&A 


9041 rt07 cr74^ 966Q4^ 


F 

X 


M00055187AF02 


UC2-ColoiiMetLiver 

x^f V-^-z^ vyxvyxxx ▼ xvixji ▼ %-/x. 


AQ6Q 


1 R770A 

X O / / U*T 


9041 H07 a7-43 966Q49 


F 


M00055188AH10 


UC2-ColoixMetLiver 


AQ70 


559000 

JJ7UUU 


9041 A0R o743 966958 


F 


M00055 179B-G07 


UC2-ColonMetLiver 


AQ71 


S5^oo9 


9041 ROR o^4^ 966959 

ZW*t X .DUO .gZi*t J Z.UU7J y 


F 

X 


M00055181C*B12 

lvluuV/JJ IO X . X-/ X ^ 


UC2-ColoiiMetLiver 


AQ77 

*±zr /Z 


796^9A 
ZZDJ5Z*!' 


9041 P08 tr-74^ 966960 

ZvyrX «\^V/0.g^*T«J Z,UU7UU 


F 

X 


M00055 182BC07 


UC2-ColoiiMetLiver 


AQ7^ 


S69RA0 


90A1 D08 a^43 966961 


F 

X 


M0005 5 1 8 3 C * A02 


UC2-ColonMetLiver 


AQ7A 


561 513 


9041 FOR a^43 966969 


F 

X 


M00055184CF01 


UC2-ColoiiMetLiver 


AQ7S 


A77HA6 


9041 FOR otA^ 966963 
zuh x .ruo.gz4j_zuu70j 


F 

X 


M0005 5 1 85D*F07 


UC2-ColoiiMetLiver 


AO/7/£ 
*tV /O 


AO7A07 

4-yzoz / 


9HA1 IT-Oft ryvA^ 966Q6A 


17 
x^ 


M00055 1 87AF06 

xVXviUv/ J J X O / Jr\.X \J\J 


T IC2-ColonMetLiver 


AQ7T 
4V / / 


d jyzoz 


90A1 TTOR rr-yA^ 966Q65 




TVT00055 1 88R-F06 

IVXvUUJJ X OOJJ.iwUVy 


T JC2^oloiiMetLiver 


/IQ7Q 

/o 


S#£1 SI ^ 


9HA1 T^OO err ATI 966Q75 


F 


TVT0005 5 1 8 1 C • C09 


T JP9^o1oiiMetLiver 

\J VUIULLLT J.W IXJ1 V Wl 


AOTO 


D3003Z 


9HA1 POQ rr-rA^ 966Q76 


r 


M00055 1 89BF05 


I IC2-ColoiiMetLiver 

V-/ V/vlvlU'lW UUl V Wl 


4yoU 


DDZozy 


9AA1 flAQ rrrrAI 966QR0 


F 


7V/T0005 5 1 87 A -002 

1YxvJ\JW^«J lO 1/x.VJvZ/ 


T IC2-ColoiiMetLiver 

v7 V^u wUlvlilTXWlul V Wl 


/ino i 




90A1 A/fOO CT-7A3 966QR6 
ZU*f X .IVlU^.gZ^fO^ZOO^oO 


r 


M00055 1 95 A-B08 


UC2-ColoiiMwtLiver 




SSfi/SS9 


90A1 ATAQ rtfA.'X 9660R7 
ZU^f l.lNl/y.gZ*fJ_ZD070 / 


F 

X? 


M00055195HF1 1 


T JC2-ColoiiMw1Liver 

\J V/lvlUTlV IX fl V Wl 




AOA1 SA 


9HA1 T>C\Ck mA.'X 966QR0 
ZUH x .JrUy .gZ'»«^_ZOO^oy 


F 


M00055 1 98 TVH08 


TJC2-ColoiiMwtLiver 






90A1 A 1 H a^l^ 966Q00 


F 


M0005 5 1 79C F 1 1 

IVIUUUJJ X / s\~/.X X X 


ITC2^oloiiMwtLiver 






90A1 F10o^A3 966995 


F 

X 


M000551 85D*H01 

J.VXV7V/ V/^-' -J X .X XV X 


UC2-ColoiiMetLiver 

V-^^— r V'V/XV/XXl f J.V|iWA r 




A1 ^Q1 S 


90A1 TIOcvrA^ 966O0R 

ZU*f X .XlU.gZH J_Z.UU7 J^O 


F 


M 00055 1 89O*C01 


UC2-Q)loiiMetLiver 

v-*^* \yvx\ rill TAVLUl » 


AQfl7 




90A1 T lO ofrATK 967001 
ZU*f X Aj 1 U.gZ*r^__ZD /Uul 


F 


M000 5 5 1 94 A • A0 1 

1V1UUUJJ X j^txV.iTk.V/ X 


UC2-ColoiiMetLiver 

V7 V/viwiii»iwuwi v wi 


/iQOo 

*t700 




9HA1 niH rrrrA^ 967H0A 
ZU^f X ,LllU.gZ*f j ZO /UU*t 


F 
r 


M0O055 1 96DF07 

L\X\J\J\J J -J X 7UX/.X VI / 


I JC2-ColonMetLiver 

\^/Xmt V/V/XvXXXTXVtiJX T VI 




sso7i 0 

DOs* / XU 


90A1 All rr-rA^ 967O06 
ZUH X .AX I -gZ** D_ZSJ 1 UUU 


F 

X 


M 0005 5 1 79C -H02 

XVlvvvJJ X / 7V/«l*vA« 


UC2-ColoixMetLiver 


4QQO 


jOZj07 


90A1 R1 1 cr^A3 967007 

Z Lff 1.D1X .g^*r J 1 \J\f ' 


F 


M00055 181 CD06 


UC2-ColoiiMetLiver 


AQQ 1 




OOA1 F1 1 rrrrAI 96701 1 
Zl/*f i.rii .gZ' c r^_ZO / U X X 


F 

X 


M00055 1 86AD04 


UC2-ColonMetLiver 


4992 


560003 


2041 .Kl 1 .gz43_2670 16 


F 


M00055193A:A08 


UC2-ColoxiMetLiver 


4993 


550704 


2041.Mll.gz43_267018 


F 


M00055195A:C10 


UC2-ColoiiMwtLiver 


4994 


562569 


2041.Oll.gz43_267020 


F 


M00055196D.H02 


UC2-ColoiiMetLiver 


4995 


561963 


2041.D12.gz43_267025 


F 


M00055183C:D07 


UC2-<^loiiMetLiver 


4996 


557867 


2041.F12.gz43„267027 


F 


M00055186A:E08 


UC2-ColoiiMeti,iver 


4997 


143218 


2041 .M12.gz43_267034 


F 


M00055195AE07 


UC2-ColoiiMwtLiver 


4998 


495074 


2041 .N 12.gz43_267035 


F 


M00055196AC04 


; UC2-ColoiiMetLiver 
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Table 2 



SEQ 
ID 

NO ( 


XUSTEP 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


4000 


JUV/Z. J. — > 


2041 J13 ffz43 267047 


F 


M00055191D:C05 


UC2-ColonMetLiver 




SS841^ 

JJO't 1j 


2041 LI 3 o:z43 267049 


F 


M00055194A:E07 


UC2-ColonMetLiver 


jUUI 




9041 1M13 az43 267051 


F 


M00055196A:E01 


UC2-ColonMetLiver 




S5RSR9 

J JO J oz 


9041 F14crz43 267058 


F 


M00055185A:G11 


UC2-ColonMetLiver 




S4066S 

•JH^UvJ J 


2041H14ffz43 267061 


F 


M00055188C:F08 


UC2-ColoiiMetLiver 




4S1 544 


9041 K14 az43 267064 


F 


M00055193A:C06 


UC2-ColonMetLiver 


sons 


4S0693 


2041 M14 2z43 267066 


F ! 


M00055195B:B09 


UC2-ColonMetl,iver 


^nn6 


JJIOJJ 


9041 Ol 5 az43 267076 


F 


M00055187C:C02 


UC2-ColonMetLiver 




99 R4 


2041 115 ffz43 267078 


F 


M00055190A:A05 


UC2-ColonMetLiver 


^nftR 


SSORR^ 

JJ700J 


9041 T15o;z43 267079 


F 


M00055192A:A09 


UC2-ColonMetLiver 




614^60 


9041 Ml 5 ez43 267082 


F 


M00055195B:C04 


UC2-ColoiiMetLiver 


jUIU 


<J^9^7 


9041 N15 az43 267083 


F 


M00055196A:H07 


UC2-ColonMetLiver 


JUII 


SSORR^ 

JJ700J 


9041 F16crz43 267090 


F 


M00055185B:B01 


UC2-ColonMetLiver 


^ni 9 


4R1 T36 

HOI IjO 


9041 F16*>z43 267091 


F 


M00055186C:A02 


UC2-ColonMetLiver 




^QRR^ 


9041 (116ffz43 267100 


F 


M00055197B:A10 


UC2-ColonMetLiver 


^ni4 




9041 P16az43 267101 


F 


M00055198D:A12 


UC2-ColoiiMetLiver 


-SOI ^ 


S61^S1 


9041 R17az43 267103 


F I 


M00055181C:H01 


UC2-ColonMetLiver 


^ni 6 


SS6RR1 


9041 CM OVA'S 267104 


F 


M00055182C:E09 


UC2-ColonMetLiver 


JUl / 


S405RR 


2041 G17ffz43 267108 


F 


M00055187C:E07 


UC2-ColoiiMetLiver 


^01 R 




9041 TsT17 pz43 267115 


F 


M00055196B:A09 


UC2-ColonMetLiver 


JUl7 




9041 P17crz43 267117 

Zv*t X .X X / .gZ.*TJ ^.U /XX/ 


F 


M00055198D:B08 


UC2^olonMetLiver 


jUZU 


£94044 


9041 KIR <T743 267128 


F 


M00055193B:A08 


UC2-ColonMetLiver 


^fl91 


^691^6 


2041 N18 az43 267131 


F 


M00055196B:C06 


UC2-ColonMetLiver 


^fY99 
JUZZ 




9041 HI 9 az43 267137 


F 


M00055184A:G02 


UC2-ColonMetLiver 




SS79QR 


9041 Ir19az43 267140 


F 


M00055187D:A08 


UC2-ColonMetLiver 


5094 




9041 H19tyz43 267141 


F 


M00055188D:D05 


UC2-ColonMetLiver 




4S5R14 


9041 T1Qpz43 267142 


F 


M00055190A:F11 


UC2-ColonMetLiver 




S4QR9Q 


9041 A90 jrz43 267150 


F 


M00055181A:E01 


UC2-ColonMetLiver 


SH97 


47^749 


9041 T*91 az43 267167 


F 


M00055181D:D05 


UC2-ColonMetLiver 


Jl/Zo 


JJ7JJJ 


9041 T91 crz43 267168 


F 


M00055182D:E06 


UC2-ColonMetLiver 


^09q 


JJ70ZJ 


9041 TT91 az43 967173 

ZUH X .llZ X .gtrtJ *~\J I ± i ~J 


F 


M00055188D:H03 


UC2-ColonMetLiver 




£9^QRR 


9041 H91 a743 9671 80 

. ZUt 1 . V/Z 1 .g^t J^Z-U ; low 


F 


M00055197CD10 


UC2-CoIonMetLiver 




401 960 


9041 P91 a743 967181 
zut i ,rz i .g" » J i xox 


F 


M00055198DG03 


UC2-ColonMetLiver 


jUjz 


^7A44 


9A41 A99 az43 967189 


F 


M00055181A:G02 


UC2-ColonMetLiver 




41 1 1 1 3. 


9041 F99 oz4^ 967186 


F 


M00055185CB01 


UC2-ColoiiMetLiver 


5034 


557961 


2041.C23.gz43 267200 


F 


M00055182D:H08 


UC2-ColonMetLiver 


5035 


559194 


2041.A24.gz43 267214 


F 


M00055181A.H01 


UC2-ColonMetLiver 


5036 


558463 


2041.H24.gz43 267221 


F 


M00055189A:C11 


UC2-ColonMetLiver 


5037 


560859 


2041.I24.gz43 267222 


F 


M00055190C:G08 


UC2-ColonMetLiver 


5038 


456697 


2041.N24.gz43 267227 


F 


M00055196C:B07 


UC2-ColonMetLiver 


5039 


559127 


2054.E01.gz43 267234 


F 


M00055288B:D01 


UC2-ColonMetLiver 


5040 


559460 


2054.L01.gz43_267241 


F 


M00055300C:F11 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


5041 


694133 


9054 M03 <*743 967974 

^U J"t.1V1U«J .gZrrJ Z*KJ i JL / *t 


-p 

X 


M00055309R / R1 0 


1109-^^10111^/1^1 iwr 1 

\J — V^OlLliliVXC LJ_rl VC1 


5049 


1070 
iu iy 


9054 1M05 a743 967^06 


r 


M00055^09"R-F07 


TTP9-Po1nnlV/rptT it/pt 


504^ 


5501 13 

JJ71 U 


9054 A 07 a743 967396 

ZU-/*r./\U f .g£>tJ £.\J 1 DZ*\J 




M00055989AA01 

lVXv/UUJJZOZ/l.X\V X 




5044 


500337 


9054 T13 crz4^ 9674^31 




M0005 59071^^09 


TTP9-Pr%1rmlV>fptT ivpr 


5045 


558730 


9054 PI 4 <y^4^ 967449 


F 


M00055988DA0^ 


T 109-1^010111^/1^1 ivpr 

^ V_*^ V> WlWli±YXC<LX_fl V Ol 


5046 


561770 

JO X / / ~ 


9054 T14 ar4^ 967446 




M00055996AC05 


T TP9-r , nlnnMetT .iver 

W V/UlUlliVHj LX-il V vl 


5047 


*3 07581 


9054 HI 7 074^ 


F 

X 


M00055904RC0^ 


T TP9-.r , n1onMe1-T ,iver 

KJ V/^r v/ V/IUUIVXvIXjI V vl 


5048 


560490 


9054 H18 <yy4^ 967516 


F 

X 


M0005 5^05r Tj08 

IYIWV/JJjV/JV^.J-'V/O 


TTP9-PolotiMptT ivpr 

w V^^r" v^V/lUXJXVXCULrX VCI 


5040 


550798 


9054P10cw^n 967590 


F 

X 


M00055986AH08 


T TP9-Pn1nnlVretT .iver 

VJ V/A V/UlUUlTltUji V vl 


5050 


55Q531 

J J7JJ 1 


9054 1490 a743 967541 

ZUJ^.lxZfU.JJZ/TJ ZVJ/ j^ri 


F 

X 


M00055904RO01 


T Tp 9 -Pol onlVfptT ivpr 

\J \-rZ* ""V^wlUlilVXCLXvl VCI 


5051 


93Q61 


9054 1O0 <*743 967544 


F 


1^00055^004-^06 

iVXV/UU J J J U V/O 


T TP9-Po1onlVTptT ivpr 

U V^^L vJlvllllVXC ULf 1 V Ci 




558446 
33 O^tHO 


9054 T91 <V743 967558 


F ' 


M00055906P-F08 
lvxuu \j j j z.y o . Xw \j o 


TTP9~Po1on'MptT ivpr 

W VvZ. ~\^UlUXXLVXCLX-/l VCI 


505^ 


5£993A 
30ZZ30 


9055 If 01 ot4^ 967694 

ZUJJ .JS.U I .gZHO ZQ / OZ*r 


F 


M0005 5^9^Tj • A 1 9 


TTP9-PolonA/TptT ivpr 


505zl 


55Q77£ 

JJ7 / /O 


9055 "M01 o^43 967697 
ZU J .5. IN U 1 .gZ*r .5 zo/oz/ 


F 


M00055^^0Tj'1 1 T1 9 


TTP9-Po1onTV/TpfT ivpr 

KJ V^Zr ~^01UJLULVXCLX-il VCl 




DDyD ft 


9fK^ Tftzl c*t43 967A71 
ZUJJ .JU*f.gZ £ t3_ZO / O / 1 


F 


A/T0OO55^99P*i r Tl 1 
lvxUUU J 3 3ZZV^. VJx 1 


TTP9-Po1onlV/rptT ivpr 
UvZ ~ V-/ UIUJULiVXC LI /l VCI 


3U3D 


JJ007U 


ZUDj.lOJ.gZ*f J_ZO /OoO 


F 


A/f 00055^9 1 Tl-Tll 0 
1VIUUU3 33 Z 115 .15 x U 


TTP9-Po1otilVApfT iT/pr 

U - V^UlOIliVXC LXjI V Cl 


jUj / 


jjZO / 3 


90^5 TO^ rr-T-A^ 967^517 
ZUjJ.JUJ.gZ'f j_ZO /Oo / 


F 
F 


M00055^99IY A01 

1V1UUU J J J ZZly .AU 1 


TTP9-Po1onlV^ptT ivpr 1 

U\_yZ, — \^UlUJLllVlCLX_flVCI 


DVDo 


/i >i /conn 


90^ TH7 rr«74^ 9677 1Q 


F 


M0OO55^99Ti*P19 

1VIUUUJ JjZZI/. V_/X^ 


TTP9-Po1onl\/rp+T ivpr 
U ~ V^UHJ1X1VXCLX_/1 VCI 




JJ7 1 1Z 


905^ POR crr/L^ 061109. 
ZU J D. U o. gZ*r J_ZO / /ZO 


F 

JT 


A/f00055^19 A*T>1 1 

1VXUUUJ J J lZ/l.X/1 1 


T TP9 -PolonTV/Tpf T ivpr 


jUOU 


n-^-U /U / 




F 

X* 


Ayf OOO 5 5 *\ 1 9 A • F 1 0 

1VIUUU3 J J 1Z/\.X-/1U 


TTP9-Po1oTiTV/TptT ivpr 
u ^z~\^uiuiixviCLx-»ivcr 


3UOl 


33333V 


zujj.r iz.gz*tj_zo / /7j 


F 
r 


1V1UUU333 1 / V/.fU** 


TTP9-Pi^i1rfcri1V/f^*fT iTr*»t- 
U \^Z"*V*010IHVxcTJjXVCT 


3UOZ 


A79 1 ftSi 
*f //loo 


90^^ PI 9 cttA'X 9675105 
ZUj D.r lZ.gZH-J_ZO / oUj 


F 

JT 


M00055^^5ri- AO^ 
1V1UU U J JjjjU ./VUj 


TTP9-Po1onlVTptT ivpr 

U V^Z^v^UlUlXlVlCULrl VCI 


3U0.3 


Ol /013 


9H<^ T71*^ rr«yA^ 9£75lin 
ZUjj.111 J.gZ < t3_ZO /olU 


F 
r 


A/ronn55^i 5P- aoq 

1V1UUU3 3313*^ .AU7 


TTP9-PoTot%lV/rpiT tvpr 


3UO*f 


33yuz / 


90^^ T1^ tYTi^ 967R14 


F 

JT 


M0OO55^9 1 Ti*P 1 9 


TTP9-Po1oniVrptT ivpr 

U V^Zr ~ V^UlUiliVXC LX-/1 VCI 


506^ 


'HfoO / / 


9055 1T1^ f»-y47 967R16 


F 


A/f00055^94P-Ml 0 

1V1UUU3 JjZnvv .XX X U 


TTP9-Po1onlV/rptT ivpr 

C V^Z. •*V-/UlUiliVXC LX-rlVCl 


50££ 


^5977 


9055 TT1 5 mA'X 967R45J 
ZU J J.JS.1 J .gZ t tJ_ZO / o*f O 


F 


M00055^94n-T : i09 

1V1.UUUJ J JZtU . 15 UZ. 


T TP9 -PolonlVTptT ivpr 

U \^>Z. "•V^UIUXXLVXCLX-/! VCI 


5067 
3UO / 


^60538 


9055 H16 or4^ 967868 

ZvJ J.Vy IO.gZ*TJ Z.\J I OOO 


F 
r 


M00055^4f , -F1 1 


T Tp 9 -PolonlVfptT ivpr 

VJ wZ» wL;HJ1XLVXv5LX-/1VC1 


jUOo 


DOVDoy 


905^ FI17 <*t4^ 967517^ 
ZUJJ . JL/ 1 / .gZ 4 !- J__ZO / o / j 


F 

X* 


M00O55^ 1 TD-F 1 0 

1V1UUU3 DD IJJL/.JCrlv 


T TP9-Polor>A/fptT ivpr 

. \J v>Z ~\^UIUlXlVXOuLflVCX 


506Q 


55Q7Q4 


9055 1^1 O ot4^ 967008 


F 


M00055^ 1 0R-TT06 

1VXUUUJJ-? X^JJ.XXV/U 


TTP9-Po1onlVfptT ivpr 

ij V^^"*V^VJ1IJ ) XLLVXCLJ^1 VCI 


5070. 


1 ^8470 


905^ "R9H rr-r4^ 967Q10 
ZUj j.X5ZU.gZ 4 r J ZD/7I7 


F 


M00055^ 1 0R-F09 

1V1UUU333 lUXJ.HUZ 


TTP9-PolonlVfptT ivpr 

KJ \^J~ ~^UlL)lXiVXCLX-/l V Cl 


5071 


5551105 


9055 W90 crrA'X 967095 
zuj j.nzv/.gZT- j_zo /7/j 


F 


M00055^90T)-FOO 

1VIV/UUJJj^U1/.DU7 


T TP9-PolonlvfptT ivpr 

Uv/Z. V^UlWllXYXwlvUlVCX 


5079 
jU /z 


*t 1 33*r3 


90^*\ 1T91 9^7Q/Ld 
ZU J j . J?wZ 1 .gZ*fr J_ZO / 7*rr 


F 


A/f 00055^95 A-F1 9 
1V1UUU3 33Z3/V.13 1 Z 


TTP9-Po1or#TV/fptT ivpr 

iJ V_/UlUIliVXCLL/lVCl 


3U / o 


33*H /O 


ZUjj.UZI .gZ*fj ZO/y^fo 


F 
r 


lvlUUU3333'fL/.VJU / 


TTP9-Po1o«1V^ptT ivpr 

u v>>z - v^oiOxUvicix-'ivcr 


5 07 A 


zl'7'7'7^7 
*f / / / 3 i 


9H« T799 rr-y/l^ 9^7Q^A 


F 


N/T00055^ 1 6P-R1 0 
1V1UUU333 1015.151U 


T TP 9 .Pol nn lVfptT ivpr 
U v^Zr ~\^UlUlXiVlC LL/1VC1 


3U /3 


333 i\jy 


9fl^^ 099 n**A1 9^70^^ 
ZU3 3 . UZZ .gz*f j_ZO / y 3 O 


F 


1YIUUU333 ky\j>.\AJJ 


TTP9-Po1otilvTp+T ivpr 
U \^Z~V^0101xlVlciJ-flVcr 


5076 


554585 


2055 K22 ez43 267960 


F 


M00055325A:H02 


UC2-ColonMctLiver 


5077 


451429 


2055.N22.gz43_267963 


F 


M00055333C:F12 


UC2-ColonMetLiver 


5078 


551811 


2055 .A23 .gz43_267966 


F 


M00055308D:C09 


UC2-ColoiiMctLiver 


5079 


62458 


2055 .C23 .gz43_267968 


F 


M00055312D:A09 


UC2-ColonMetLiver ; 


5080 


551912 


2055 .D23 .gz43_267969 


F 


M00055314B:G07 


UC2-ColonMetLiver 


5081 


552857 


2055 .G23 .gz43_267972 


F 


M00055319C:C07 


UC2-ColonMetLiver 


5082 


555349 


2055.N23.gz43 267979 


F 


M00055333C:H07 


UC2-ColoiiMetLiver 
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3NSOOCID: <WO 021 4500 A2_l_> 



WO 02/14500 



PCT/US01/25840 



Table 2 



SEQ 
ID 

NO ( 


CLUSTER 


SEQ NAME 


OWE 
NT 


CLONE ED 


LIBRARY 


5083 


560259 


2056.F01.gz43_268003 


F 


M00055343D:C12 


UC2-ColonMetLiver 


5084 


550063 


2056.G01.gz43_268004 


F 


M00055344D:A09 


UC2-ColonMetLiver 


5085 


560544 


2056.M01.gz43_268010 


F 


M00055352B:E06 


UC2-ColonMetLiver 


5086 


561108 


205 6 .P0 1 .gz43_2680 1 3 


F 


M00055356C:H02 


UC2-ColonMetLiver 


5087 


512014 


2056.L02.gz43_268025 


F 


M00055351B:H12 


UC2-ColonMetLiver 


5088 


627386 


2056.D03.gz43_268033 


F 


M00055341C:F04 


UC2-ColonMetLiver 


5089 


561124 


2056.E03.gz43_268034 


F 


M00055342C:H06 


UC2-ColonMetLiver 


5090 


418482 


2056.H03.gz43_268037 


F 


M00055346A:E12 


UC2-ColonMetLiver 


5091 


558562 


2056.A04.gz43_268046 


F 


M00055336D:B03 


UC2-ColonMetLiver 


5092 


551912 


2056.B04.gz43_268047 


F 


M0O055338C:FO3 


UC2-ColonMetLiver 


5093 


552669 


2056.D04.gz43_268049 


F 


M00055341C:G12 


UC2-ColonMetLiver 


5094 


559857 


2056J04.gz43_268055 


F 


M00055348D:A01 


UC2-ColonMetLiver 


5095 


557372 


2056.M04.gz43_268058 


F 


M00055352B:H05 


UC2-ColonMetLiver 


5096 


558182 


205 6.N04 .gz43_26805 9 


F 


M00055354AA01 


UC2-ColonMetLiver 


5097 


552430 


205 6.B05 .gz43_268063 


F 


M00055338C:G04 


UC2-ColonMetLiver 


5098 


560932 


2056.F05.gz43_268067 


F 


M0O055343D:GO3 


UC2-ColonMetLiver 


5099 


559049 


205 6 .G05 .gz43_268 068 


F 


M00055345AD05 


UC2-ColonMetLiver 


5100 


449405 


2056.C06.gz43_268080 


F 


M00O55340B:CO3 


UC2-ColonMetLiver 


5101 


558769 


2056.G06.gz43_268084 


F 


M00055345B-.B03 


UC2-ColonMetLiver 


5102 


556216 


205 6 .K06 .gz43_268088 


F 1 


M00055350A-.F01 


UC2-ColonMetLiver 


5103 


553591 


2056.M06.gz43 268090 


F 


M00055352CA07 


UC2-ColonMetLiver 


5104 


561024 


2056.B07.gz43 268095 


F 


M00055338C:H06 


UC2-ColonMetLiver | 


5105 


560652 


2056.C07.gz43_268096 


F 


M00055340B:E06 


UC2-ColonMetLiver 


5106 


559296 


2056 F07 ez43 268099 


F 


M00055343D:H04 


UC2-ColonMetLiver 


5107 


560174 


2056.P07.gz43_268 109 


F 


M00055356D:C11 


UC2-ColonMetLiver 


5108 


552357 


2056.D08.gz43_268 113 


F 


M00055341D:H07 


UC2-ColonMetLiver 


5109 


627139 


2056.E08.gz43 268114 


F 


M00055342D-.F07 


UC2-ColonMetLiver 


5110 


509505 


2056.H08.gz43_268 1 17 


F 


M00055346B:G03 


UC2-ColonMetLiver 


5111 


560369 

\J \J \J *r 


2056.C09.gz43_268 128 


F 


M00055340C:D01 


UC2-ColonMetLiver 


5112 


559375 


2056.G09.gz43_268 132 


F 


M00055345B:F03 


UC2-ColonMetLiver 


5113 


449927 


2056.K09.gz43_268 136 


F 


M00055350B:B12 


UC2-ColonMetLiver 


5114 


449751 


2056.P10.gz43_268 157 


F 


M00055357AA09 


UC2-ColonMetLiver 


5115 


560370 


2056.C1 1 .gz43_268 160 


F 


M00055340D:D01 


UC2-ColonMetLiver 


5116 


561124 


2056.E1 1 .gz43_268 162 


F 


M00055342D:H05 


UC2-ColonMetLiver 


5117 


549041 


2056.Jll.gz43 268167 


F 


M00055349A:F07 


UC2-ColonMetLiver ! 


5118 


560695 


2056.Kll.gz43 268168 


F 


M00055350B:D09 


UC2-ColonMetLiver 


5119 


471712 


2056.Nll.gz43 268171 


F 


M00055354A:G11 


UC2-ColonMetLiver 


5120 


448285 


2056.F12.gz43_268179 


F 


M00055344AG11 


UC2-ColonMetLiver 


5121 


561068 


2056.N12.gz43 268187 


F 


M00055354A:H08 


UC2-ColonMetLiver 


5122 


562883 


2056.B13.gz43_268191 


F 


M00055339B:D10 


UC2-ColonMetLiver 


5123 


560717 


2056.C13.gz43_268192 


F 


M00055340D:F08 


UC2-ColonMetLiver 


5124 


560199 


2056.E13.gz43_268194 


F 


M00055343AC09 


UC2-ColonMetLiver 
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INSDOCID: <WO_021 4S00A2_I_> 



WO 02/14500 



PCT/US01/25840 



Table 2 



SEQ 
ID 

NO C 


XUSTEB 


SEQ NAME 


OREE 
NT 


CLONE ID 


LIBRARY 


5125 


618311 


2056 F13 ez43 268195 


F 


M00055344B:A12 


UC2-ColoiiMetLiver j 


5126 


455379 


2056.M13.gz43 268202 


F 


M00055353B:B09 


UC2-ColoiiMetLiver 


5127 


471272 


2056.013. gz43 268204 


F 


M00055356A:B06 


UC2-ColonMetLiver 


5128 


562576 


2056 D14 ez43 268209 


F 


M00055342A:E08 


UC2-ColoiiMetLiver 


512Q 


626061 


2056 014 ez43 268220 


F 


M00055356A:D04 


UC2-ColonMetLiver 


5130 


558867 
~j -j \j \j \j i 


2056 A15 sz43 268222 


F 


M00055337D:B10 


UC2-ColonMetLiver 


5131 
j x-> i 


553875 


2056 K15 ez43 268232 


F 


M00055350C:G11 


UC2-ColonMetLiver 


5137 


556365 


2056 015 ez43 268236 


F 


M00055356A:D09 


UC2-ColonMetLiver 


^ i — ' — ' 


453004 


2056.A16.gz43 268238 


F 


M00055338A:A02 


UC2-ColonMetLiver 




558653 


2056 C16 ez43 268240 


F 


M00055341BA01 


UC2-ColonMetLiver 


5135 


560868 


2056 E16 sz43 268242 


F 


M00055343A:G03 


UC2-ColonMetLiver 


5136 


551305 


2056G16ez43 268244 


F 


M00055345D:A04 


UC2-ColonMetLiver 


5137 


559004 


2056.H16.gz43_268245 


F 


M00055347A:C03 


UC2-ColonMetLiver 


5138 


238146 


2056.N16.gz43 268251 


F 


M00055354C:C12 


UC2-ColonMetLiver 


5139 


559971 


2056.A17.gz43_268254 


F 


M00055338A:A03 


UC2-ColoiiMetLiver 


5140 


559963 


2056 117 ez43 268262 


F 


M00055348B:A02 


UC2-ColonMetLiver 


5141 


549781 


2056.O17.gz43 268268 


F 


M00055356B:B04 


UC2-ColonMetLiver 


5142 


491240 


2056.F18.gz43 268275 


F 


M00055344C:C08 


UC2-ColonMetLiver 


5143 


559087 


2056 G18 ez43 268276 


F 


M00055345D:D01 


UC2<:olonMetLiver 


5144. 

O X*T*T 


103173 


2056 118 ez43 268278 


F 


M00055348B:B05 


UC2-ColonMetLiver 


5145 


476698 


2056 M 18 sz43 268282 


F 


M00055353C:A05 


UC2-ColoiiMetLiver 


5146 


560254 


2056 C19 2z43 268288 


F 


M00055341B:C07 


UC2-ColonMetLiver 


514.7 

•J X*T / 


403419 


2056 D19 ez43 268289 


F 


M00055342B:C08 


UC2-ColonMetLiver 


5148 


400047 


2056 E19 ffz43 268290 


F 


M00055343C:B11 


UC2-ColonMetLiver 


5140 


559093 


2056 G19 sz43 268292 


F 


M00055345D:D12 


UC2-ColoiiMetLiver 


5150 


458618 


2056 J19 ez43 268295 


F 


M00055349C:G07 


UC2-ColonMetLiver 


5151 


558890 


2056 F20 ez43 268307 


F 


M00055344C:F09 


UC2-Q)lonMetLiver 


515? 


559746 


2056G20ez43 268308 


F 


M00055345D:E02 


UC2-ColonMetLiver 


51 53 


567849 


2056J20ez43 268311 


F 


M00055349C:H12 


UC2-ColonMetLiver 


5154 


5611 16 


2056A21sz43 268318 


F 


M00055338A:H10 


UC2-ColoiiMetLiver 


51 55 


558334 


2056 G21 ez43 268324 


F 


M00055345D:E05 


UC2-ColonMetLiver 


51 56 


552258 


2056 122 ez43 268342 


F 


M00055348B:F05 


UC2-ColonMetLiver 


51 57 


5603 1 1 


2056 K22 ez43 268344 


F 


M00055351A:C09 


UC2-ColonMetLiver 


51 58 


560714 

JVJvJ / X*T 


2056 A23 ez43 268350 


F 


M00055338B:F06 


UC2-ColonMetLiver 


51 59 


496460 


2056 D23 ez43 268353 


F 


M00055342C:E03 


UC2-ColoiiMetLiver 


5160 


487623 


2056.N23.gz43_268363 


F 


M00055355A:A10 


UC2-ColonMetLiver 


5161 


559806 


2056.F24.gz43_268371 


F 


M00055344C:H09 


UC2-ColonMetLiver 


5162 


560088 


2056.G24.gz43_268372 


F 


M00055346A:B01 


UC2-ColonMetLiver 


5163 


556011 


2056 J24.gz43 268375 


i f 


M00055349D:F02 


UC2-ColonMetLiver 


5164 


555095 


2056.L24.gz43_268377 


F 


M00055352B:E01 


UC2-ColonMetLiver 


5165 


558959 


2056.M24.gz43_268378 


F 


M00055353D:C05 


i UC2-ColonMetLiver 


5166 


466887 


2065.M01.gz43_268394 


F 


M00055373C:F05 


UC2-ColonMetLiver 
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3NSD0CID: <WO 021 4500Aa_l_> 



WO 02/14500 



PO7US01/25840 



Table 2 



SEQ 
ID 
NO 


:lustef 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


5167 


495799 


2065 .C02.gz43_268400 


F 


M00055360C:C05 


UC2-ColonMetLiver 


5168 


550562 


2065 .E02.gz43_268402 


F 


M00055363A:F07 


UC2-ColonMetLiver 


5169 


523606 


2065 .H02 .gz43_268405 


F 


M00055367D:A05 


UC2-ColonMetLiver 


5170 


549163 


2065 J02 az43 268407 


F 


M00055370B:F07 


UC2-ColotxMetLiver 


5171 


562844 


2065 .P02 .gz43_2684 1 3 


F 


M00055377B:E10 


UC2-ColonMetLiver 


SI 7? 


636651 


2065 F03 cz43 268419 


F 


M00055364D:E09 


UC2-ColoiiMetLiver 


SI 7^ 


543323 


2065 M03 ez43 268426 


F 


M00055373C:H10 


UC2-ColonMetLiver 


SI 74 


556416 


2065 L04 sz43 268441 


F 


M00055372D:C11 


UC2-ColonMetLiver 


S17S 


498509 


2065 M04 cz43 268442 


F 


M00055373D:B08 


UC2-ColonMetLiver 


Jl / u 




2065 N04 ez43 268443 


F 


M00055374C:F01 


UC2-ColonMetLiver 


SI 77 
ji / / 


SS9696 


2065 D05 ez43 268449 


F 


M00055361D:H12 


UC2-ColonMetLiver 


SI 78 


484748 


2065 K05 sz43 268456 


F 


M00055371D:B08 


UC2-ColonMetLiver 


5179 


561229 


2065.005 .gz43 268460 


F 


M00055376A:A08 


UC2-ColonMetLiver 


SI 80 


627297 


2065.A06.gz43 268462 


F 


M00055358A:F09 


UC2<:olonMetLiver j 


5181 


456469 


2065.C06.gz43 268464 


F 


M00055360C:G11 


UC2<:olonMetLiver 


SI 87 


55203 1 


2065.J06.gz43 268471 


F 


M00055370C:D02 


UC2-ColoiiMetLiver 


5183 

J X OJ 


527410 


2065.G07.gz43 268484 


F 


M00055366D:G10 


UC2-ColonMetLiver 


SI 84 


SS6959 


2065 107 2z43 268486 


F 


M00055369A:H08 


UC2-ColonMetLiver 


S18S 

J X OJ 


501534 


2065.B08.gz43 268495 


F 


M00055359C:H09 


UC2-ColonMetLiver 


S186 


91 178 

.7 X X # O 


2065.E08.gz43 268498 


F 


M00055363C:E02 


UC2^oloiiMetLivcr 


SI 87 

•J xo / 


461325 


2065 P08 sz43 268509 


F 


M00055377C.G01 


UC2-ColonMetLiver 


5188 

-J xoo 


561485 


2065 A09 ez43 268510 


F 


M00055358B:C01 


UC2-ColonMetLiver 


5189 


557783 


2065.109.gz43_268518 


F 


M00055369C:D04 


UC2^olonMetLiver 


5190 


474580 


2065.O09.gz43_268524 


F 


M00055376B:A03 


UC2-ColonMetLiver 


5191 


450637 


2065 ,P09.gz43_268525 


F 


M00055377C:H08 


UC2-ColonMetLiver 


5192 


504167 


2065 .N10.gz43_268539 


F 


M00055374D:C09 


UC2-ColonMetLiver 


5193 


561279 


2065.O10.gz43_268540 


F 


M00055376B:A11 


UC2-ColoiiMetLiver ! 


5194 


451124 


2065 .B 1 1 .gz43_268543 


F 


M00055359D:C12 


UC2-ColoiiMetLiver 


5195 


558412 


2065 Dll ez43 268545 


F 


M00055362C:B06 


UC2-ColoiiMeftLiver 


SI 96 


558334 


2065 Fll ez43 268547 


F 


M00055365C:D12 


UC2<;olonMetLiver 


SI 97 


SS2265 


2065 Kllez43 268552 


F 


M00055372A:H02 


| UC2-ColoiiMetLiver 


SI 98 


SS5796 


2065 Mil sz43 268554 


F 


M00055374A:B11 


UC2<:olonMetLiver 


SI 99 


558507 


2065Nllez43 268555 


F 


M00055374D:F10 


UC2-ColonMetLiver 


S700 


453846 


2065 Oil ez43 268556 


F 


M00055376B:B01 


UC2-ColonMetLiver 


S901 


556430 


2065 G12 ez43 268564 


F 


M00055367A:B11 


UC2-ColonMetLiver 


5202 


549984 


2065J12.gz43„268567 


F 


M00055370D:F06 


UC2-ColonMetLiver 


5203 


561825 


2065.F13.gz43_268579 


F 


M00055365C:F11 


UC2-ColoiiMetLiver 


5204 


560959 


2065J13.gz43_268583 


F 


M00055370D:H07 


UC2-ColonMetLiver 


5205 


636876 


2065.M13.gz43_268586 


! F 


M00055374A:E01 


UC2-ColoiiMetLiver 


5206 


562801 


2065.P13.gz43_268589 


F 


M00055377D:F12 


UC2-ColonMetLiver 


5207 


559675 


2065.D14.gz43 268593 


i F 


M00055362C:G08 


UC2-ColonMetLiver 


5208 


632260 


2065 .K14.gz43_268600 


F 


M00055372B:E01 


1 UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 

NO ( 


:lustef 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


5209 


490308 


2065.L14.gz43_268601 


F 


M00055373B:A09 


UC2-ColonMetLiver 


5210 


559828 


2065 .B 1 5 .gz43_268607 


F 


M00055359D:H02 


UC2-ColonMetLiver 


5211 


559022 


2065.C15.gz43_268608 


F 


M00055361A:C01 


UC2-ColonMetLiver 


5212 


-556850 


2065 .F 1 5 .gz43_2686 1 1 


F 


M00055366A:B04 


UC2-ColonMetLiver 


5213 


508515 


2065J15.gz43_268615 


F 


M00055371A:B05 


UC2-ColonMetLiver 


5214 


465610 


2065.A17.gz43_268638 


F 


M00055358D.G04 


UC2-ColonMetLiver 


5215 


463304 


2065.E17.gz43_268642 


F 


M00055364B:D01 


UC2-ColonMetLiver 


5216 


551551 


2065 .F 1 7.gz43_268643 


F 


M00055366A:H08 


UC2-ColonMetLiver 


5217 


559963 


2065 .El 8 .gz43_268 65 8 


F 


M00055364B:E10 


UC2-ColonMetLiver 


5218 


562027 


2065J18.gz43_268663 


F 


M00055371A:H10 


UC2-ColonMetLiver 


5219 


561868 


2065 .Kl 8 .gz43__268664 


F 


M00055372B:F11 


UC2-ColonMetLiver 


5220 


561610 


2065J19.gz43_268679 


F 


M00055371B:D01 


UC2-ColonMetLiver 


5221 


562565 


2065.M19.gz43 268682 


F 


M00055374B:D05 


UC2-ColonMetLiver 


5222 


235456 


2065.P19.gz43 268685 


F 


M00055378A:B12 


UC2-ColonMetLiver 


5223 


553615 


2065.C20.gz43 268688 


F 


M00055361B:F12 


UC2-ColonMetLiver 


5224 


534054 


2065E20ez43 268690 


F 


M00055364C:B08 


UC2-ColonMetLiver 


5225 


511746 


2065F20gz43 268691 


F 


M00055366B:C04 


UC2-ColonMetLiver 


5226 


559495 


2065 A21 ez43 268702 


F 


M00055359B:F03 


UC2-ColonMetLiver 


5227 


559675 


2065 C21 sz43 268704 


F 


M00055361B:G08 


UC2-ColonMetLiver 


5228 


553705 


2065 J21 ez43 268711 


F 


M00055371B:F01 


UC2-ColonMetLiver 


S27Q 


559764 


2065 M21 ez43 268714 


F 


M00055374B:F06 


UC2-ColonMetLiver 


5230 


482090 


2065 N21 ez43 268715 


F 


M00055375BJI02 


UC2-ColonMetLiver 


5231 


562459 


2065.C22.gz43 268720 


F 


M00055361C:E05 


UC2<IolonMetLiver 


5232 


553904 


2065.K22.gz43 268728 


F 


M00055372C:E03 


UC2-ColonMetLiver 


5233 


555878 


2065.N22.gz43 268731 


F 


M00055375B:H07 


UC2-ColonMetLiver 


5234 


559676 


2065.A23.gz43 268734 


F 


M00055359B:G09 


UC2-ColonMetLiver 


5235 


505971 


2065.F23.gz43 268739 


F 


M00055366C:B11 


UC2-ColonMetLiver 


5236 


62458 


2065.H23.gz43 268741 


F 


M00055368D:E03 


UC2-ColonMetLiver 


5237 


207099 


2065.023 .gz43 268748 


F 


M00055377A:B11 


UC2-ColonMetLiver 


5238 


491635 


2065A24ez43 268750 


F 


M00055359B:H07 


UC2-ColonMetLiver 


5239 


32021 


2065 D24 ez43 268753 


F 


M00055363A:D02 


UC2-ColonMetLiver 


5240 


448741 


2065N24ez43 268763 


F 


M00055375C:C08 


UC2-ColonMetLiver 


5241 


504560 


2078 HOI sz43 269031 


F 


M00042953B:D02 


UC3-NonnColon 


5242 


516415 


2078 M01 az43 269036 


F 


M00042965A:G02 


UC3-NormColon 


5243 


452981 


2078J02ez43 269049 


F 


M00042958B:H04 


UC3-NonnColon 


5244 


514594 


2078.L02.gz43_269051 


F 


M00042963B:E12 


UC3-NoimColon 


5245 


451923 


2078.B03.gz43_269057 


F 


M00042586C:E01 


UC3-NormColon 


5246 


512721 


2078J03.gz43_269065 


F 


M00042958C:D04 


UC3-NormColon 1 


5247 


512051 


2078.K03.gz43 1 _269066 


F 


M00042960D:C11 


UC3-NoimColon 


5248 


506920 


2078.P03.gz43_269071 


! F 


M00042976B:F05 


UC3-NormColon 


5249 


451401 


2078.F04.gz43_269077 


F 


M00042628B:G11 


UC3-NormColon 


5250 


448594 


2078I04.gz43_269080 


F 


M00042955D:H03 


I UC3-NormColon 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


OWE 
NT 


CLONE ID 


LIBRARY 


5251 


452898 


2078.M04.gz43 269084 


F 


M00042965B:G08 


UC3-NortnColon 


5252 


505679 


2078.N04.gz43 269085 


F 


M00042969B:E01 


UC3-NormColon 


525^ 


451618 


2078 O04 ez43 269086 


F 


M00042972C:F07 


UC3-NormColon 


5254 


451841 

"T~J X OT X 


2078 G05 ez43 269094 


F 


M0004295 1D:C05 


UC3-NormColon 


5755 


448090 


2078 105 ez43 269096 


F 


M00042956AH02 


UC3-NonnColon 


5256 


449000 


2078N05ez43 269101 


F 


M00042969B:G10 


UC3-NormColon 


5257 


373239 


2078 O05 ez43 269102 


F 


M00042972C:F09 


UC3 -NormColon 


5258 


451802 


2078.D06.gz43 269107 


F 


M00042623B:B09 


UC3-NormColon 


5259 


511746 


2078 E06 gz43 269108 


F 


M00042625D:C07 


UC3-NonnColoti 


5260 


516522 


2078 F06 ez43 269109 


F 


M00042628C:G10 


UC3 -NormColon 


5261 


423578 


2078.J06.gz43 269113 


F 


M00042959A:B07 


UC3-NoimColoii 


5262 


513888 


2078.E07.gz43 269124 


F 


M00042625D:E08 


UC3-NormColon 


5263 


452801 


2078.L07.gz43 269131 


F 


M00042963D:F11 


UC3-NormColon 


5264 


452775 


2078.M07.gz43 269132 


F 


M00042965C:E06 


UC3-NonnColon 


5265 


415825 


2078 C08 gz43 269138 


F 


M00042589D:D08 


UC3-NormColon 


5266 


447904 


2078.F08.gz43 269141 


F 


M00042628D:F12 


UC3-NormColon 


5267 


514142 


2078 J08 ez43 269145 


F 


M00042959A:E08 


UC3-NormColon 


5268 


Q2639 


2078 B09 ez43 269153 


F 


M00042587B:G07 


UC3-NoimColon 


S260 


4S7182 


2078 C09 ez43 269154 


F 


M00042589D:F02 


UC3-NormColon 


^270 


4477 1 0 


2078 D09 ffz43 269155 


F 


M00042623DC02 


UC3-NormColon 


^271 
jz / j. 


4S7500 


2078 O09 ez43 269166 


F 


M00042973B :B 1 0 


UC3-NormColon 


5272 


457204 


2078 A10 ez43 269168 


F 


M00042583D:F03 


UC3-NormColon 


S773 


447025 


2078 E10 2z43 269172 


F 


M00042626B:A12 


UC3-NormColon 


5274 


457324 


2078 G10 ez43 269174 


F 


M00042952A:H12 


UC3-NormColon 


5775 


506901 


2078 All ez43 269184 


F 


M00042583D:F11 


UC3-NormColon 


5776 


507349 


2078 Cll 2z43 269186 


F 


M00042590BG02 


UC3-NormColon 


5277 


452052 


2078 Fll cz43 269189 


F 


M00042629A:E11 


UC3-NormColon ; 


5278 


500853 


2078 Gil 2z43 269190 


F 


M00042952BA08 


UC3-NormColon 


527Q 


514160 


2078 Jll ez43 269193 


F 


M00042959BE1 1 


UC3-NormColon 


5780 


41 8340 


2078 C12 ez43 269202 


F 


M00042590CC09 


UC3-NontiColon 


5281 


452142 


2078 H12 sz43 269207 


F 


M00042954A:F04 


UC3-NormColon 


5282 


451994 


2078 112 ez43 269208 


F 


M00042957A:D06 


UC3-NomiColon 


528^ 


452615 


2078 J 12 ez43 269209 


F 


M00042959CC06 


UC3-NonnColon 


5284 


452899 


2078 K12 ez43 269210 


F 


M00042962AG04 


UC3-NormColon 


5285 


446438 


2078 K13 ez43 269226 


F 

X 


M00042962B A03 


UC3-NonnColon 


5286 


452462 


2078.D14.gz43_269235 


F 


M00042624D:B05 


UC3-NonnColon 


5287 


510169 


2078J14.gz43_269241 


F 


M00042959D:A05 


UC3-NormColon 


5288 


508088 


2078.B16.gz43_269265 


F 


M00042588A:G10 


UC3-NormColon 


5289 


452031 


2078.E16.gz43_269268 


F 


M00042626D:D12 


UC3-NormColon 


5290 


513155 


2078.K16.gz43_269274 


F 


M00042962C:D05 


UC3-NonnColon 


5291 


513156 


2078X16.gz43_269275 


F 


M00042964C:D06 


UC3-NormColon 


5292 


452695 


2078.N16.gz43_269277 


F 


M00042970D:D02 


UC3-NoimColon 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


OKIE 
NT 


CLONE ID 


LIBRARY 


5293 


448332 


2078J17ez43 269289 


F 

X 


M00042060AB10 


T TPl-NnrmPoInn 

KJ V-/ J 1> \ja XllV^vllUIi 


5294 


513168 


2078 LI 7 ez43 269291 


F 


M00042964C *D 1 0 


T TP 1 -Norm Pnl nn 

V7 V-^«7 i ^ V7X llXVwVJXUXl 


5295 


510717 


2078 N17 ez43 269293 


F 


M00042071 A-B00 


KJ J 1 ^ VII 1 1 1 UXvlXX 


5296 


456545 


2078 01 7 ffz43 269294 


F 


M00047074DB07 


T TP 1 - Wnrm Pr»1 nn 


5297 


4244 


2078P17ffz43 269295 


F 

X 


M00047 078R-F01 


T TP 1 JMnrm Pnl rtn 


5298 


452376 


2078 El 8 ffz43 269100 


F 

X 


M00047677B- A 1 0 


T TP 1 -NTnrm Pr»1 nn 
Uv>J 1> UXXIlV>UXv>XX 


5299 


451993 


2078 H18 ez43 269101 


F 

X 


M0ft047054r>-r)ft4 


T TP1 -Nfr*rmPr»1rm 
UV/J -IN yJl IIiv^UIULL 


5300 


451929 


2078 118 ffz41 269104 


F 

X 


M00O42057TVC00 


¥ TPl-MnrmPr»1nn 

\J V-r^7 "IN KJl i IlV^OlOll 


5301 


511351 


2078 K18 ez43 269106 


F 

X 


M0ft049069n-B00 


T TP 1 -TsJnrm Pr\1 nn 
\J v^. D -IN UI lllV^UiOIl 


5302 


505858 

~J \J ~> O mJ %J 


2078B19ffz43 269111 


F 

X 


M00049 5 88P F07 


T TP 1 -lSJnrm P nl nn 
vJVv'^-lNUXIIlv-'UlvlIl 


5303 


501401 


2078 C19 2z43 269314 


F 


M0004262 ID* A03 

xtx vvvta vi x aj .rvv/j 


T TP 1 -MnrmPnl rm 


5304 


452662 


2078N19sz43 269325 


F 


M0004297 1 AD 12 

lTlVVUtA(7 1 111, A-f A 


T TP1 -NnrmPnl nn 

V7 V^«7 —X > Ul U1V/ V/XUXt 


5305 


512059 


2078 L21 ez43 269355 


F 


M00042064DC02 


T TPl-MnrmPnlon 


5306 


450805 


2078 021 ez43 269358 


F 


M00042975CA08 

XVA\J\J \J~ I t/V/./W/O 


T TPl-NnrmPoInn 

Vy V-/«7 X v/X 1X1 V/ V?XVJ11 


5307 


504007 


2078 B22 ez43 269361 


F 


M00042589BD04 


1 TP 1 -Norm Polnn 

KJ V>«7 1^1 UXXllVAJXVXIX 


5308 


450262 


2078 A23 ffz43 269176 


F 

X 


M000425 8 6B • A 09 

iYXvy V7\VT^^._^ OUJJ.xlV/7 


T TP 1 -TMnrm P nl nn 

\J v/ J n *J1 iilvxUXUXl 


5309 


452066 


2078 B23 ez43 269377 


F 


M00042589BE03 


T TPl-MnrmPnlnn 

V/ V-' J X > \JA lllV^UxVUl 


5310 


452459 


2078 K23 ffz43 269186 


F 

X 


M00042061BA02 


T TP 1 -MnrmPnl nn 
\J —IN \JX XIlv^UXUIl 


5311 


452506 

*T— J X—^J \J\J 


2078 L23 ffz43 269187 


F 

X 


M00042065AB01 

ivxuv/\/TA(7vJr\..x>v/j 


T TP 1 -Mnrm Pnl nn 

vJ V^--7 —IN \Ja lllV^V/lLUX 


5312 


448451 


2078 A24 cr741 269102 


F 

X 


M00O42586R- A 1 0 


T TP 1 -Nnrm Pnl nn 
U v>-7 -IN Ul illv^vlXv/XX 


5313 


7022 


2080 A01 pz41 260708 


F 

X 


M0004761 1 A- A01 


T TP 1 -Prim Pnl nn 
Uw x I XIxlv^UlUXl 


5314 

«✓ -J X"T 


521182 


2080 F01 &741 260712 


F 


M00041070TVfr1 0 


T TP 1 -Prim Pnl nn 
uv/j *ri iiiiv^uioxx 


5315 


2930 


2080 L01 <*t41 260710 


F 

X 


M00041 1 08BT) 1 9 


T TPl-PrimPnlnn 
KJ \~s~> "XT XXIII V^Ulv/XX 


5316 


454701 


2080 H02 0-z41 260711 


F 

X 


M00041001DF1 7 

lVlvvl/T'Ju7Ji/.r 1Z. 


T TP 1 -Prim Pnl nn 

Uw ri XIllv^UlL/U 


5317 


454226 


2080 102 0741 260717 


F 

A? 


M0ftn41007Tj-B1 9 


T TP 1 -Prim Pnl nn 


5318 


524624 

«^ VXZ»T 


2080 O02 fT741 260718 


F 

X 


M00041 1 10D- A 06 


T TPl-PrimPnlnn 
\J K^D"JT I UXXv^vllvul 


5310 


451220 


2080 R01 trail 760741 

^V/07.XJVl0.gZ*TJ AU7/*tl 


F 


A/T0nn476 1 4TV A 1 7 


1 TPl-Prin-tPnlnn 
U V-/ 3 - x OIlX V^OIOII 


5320 


451801 


7080 P01 0^41 260747 


F 
r 


M00n4761 X&-Ct(\& 


T TP 1 -Prim Pnl nn 
U vO "x llITl vAJlOIl 


5191 


45147ft 




F 


A/innnzr*na6 a ^07 


UL/J -x ITIllVxOxOIl 


5122 


524478 


90R0 ^tOI a^41 960746 

il«v07.VJUJ.gZ. I TJ /.U7 /*fO 


r 


IVXvUUHOU57UJJ .XxU / 


Uw -x IXllTv^OlOn. 


5323 


529219 


2080 H01 0741 760747 

^ v O 7 .IXv J ,gZi*f J Z«U7/t / 


F 


M00041004 A *F0 1 


T TPl-PrimPn1nn 
VJ KsD xl IlilV^vJlUIl 


5324 


454050 


7080 Pftl 0t41 760755 


F 


A/T00O41 1 44D-W01 

xVXV/v/V/HO X H'+X-/.Xx\7 X 


T TPl-PrimPnlnn 
U Vv ^ x IIITl v^OIUxX 


5325 


X 7700 


2080 F04 0r41 960760 


r 


lv400041080Tl-P1 1 


T TPl-PrimPnlnn 


5326 


523590 


7080 T04 0741 760765 


r 


M00041 1 07 A -frl 9 


T TPl-PrimPnlnn 

vJ V^-7 ~X 1 1111 V^OlUIl 


5327 


452016 


7080 1ST04 0741 760760 


r 


A/T00H41 1 17P-n07 


1 TP 1 -P rim Pnl nn 
ULJ x 1 xIIlv^UlOlT 


5328 


453761 


2089. J05.gz43_26978 1 


F 


M00043102B:F05 


UC3-PrimColon 


5329 


405102 


2089.A06.gz43_269788 


F 


M00042612A:B06 


UC3-PrimColon 


5330 


454720 


2089.L06.gz43_269799 


F 


M00043109C:F04 


UC3-PrimColon 


5331 


454438 


2089.N06.gz43_26980 1 


F 


M00043137D:D10 


UC3-PrimColon 


5332 


448985 


2089.D07.gz43_269807 


F 


M00043077B:H01 


UC3-PrimColon 


5333 


530656 


2089.G07.gz43_269810 


F 


M00043091C:H05 


UC3-PrimColon 


5334 


454134 


2089.L07.gz43_269815 


F 


M00043131B:A11 


UC3-PrimColon 
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Table 2 



SEQ 
ID 

NO ( 


XUSTEF 


SEQ NAME 


ORBE 
NT 


CLONE ID 


LIBRARY 


*j -j ~j -j 


454910 


2089 G08 ez43 269826 


F 


M00043091C:H08 


UC3-PrimColon 


5336 


417549 


2089 D09 ez43 269839 


F 


M00043077D:E12 


UC3-PrimColon 


5337 
j j j / 


454374 


2089G09ez43 269842 


F 


M00043091D:C01 


UC3-PrimColon 


5338 

J -J o 


454518 


2089 M09 ez43 269848 


F 


M00043134C:D06 


UC3-PrimColon 


5339 


573753 


2089 N09 ez43 269849 


F 


M00043138A:H03 


UC3-PrimColon 


5340 


577648 


2089D10ez43 269855 


F 


M00043077D:F04 


UC3-PrimColon | 


5341 


407775 


2089 N10 ez43 269865 


F 


M00043138B:B08 


UC3-PrimColon 


534? 


415376 


2089 O10 ez43 269866 


F | 


M00043141C:C12 


UC3-PrimColon 


5343 

J J*T«7 


453756 


2089P10ez43 269867 


F 


M00043146C:F10 


UC3-PrimColon 


5344 


4^3766 


?089 Jl 1 ez43 269877 


F 


M00043102D:F11 


UC3-PrimColon 


5345 


450787 


2089Kllez43 269878 


F 


M00043106B:F07 


UC3-PrimColon 


5346 


454X25 


2089Lllez43 269879 


F 


M00043131B:G10 


UC3-PrimColoii 


5347 

-7J*T / 


449335 


2089 J12.cz43 269893 


F 


M00043103A:G05 


UC3-PrimColon 


JJtO 


4*53572 


2089 K12 ez43 269894 


F 


M00043106C:D05 


UC3-PrimColon 


5340 

•7«7*Tj7 


454550 


2089 M12 ez43 269896 


F 


M00043135C:E07 


UC3-PrimColon 


5350 


73649 


2089N12ez43 269897 


F 


M00043138B:F02 


UC3-PrimColon 


5351 


440430 


7089F1 3^743 269905 


F 


M00043087B:D10 


UC3-PrimColon 


535? 


5781 34 


2089 H13 ez43 269907 


F 


M00043096A:E01 


UC3-PrimColon 


5353 


451391 


2089 L13 ez43 269911 


F 


M00043131C:A11 


UC3-PrimColon 


5354 


454196 


7089 M13 oz43 269912 


F 


M00043135D:A11 


UC3-PrimColon 


5355 


437064 

*T*7 / V/Vl*T 


2089N13ez43 269913 


F 


M00043138B:G11 


UC3-PrimColon 


5356 


454129 


2089 013 ez43 269914 


F 


M00043141D:A12 


UC3-PrimColon 


5357 
j j j # 


519109 


2089C14cz43 269918 


F 


M00043075A:B12 


UC3-PrimColon 


5358 

J J J o 


519641 


2089D14ez43 269919 


F 


M00043078C:C04 


UC3-PrimColon ! 


535Q 


576575 


2089 M 14 ez43 269928 


F 


M00043135D:C07 


UC3-PrimColon 


5360 

»7-7\J\7 


453783 


2089A15ez43 269932 


F 


M00042613A:H01 


UC3-PrimColon 


5361 


453533 


2089 115 ez43 269940 


F 


M00043100C:D08 


UC3-PrimColon 


536? 

~7«7 UZ, 


449394 

TT7J 


2089N15ez43 269945 


F 


M00043138C:D09 


UC3-PrimColon 


5363 


573732 


2089 D 16 ez43 26995 1 


F 


M00043078C:H05 


UC3-PrimColon 


5364 


395536 


2089 F16 ez43 269953 

^\/U7.X X s/ » jj* »~w ^•v^-'— 


F 


M00043088B:D07 


| UC3-PrimColon 


5365 


454509 


2089 H16 ez43 269955 


F 


M00043096C:D02 


UC3-PrimColon 


5366 


454562 


2089 E17 ez43 269968 

/.lilt 7 , 1 J 1 / wfm^t iTmJ Arf V/ ^ ^ U 


F 


M00043082D:B05 


UC3-PrimColon 


5367 

DD\j 1 


45 1 07? 

*tJ 17 / ^ 


9089 HI 7 az43 26997 1 

X>v07Xll 1 'O .. * * 


F 


M00043096C:H04 


UC3-PrimColon 


536R 


4^3776 

*vDD / JLSj 


70X0 K 1 7 cr743 269974 


F 


M00043107A:E07 


UC3-PrimColon 


536Q 


^97361 


7089 Ml 7 <*743 269976 

Z,U07. 1YX X / ,gii*TJ_jt.U7 7 ' v 


F 


M00043136A:D03 


UC3-PrimColon 


5370 


453783 


2089.A18.gz43_269980 


F 


M00042613C:F06 


UC3-PrimColon 


5371 


453385 


2089.B20.gz43_270013 


F 


M00042617C:B07 


UC3-PrimColon 


5372 


454177 


2089.G20.gz43_270018 


F 


M00043093A:A06 


UC3-PrimColon | 


5373 


801 


2089.I20.gz43 270020 


F 


M00043101A:F12 


UC3-PrimColon 


5374 


453494 


2089J20.gz43_270021 


F 


M00043104B:C09 


UC3-PrimColon 


5375 


453202 


2089.P20.gz43_270027 


F 


M00043149B:A01 


UC3-PrimColon 


5376 


520616 


2089.F2Lgz43_270033 


F 


M00043089A:D06 


UC3-PrimColon 
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PCT/US01/25840 



Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


OKIE 
NT 


CLONE ID 


LIBRARY 


5377 


527873 


2089 H21 ez43 270035 


F 


M00043 097 A ■ D 1 1 

XVXV/V/V/t J V7 / rv.i/ X X 


T Tf^-PrimPnlrm 


5378 

— '— J / o 


450566 


9089 K91 a743 970038 


F 
r 


Mf)004^ 1 07D-M04 


Uv/j -i'ixiiiv-/Oion 


537.9 


520074 


9089 A22 az43 970044 


17 

X 


M0004961 4 A -P04 


ULj - Jt IxIilv^OlOxl 


5380 


518566 


9089 C22 &743 970046 


F 


M00043076 A • A 09 


T TP ^ -Prim Pol on 


5381 


387530 

JO / *JJ\J 


2089H99«*743 970051 


F 


M00043007AF06 


T TP ^ -PrimPol on 
IJVy-> ~x xilTi o ion 


5382 


453846 


2089 T22 ez43 970052 


F 

X 


M00043 1 0 1 C *F 1 9 


T TP ^ -Prim Pol on 
vJ -XT XXxTlv>010Il 


5383 


454483 


2089 T 99 ot43 970055 


F 

X 


M0004^ 1 39^*1^09 


T TP^-PrimPolon 


5384 


524157 


2089 A23 ff743 970060 


F 

X 


\f 000496 1 4 A -H09 


T TP 3 -Prim Pol on 


5385 


453132 


9089 K93 ^z43 970070 


F 

X 


1U00043 1 08R- A01 


T TP^-PrimPolon 
Uw ~x IiIIlv^UlOIl 


5386 


524622 


2089 N23 ez43 270073 


F 

X 


M00043139PA05 


T TP^ -Prim Pol on 

\J D"ST 1 XlII\_y01011 


5387 


447446 


2089 A24 ez43 270076 


F 

X 


M0004961 4r^R09 


T TP^ -PrimPolon 
KJ \— -J xx xf 11 V^OlUll 


5388 


594470 


9089 F94 or43 970081 


F 

x^ 


M0004^ 000"R*FT06 


T TP 3 -Prim Pol on 
Uvj ""Jr IxIxlV^OlOIl 


5389 


446747 


2066 1M03 a743 970430 


F 


M000 5 5 3 09 A -FT06 


T TP9_Po1onAA*=»tT ivAr 

u z ~ oi oiiivxc LLii vcr 


5300 


554008 


9066 P07 oy43 970505 


F 


1V/T00O5 5^QA"n -F0^ 


TTP9-PolonlX/T/atT iTfPr 


5391 
~j ~j ^ x 


551 905 


9066 F08 cr743 9705 1 1 


F 

x^ 


M0005 53 8^"R-1400 


TTP9-Po1onlV/rf*fT t*\/*»r 


5309 


561 395 


9066 FT08 <jv43 970513 


F 


IVylOOO 5 5 3 8 5 P -F06 


TTP9-Po1onA/f*»tT nrt*r 




553944 


9066 T08 o^d.3 970515 
ZUOO.JUo.gZHO^Z 1J 


F 
r 


K/rn0055^88 A* A 00 


TTP7 r^/^1i^-r»A/fo+T iitar 

u i^z-v^oioiiivieiJLfiver 




569876 


9066 nOO o^43 970595 


F 


A/inoo5 5*3 a i r* -ruY* 


TTPO-P/xInnlMWT i-*T£*r 

u v^z- v^oioiixvicTxwi ver 


5^Q5 


569916 


9066 1700 ctt43 970597 
zuoo.r oi?.gz*Tj z / u->z i 


F 


A/fn0O55^8^r t - AH8 


u v^z - oionivxcu-jiver 


5**06 


56O048 


0Clf%& TOO ot43 970531 
ZOOO.JOy.gZ*IO Z, /UJD x 


F 
x^ 


K/Tnn055^88 A -R06 
JVIUUU-/ J jooA.DUO 


u z - v^oionivxeix-»i ver 


5^Q7 


446531 


9066 POO cy-743 970537 


F 


Ayfn0055^O5 A«r*09 


TTPO_P^1rtnA4"<a+T nrat> 


5^08 


556703 


9066 1^10 970548 


F 

JT 


TUn0O55^88Ti-F1 1 


TTP7-Pi%1rtnAA<*fT iiro** 

u v^z- v^oioiuvieixwiver 


5^QO 


4.00600 


9066 A/NO o^43 970550 

ZOOO.±Y±10.gZ*IO Z/UJJU 


F 


M00055^01 Fl-n05 


TTPO-PAlnnAAofT iir/>r 
U V^Z-^OIOIUVlClJLflVcr 


5400 


558907 


9066 Tl 1 <v'y43 970569 

ZUOO.ll l.gZ^tj Z/uJOZ 


F 


aahoo 55^8 6*n *oo9 


TTP7 Pi^l^nA^^+T i-iror 

u wz-v^oioxxivicuL/iver 


5401 


453508 


9066 Ml 1 cr743 970566 

ZUUU.1V1 1 1 .gZt j Z/vJUD 


F 


Mfi0O55^O1R-Ti07 


T TP9-Po1onlV/IXT i-irp»r 
U V^Z-V^OlOxxxyietLflVcr 


5409 


551640 

«»/.7 J.U*tU 


90661SJ11 at743 970567 


F 

JT 


M000553Q9P-007 

1V1UUOJ J-J^ZVy.VJO / 


TTP9-Po1onA/ff»tT ix/*»r 
U \^Z"*V>010IXiYlCLLflVCX 


540^ 


569085 


9066P19ar43 970579 

ZvUU.V/lZ.gZHj Z /U-J /z 


F 


M0005 5^8 OFi ■ W09 
iVIUUV D JJO uiv .nuz 


TTP9-Po1onlV/rp+T i\/p»r 
U V^Z-V^OxOxliVxCLLixVCr 


5404 

»x"tO*t 


446300 

*rtUJ77 


9066F19ar43 970574 

ZUOU.EflZ.gZHj Z/V/-7/*T 


F 

x^ 


M0005 5*3 89 P -1406 


TTP9-Po1onlV>ff«fT iv#*r 
u v^z~v^oioxxivicLi-*ivcr 


5405 

»7*TO-J 


567408 


9066 Pr19 tr?43 970576 


F 


Mn0055' : ?84R-Tj10 


T TP9-PnlonA/ff*tT ivpr 
u ~ OlOiXiVXC UU I V CI 


5406 

»/"TOO 


569594 


9066 A13 az43 970586 


F 


M0005 5 378X1^104 


TTP9-Polon*N/TptT iwr 

U>^Z V^OlOxxlVlCLXjlVCI 1 


5407 


465836 


2066J13a743 970595 


F 

X 


M00O55388RT509 

lVlUvv/J JJOOD.DvZ 


TTP9-PnlonMptT ivpr 

\J \~r OlOlXl VXC LX_rX V Cl 


5408 


569701 


9066 "Ml 3 <*743 970590 


F 
r 


M00O553O9n-A06 


TTP9-Po1onTV4"**tT iv*»r 


5400 


561457 


9066 Ol 3 o-t-43 9706OO 

ZUUU , O J. J .gZ*r j Z/UOOO 


F 


A>rn0O55^O4R-P06 


TTP9-Po1onM**tT iv*»r 
U V^Z*^^010IMVIOlJLflVCr 


541 0 


540089 


9066 CI 4 o^43 970604 


F 

X* 


A>T00055^81 A-F09 


TTP9-PolonA/T**tT tf/^r 

u v^z-v^oionivxciJLrxvcr 


541 1 
jt i x 


41 8699 


9066 Ml 4 0^43 970600 


F 


M00O55^85D-ri0^ 

1YIUUU <-> J J OJ XV .LJKJO 


TTP9-PolonlV/r#*fT v\tc*t 
\J V-/Z- V^OlOIlxVxCULflVCr 


5412 


238196 


2066 K14 ez43 270612 


F 


M00055389AD08 


UC2-ColonMetLiver 


5413 


560868 


2066.D15.gz43 270621 


F 


M00055381D:D08 


UC2-ColonMetLiver 


5414 


459764 


2066.E15 .gz43_270622 


F 


M00055382D:D04 


UC2-ColoixMetLiver 


5415 


555639 


2066.M15.gz43 270630 


F 


M00055391B:H08 


UC2-ColoiiMetLiver 


5416 


556011 


2066.O15.gz43 270632 


F 


M00055394B:D08 


UC2-ColonMetLiver 


5417 


468565 


2066.A17.gz43 270650 


F 


M00O55379A:DO7 


UC2-ColoiiMetLiver 


5418 


448949 


2066.B17.gz43 270651 


F 


M00055380A.C06 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


OKIE 
NT 


CLONED) 


* 

LIBRARY 


5419 


636532 


2066J17.gz43_270659 


F 


M00055388B:E01 


UC2-ColonMetLiver 


5420 


561487 


2066.P 1 7.gz43_270665 


F 


M00055395B:C04 


UC2-ColonMetLiver 


5421 


50604 


2066.L18.gz43_270677 


F 


M00055390C:E06 


UC2-ColonMetLiver 


5422 


557249 


2066.J20.gz43_270707 


F 


M00055388B:H04 


UC2-ColonMetLiver 


5423 


562849 


2066.H2 1 .gz43_27072 1 


F 


M00055386AG05 


UC2-ColonMetLiver 


5424 


562386 


2066.C22.gz43_270732 


F 


M00055381B:C10 


UC2-ColonMetLiver 


5425 


635965 


2066.J22.gz43_270739 


F 


M0O055388C:D01 


UC2-ColonMetLiver 


5426 


493487 


2066.D23 .gz43_270749 


F 


M00055382B:E02 


UC2-ColonMetLiver 


5427 


44015 


2066.N23 .gz43_270759 


F 


M00055393B:F04 


UC2-ColonMetLiver 


5428 


559854 


2066.F24.gz43_270767 


F 


M00055383D:H11 


UC2-ColonMetLiver 


5429 


282015 


2066.K24.gz43_270772 


F 


M00055389C:F12 


UC2-ColonMetLiver ! 


5430 


561206 


2067.F02.gz43_270799 


F 


M00055402AA05 


UC2-ColonMetLiver 


5431 


486834 


2067.G02.gz43_270800 


F 


M00055402DA11 


UC2-ColonMetLiver 


5432 


550730 


2067.K02.gz43_270804 


F 


M00055407B:G07 


UC2-ColonMetLiver 


5433 


560838 


2067.P03 .gz43_270825 


F 


M00055412DA12 


UC2-ColonMetLiver 


5434 


556382 


2067.E04.gz43_270830 


F 


M00055400D:G01 


UC2-ColonMetLiver 


5435 


561236 


2067.105 .gz43_270850 


F 


M00055405AA02 


UC2-ColonMetLiver 


5436 


562584 


2067. J05 .gz43_27085 1 


F 


M00055406B:E07 


UC2-ColonMetLiver 


5437 


550135 


2067.N05.gz43_270855 


F 


M00055410C:G08 


UC2-ColonMetLiver 


5438 


556380 


2067.B06.gz43_270859 


F 


M00055397AH07 


UC2-ColonMetLiver 


5439 


511792 


2067.C06.gz43_270860 


F 


M00055398C:F07 


UC2-ColonMetLiver 


5440 


452761 


2067.106.gz43_270866 


F 


M00055405AC01 


UC2-ColonMetLiver 


5441 


465284 


2067.K06.gz43_270868 


F 


M00055407C:E04 


UC2-Ck)lonMetLiver ! 


5442 


419489 


2067.107 .gz43_270882 


F 


M00055405AG11 


UC2-ColonMetLiver 


5443 


540000 


2067.P07.gz43_270889 


F 


M00055413AA02 


UC2-ColonMetLiver 


5444 


556105 


2067.K08 .gz43_270900 


F 


M00055407C:G04 


UC2-ColonMetLiver 


5445 


562001 


2067.N08 .gz43_270903 


F 


M00055410C:H11 


UC2-ColonMeULiver 


5446 


288626 


2067.N09.gz43_2709 19 


F 


M00055410D:B02 


UC2-ColonMetLiver 


5447 


561325 


2067JH10.gz43_270929 


F 


M00055404AB10 


UC2-ColonMetLiver 


5448 


554742 


2067.11 0.gz43 270930 


F 


M00055405B:H05 


! UC2-ColonMetLiver 


5449 


560003 


2067.A1 1 .gz43_270938 


F 


M00055396B:C06 


UC2-ColonMetLiver 


5450 


555856 


2067.G1 1 .gz43_270944 


F 


M00055403AC07 


! UC2-ColonMetLiver 


5451 


633189 


2067.N 1 1 .gz43_27095 1 


i F 


M00055410D:G01 


UC2-ColonMetLiver 


5452 


562147 


2067.B12.gz43_270955 


F 


M00055397DA01 


UC2-ColonMetLiver 


5453 


553766 


2067.H12.gz43_27096 1 


F 


M00055404AD08 


i UC2-ColonMetLiver 


5454 


63669 


2067.113. gz43_270978 


F 


M00055405C:C04 


UC2-ColonMetLiver 


5455 


491260 


2067J13.gz43_270979 


F 


M00055406DA03 


UC2-ColonMetLiver 


5456 


559776 


2067.L13.gz43_270981 


F 


M00055408C.E04 


UC2-ColonMetLiver 


5457 


561922 


2067.A14.gz43_270986 


F 


M00055396B:G02 


UC2-ColonMetLiver 


5458 


559102 


2067.I14.gz43 270994 


F 


M00055405C.H07 


UC2-ColonMetLiver 


5459 


440833 


2067.O14.gz43_271000 


F 


M00055412A:C05 


I UC2-ColonMetLiver 


5460 


561212 


2067.K15.gz43_271012 


F 


M00055408AA05 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 

NO ( 


XUSTEF 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


5461 


556734 


2067 015 ez43 271016 


F 


M00055412A:F09 


UC2-ColoiiMetLiver 


5462 


487893 


2067J16az43 271027 


F 


M00055406D:C10 


UC2-ColonMetLiver 


5463 


558180 


2067G18az43 271056 


F 


M00055403B:A10 


UC2-ColoxiMetLiver 


5464 


557853 


2067 J18 &z43 271059 


F 


M00055406D-G12 


UC2-ColonMetLiver 


5465 


446171 


2067 P19 ffz43 271081 


F 


M00055413CB09 


UC2-ColoiiMetLiver 




561438 


2067 B20 ffz43 271083 


F 


M00055398AC11 


UC2^olonMetLiver 


5467 

»7*TvJ / 


551617 


2067 120 ez43 271090 


F 


M00055405D:G05 


UC2-ColoixMetLiver 


5468 


46085? 


2067 EP1 ffz43 271099 


F 


M00055398BA05 


UC2-ColonMetLiver 


5469 


556126 


2067P21ez43 271113 


F 


M00055413C*G09 


UC2-ColonMetLiver 


5470 


551928 


2067 F22 <^z43 271119 


F 


M00055402C:C12 


UC2-ColonMetLiver | 


5471 


562236 


2067 A23 cz43 271130 


F 


M00055397A:B10 


UC2-ColonMetLiver 


5472 


456687 


2067B23ez43 271131 

^.V/V7 / . -L/^r^ . f2)£t~~J £*t 1 X X «7 X 


F 


M00055398B-C05 


UC2-ColonMetLiver 


5473 

«7*T / J 


530715 


2067H23ez43 271137 


F 


M00055404CC11 


UC2-ColoiiMetLiver 


5474 

,7*T / T 


66014 


2067K23ez43 271140 


F 


M00055 408 A:F 1 2 


UC2-ColoixMetLiver 


5475 


100821 

X Vv OL J. 


2067L23ez43 271141 


F 


M00055409A:E06 


UC2-ColoixMetLiver 


5476 


154980 


2067 A24 ez43 271146 


F 


M00055397A:C06 


UC2-ColonMetLiver 


5477 


561236 

~J\J 1Z. JVJ 


2067 G24 ffz43 271 152 


F 


M00055403B-G12 

1TXW Wk/«/ ■ -J X^ « XtaJ X 


UC2-ColonMetLiver 


5478 


516043 


2067 L24 ez43 271157 


F 


M00055409A:E10 


UC2-ColonMetLiver 


547Q 


550380 


2068 K01 ffz43 271172 


F 


M00055424BD04 


UC2-ColoxiMetLiver 


5480 


440705 

117 / .7-7 


2068 003 ez43 271208 


F 


M00055474A:G06 


UC2-ColonMetLiver 


5481 


550166 


2068 D04 ez43 271213 


F 


M00055417CA05 


UC2<k)loxiMetLiver 


5482 


561646 


2068 F04 ez43 271215 


F 


M00055419DD04 


UC2-ColoixMetLiver 


5483 


5611 12 

JUH XL 


2068 D05 ez43 271229 


F 


M000554 1 7C G03 


UC2-ColoiiMetLiver 


5484 


418562 


2068 E05 ez43 271230 


F 


M00055418CD08 


UC2-ColonMetLiver 


5485 


550385 


2068 J05 ffz43 271235 


F 


M00055423 C *D05 


UC2-ColonMetLiver 


5486 


56141 1 


2068 M05 az43 271238 


F 

X 


M00055472AB 12 

ITIUvUJ*/" / Z>/X«JU A X* 


UC2-ColonMetLiver 

Vri/ X-h>^^ vVXVlXlfXVvXJX ▼ vx 


5487 


56 1 770 

JO X / / \J 


2068 C07 a743 271260 


F 


M000554 1 6B*F 1 1 

1VXV/ V7 UJ ~J^T X V/JJ .X XX 


UC2-Q>loxxMetLiver 

Vrf/ V/XV V^\/X\/X X,l T1VIXJX ▼ WX 


5488 

J*tOO 


557760 


2068 Tj08 0743 271277 


F 


M00055417DD08 

ItIuuvjjti / x«^ .x/ vy tj 


UC2-ColoiiMetLiver 


548Q 


468672 


2068 M08 az43 271286 


F 


M00055472AF02 


UC2-ColonMetLiver 


540ft 


556400 


2068N08ffz43 271287 


F 


M00055473BD01 

iVXVfV/ V/«7«7^T / JU.^/wl 


UC2-ColoiiMetLiver 


54Q1 


55001 8 

J JV/V X o 


2068 F00 oz43 271295 

Z<UV7U .X U7,ft4i*TJ x» / 1x7^ 


F 


M000554 1 9D *H07 

J-VX V/ V/V/»7— /~ X 3—r .X XX/ » 


UC2-ColotiMetLiver 


5409 


561837 


2068 H00 az43 271297 


F 


M0005 542 1 CF06 

1V1 v V V«/ «/Ta X v^»x W* 


UC2-Q>lonMetLiver 


5403 


554028 


2068 100 az43 271298 


F 


M00055422CE10 


UC2-ColonMetLiver 


5404 


470667 


2068 D10 0-7-43 271300 


F 


M0005541 7r>H08 

IVXV/VVJ JTl / X-^ .X XV/ «J 


UC2-ColoxiMetLiver 

V** vX/XVXUTAVVXwX ▼ W-*« 


5405 


552561 


2068F10&743 271310 

H-/V7l7.Xj X 1Z1 l<7lv 


F 

X 


M000554 1 8DA03 


UC2-ColoiiMetLiver 


5496 


556793 


2068.G10.gz43_271312 


F 


M00055421A:A09 


UC2-ColonMetLiver 


5497 


561718 


2068.N10.gz43_271319 


F 


M00055473B:E10 


UC2-ColoiiMetLiver 


5498 


630348 


2068.Ell.gz43_271326 


F 


M00055418D:B05 


1 UC2-ColoiiMetLiver 


5499 


491728 


2068.Hll.gz43_271329 


F 


M00055421C:G07 


UC2-CoIonMetLiver 


5500 


562477 


2068.C12.gz43_271340 


F 


M00055416D:B03 


UC2-ColonMefLiver 


5501 


402836 


2068.D12.gz43_271341 


F 


M00055418A:C12 


UC2-ColonMetLiver 


5502 


556511 


2068J12.gz43 271347 


F 


M00055424A.A08 


UC2-ColonMetLiver 
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3NSDOCID: <WO 0214S00A2_I_> 



WO 02/14500 



PCT/US01/25840 



Table 2 



SEQ 
ID 

NO ( 


XUSTEF 


SEQ NAME 


ORIE 
NT 


CLONE ID | 


LIBRARY 


5503 


559076 


2068.N12.gz43_271351 


F 


M00055473C:A05 


UC2-ColonMetLiver 


5504 


417426 


2068.P12.gz43_271353 


F 


M00055475D:A05 


UC2-ColonMetLiver 


5505 


456520 


2068.A13.gz43_271354 


F 


M00055414C:C02 


UC2-ColonMetLiver 


5506 


492982 


2068J13.gz43_271363 


F I 


M00055424A:C10 


UC2-ColonMetLiver 


5507 


558679 


2068.C14.gz43_271372 


F 


M00055416D:D11 


UC2-ColonMetLiver 


5508 


561707 


2068 .F 14 .gz43_27 1 375 


F 


M00055420A:E12 


UC2-ColonMetLiver 


5509 


561180 


2068.114.gz43_271378 


F 


M00055423A:A10 


UC2-ColonMetLiver 


5510 


554953 


2068.K14.gz43_271380 


F 


M00055425A:H03 


UC2-ColonMetLiver 


5511 


492876 


2068 X 1 4 .gz43_27 1381 


F 


M00055471C:B01 


UC2-ColonMetLiver 


5512 


555933 


2068 .0 14.gz43_27 1384 


F 


M00055474C.H12 


UC2-ColonMetLiver 


5513 


562137 


2068 P 1 4.gz43_27 1385 


F 


M00055475D:A10 


UC2-ColonMetLiver 


5514 


561500 


2068 .B 15 .gz43_27 1 3 87 


F 


M00055415D:C12 


UC2-ColonMetLiver 


5515 


561682 


2068.D15.gz43_271389 


F 


M00055418A:E07 


UC2-ColonMetLiver | 


5516 


560118 


2068 .1 15 .gz43_27 1 394 


F 


M00055423A:D02 


UC2-ColonMetLiver j 


5517 


5 12392 


2068 J 1 5 .gz43_27 1 3 95 


F 


M00055424A:F04 


UC2-ColonMetLiver 


5518 


449356 


2068 J16.gz43_27 1410 


F 


M00055423A-.G08 


UC2-ColonMetLiver 


5519 


447098 


2068.L16.gz43_271413 


F 


M00055471C:D04 


UC2-ColonMetLiver 


5520 


522220 


2068.N16.gz43_271415 


F 1 


M00055473C:H05 


UC2-ColonMetLiver 


5521 


453715 


2068.P16.gz43_271417 


F 


M00055475D:B07 


UC2-ColonMetLiver 


5522 


554774 


2068.D17.gz43_271421 


F 


M00055418A:H04 


UC2-ColonMetLiver 


5523 


553987 


2068.117.gz43_271426 


F 


M00055423BA.06 


UC2-ColonMetLiver 


5524 


561645 


2068.K17.gz43_271428 


F 


M00055425B:D12 


UC2-ColonMetLiver 


5525 


549511 


2068 .M 1 8 .gz43_27 1446 


F 


M00055472C:G11 


UC2-ColonMetLiver 


5526 


560700 


2068 .P 1 8 .gz43_27 1449 


F 


M00055475D.G08 


UC2-ColonMetLiver 


5527 


493410 


2068.K20.gz43_271476 


F 


M00055425B:F06 


| UC2-ColonMetLiver 


5528 


482425 


2068 . A2 1 .gz43_27 1482 


F 


M00055414D:G10 


UC2-ColonMetLiver 


5529 


561497 


2068.B2 1 .gz43_27 1483 


1 F 


M00055416A:C04 


UC2-ColonMetLiver 


5530 


131130 


2068.C2 1 .gz43_27 1484 


F 


M00055417A:G11 


UC2-ColonMetLiver 


5531 


524736 


2068.F22.gz43_271503 


F 


M00055420C:E08 


UC2-ColonMetLiver 


5532 


58680 


2068.G22.gz43_271504 


F 


M00055421B-.E05 


UC2-ColonMetLiver 


5533 


562302 


2068.N22.gz43_2715 1 1 


F 


M00055474A:B05 


UC2-ColonMetLiver 


5534 


514824 


2079M01gz43 271558 


F 


M00043021D:F10 


UC3-NormColon 


5535 


452523 


2079.P02.gz43 271577 


F 


M00043031A:B01 


UC3-NormColon 


5536 


453572 


2079K04ez43 271604 


F 


M00043012A:F06 


UC3-NormColon 


5537 

~J -J ~s 1 


406092 


2079 105 sz43 271618 


F 


M00043007D:F07 


UC3-NormColon | 


5538 


452759 


2079.O05.gz43 271624 


F 


M00043029A:E11 


UC3-NormColon 


5539 


456756 


2079.B06.gz43 271627 


F 


M00042983D:A03 


j UC3-NormColon 


5540 


505226 


2079.C06.gz43 271628 


F 


M00042987C:E04 


UC3-NormColon 


5541 


238196 


2079.D06.gz43 271629 


F 


M00042991C:C02 


UC3-NormColon 


5542 


235874 


2079.H06.gz43 271633 


i f 


M00043005A:B06 


UC3-NonnColon 


5543 


512432 


2079.007 .gz43 271656 


F 


M00043029B:C09 


i UC3-NormColon 


5544 


269927 


2079.O09.gz43 271688 


F 


M00043029B:F09 


UC3-NormColon 
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1NSDOCID: <WO 021 4500A2_I_> 



WO 02/14500 



PCT/US01/25840 



Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


5545 


449510 


2079.K10.gz43 271700 


F 


M000430 12D* A06 

JLTlv Vv ••/V X m «* IW 


UC3-NormColon 


5546 


516799 


2079O10ez43 271704 

X-r Vr / — ' a Vj* X Vr « f~i ■ * ■ l_ «W / X # V/ T 


F 


M00043029BG10 


UC3 -NormColon 


5547 


455716 


2079 Ell ez43 271710 


F 


M00042995CE04 


UC3-NormColon ! 


5548 


452976 


2079 Gil ez43 271712 


F 


M00043003AH07 


UC3-NormColon 


5549 


45203 1 


2079 Nil ez43 271719 


F 


M00043026DD06 


UC3 -NormColon 


5550 


452957 


2079 Pll ez43 271721 

X^ V/ # . JL. XX* » > ««/ aw # X / Xr X 


f ; 


M00043032BH01 


UC3-NormColon 


5551 


452830 


2079D12ez43 271725 

X*r Vr § — ' • 1 ^ X «w » C-j-CJ 1 X# 9X9 ft<t/ 


F 


M00042991D-F06 


UC3-NormColon 

W*' Jk " Vr JL X IvX^^rVyXVrXJL 


5552 


452220 


2079 C13 ez43 271740 


F 


M00042988DG10 


UC3-NormColon 


5553 


500896 

^» \f V V V/ 


2079 K13 sz43 271748 


F 


M00043013B:A06 


UC3-NormColon 


5554 


455855 


2079.L13.gz43 271749 


F 


M00043017D.B06 


UC3«"NormColon 


5555 


513381 


2079.M13.gz43 271750 


F 


M00043023D:D06 


UC3-NormColon 


5556 


448200 


2079 B14 ez43 271755 


F 


M00042985B:C05 


UC3-NormColon 


5557 


509027 


2079 F14 ez43 271759 


F 


M00043000CH06 


UC3-NormColon 


5558 


452618 


2079 N14 ez43 271767 


F 


M00043027BC04 


UC3-NormColon 


5559 


510254 


2079 C15 sz43 271772 

X^ Vr / y ( V X igXi i «^ iV / JL / / X* 


F 


M00042989CA06 


UC3~NormColon 


5560 


451812 


2079 D15 ez43 271773 


F 


M00042992B B03 


UC3-NormColon 


5561 


452392 

^T*J £*~J 7x- 


2079 G15 ez43 271776 


F 


M00043003CA02 


I JC3 -NormColon 


5562 


504812 


2079 115 ez43 271778 

X*VS / 7 .X X >/ . r ' ■ »v 1 X 1 1 U 


F 


M00043008DE12 


UC3-NormColon 


5563 


504431 


2079 J15 ffz43 271779 

.X»Vs / .J X « fo£dT^J / X / / ^ 


F 


M00043011BD03 


UC3 -NormColon 


5564 


452440 


2079 L15 az43 271781 


F 


M00043020AA03 


UC3-NormColon 


5565 


450225 


2079 B16 ez43 271787 


F 


M00042985C-B03 


! UC3 -NormColon 


5566 


511348 


2079 CI6 ez43 271788 


F 

X 


M000429 89C *B07 


T JC3-NormColon 


5567 


502343 


2079 D16 ez43 271789 


F 


M00042992BB 1 1 


UC3-NormColon 


5568 


451850 


2079 F16 ffz43 271791 


F 


M0004300 1 AC 1 0 


UC3-NormColon 


5569 


455117 


2079 K16 ez43 271796 


F 


M000430 1 3 C AO 1 


UC3 -NormColon 


5570 




2079 L16 ez43 271797 


F 


M00043020AG 1 2 


UC3-NormColon 

V— / X l V/X XiLV/V/XvXX 


5571 


504501 


2079 B17 ez43 271803 

i y . xj x i «q* i luvj 


F 


M00042985CD02 


UC3-NormColon 


5572 


503122 


2079 F17 ez43 271807 


F 

X 


M00043001BC01 

IVIvUV/tJvvIJJ.v/v X 


TJC3 -NormColon 


5573 


508126 


2079 117 ez43 271810 


F 


M00043008DH09 


UC3-NormColon 


5574 


448687 


2079 A18 ffz43 271818 


F 


M00042982CG02 


UC3-NoimColon 

W X T V/X lilV/VAVii 


5575 


451491 


2079 B18 ez43 271819 


F 


M00042985C D04 


UC3-NormColon 


5576 


452969 


2079 F18 sz43 271823 


F 


M00043001BE02 

1 YX V w vi */ Vr V/ X Xj* »Jlp^\/^ 


UC3-NormColon 


5577 


452866 


2079 G18 ez43 271824 


F 


M00043003C-G10 


UC3-NormColon 

W ^^*r i 1 VI XXX vxx 


5578 


446936 


2079 K18 ez43 271828 


F 


M00043013DF03 


UC3-NormColon 

V-/ v_^/ w x ^ V/X XXX WvAVXI 


5579 


452445 


2079L18ez43 271829 


F 


M0004302 1 A* A09 


UC3-NormColon 


5580 


451092 


2079.N18.gz43_271831 


F 


M00043027D:G12 


UC3-NormColon 


5581 


75212 


2079.D19.gz43_271837 


1 F 


M00042992D:H02 


UC3-NormColon 


5582 


452325 


2079.F19.gz43_271839 


F 


M00043001B:H01 


UC3-NormColon 


5583 


502984 


2079J19.gz43_271842 


F 


M00043009B.C09 


UC3-NonnColon 


5584 


508125 


2079 J19.gz43_271 843 


1 F 


M00043011C:H09 


UC3-NormColon 


5585 


452528 


2079X19.gz43_271845 


F 


M00043021A:B06 


UC3-NormColon 


5586 


509505 


2079.C20.gz43_271852 


F 


M00042989DA07 


UC3-NormColon 
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3NSDOCID: <WO 0214500A2_I_> 



WO 02/14500 



PCT/US01/25840 



Table 2 



SEQ 
ID 
NO 


XUSTEF 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


SSR7 


4SS139 


2079 G20 ez43 271856 


F 


M00043003D:D10 


UC3 -NormColon 


SS88 




2079 T20 oz43 271858 


F 


M00043009B:F11 


UC3-NontiColon 


SSRQ 




9070 M90 0743 971862 


F 


M00043024B-H11 


UC3-NonnColon 


sson 




9070 C91 0743 271868 


F 

X 


M00042990A-E05 


UC3-NormColon 


SSQ1 

DDZ7 X 


SI ^ 1 7R 

I/O 


9070 TV? 1 or43 271869 


F 


M00042993 A*D 12 


UC3-NormColon 


SSQ9 


4S^804 


2079 E21 ffz43 271870 


F 


M00042996D:D12 


UC3-NormColon 


SSO'} 

JJ7J 


4S^07R 


9070 T 91 crr43 271877 


F 


M00043021A:H06 


UC3-NormColon 


SSQ4 


4S9£1 1 

*t DmC\J x JL 


9070 P91 0743 271881 


F 


M00043033CC05 


UC3 -NormColon 


SSQS 




2079 A22 ez43 271882 


F 


M00042983A:E09 


UC3-NormColon 


JJ7O 


44744S 


2079 B22 ez43 271883 


F 


M00042986B:E06 


UC3-NormColon 


SS07 


S0^69S 


2079G22oz43 271888 


F 


M00043004A:C08 


UC3-NormColon 


SSQ8 


S09614 


2079 J22 2z43 271891 


F 


M00043011D:C12 


UC3-NormColon | 


ssoo 


4S4796 


2079K22ez43 271892 


F 


M00043014B:G12 


UC3-NoimColon 


s^oo 




2070 T/22 f>743 271893 


F 


M00043021B:H03 


UC3-NonnColon 


S601 


4S67SS 


2079 1^22^43 271894 


F 


M00043025A:C01 


UC3-NonnColon 


s^09 


SI SI 97 


2079 022 £z43 271896 


F 


M00043030C:F03 


UC3-NonnColon 


s^oi 


S 074 SO 


2079 F23 ffz43 271903 


F 


M00043002A:G09 


UC3-NormColon 


S£04 


44$n81 


2079 G23 ffz43 271904 


F 


M00043004A:C10 


UC3-NormColon 


s^ns 


S0S0^ 


7070 K23 ez43 271908 


F 


M00043014C:F07 


UC3-NormColon 


S£06 


4S9648 


2079 M23 ffz43 271910 


F 


M00043025A:D09 


UC3 -NormColon 


S£H7 


AS^O^R 


?070N23pz43 271911 


F 


M00043 028 C :H09 


UC3-NormColon 


JOU5 


4S9£1 1 


9070 P93 or43 771013 


F 


M00043033CD05 


UC3-NonnColon 


JOU!/ 


SOI S34 


9070 R74 ot43 271 01 5 


F 


M00042986CB07 


UC3-NormColon 


s^ 1 0 


ASAftAO 


9070 F?4 <r^43 271 01 9 


F 


M00043002BE06 


UC3-NormColon 


jOI 1 


JOODU / 


9070 ^94 a74^ 771 0? 1 


F 


M00043007BD12 


UC3-NormColon 




S070A^ 


9070 K94 or43 971024 


F 


M00043014CG06 


UC3-NormColon 




J17J / O 


9000 R01 a^43 973008 

Zv/7U.Dvl.gZ<*tJ / J7UO 


F 


M00043153BB09 


UC3-PrimColon 






9000 M0 1 o-r4^ 97^014 


X 


M00043 172C-G06 


UC3-PrimColon 


S£l S 

jO JL j 


S97£7Q 


9000 T 01 az43 973018 


F 


M00043 1 85BD02 


UC3-PrimColon 


JO 10 


4S44fV3 


9000 P01 0743 273922 


F 


M00043201D:D03 


UC3-PrimColon 


S£1 7 


S9 1 RAO 


9000 H09 0-74^ 97^096 


F 

X 


M00043160BE05 


UC3-PrimColon 


S£1 ft 


JZ070 X 


9000 M09 (7743 273935 


F 


M00043189C:F08 


UC3-PrimColon 


SA10 


S9RAOA 


9000 ro^ »743 973041 


F 

X 


M00043156CE05 


UC3-PrimColon 


S690 


S93674 


9000 N04 a743 973068 


F 


M00043193CG11 


UC3-PrimColon 


S£91 


AAA7AR 


9O00 F0S 074*3 97^076 


F 

X 


M00043 168CF03 


UC3-PrimColon 


5622 


128749 


2090.H06.gz43_273994 


F 


M00043172D:H03 


UC3-PrimColon 


5623 


525781 


2090.K06.gz43 273997 


F 


M00043183C:B08 


UC3-PrimColon 


5624 


452775 


2090.N06.gz43_274000 


F 


M00043193C:H09 


UC3-PrimColon 


5625 


524721 


2090.L07.gz43 274014 


F 


M00043186C:A11 


UC3-PrimColon 


5626 


454202 


2090.M07.gz43 274015 


F 


M00043190A:B10 


UC3-PrimColon 


5627 


452325 


2090.J08.gz43 274028 


F 


M00043180C:B02 


UC3-PrimColon 


5628 


453804 


2090.F09.gz43_274040 


F 


M00043169A:F05 


UC3-PrimColon 
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WO 02/14500 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


5629 


454007 


2090 009 ez43 274049 


F 


M00043 1 97DH12 


I IC3 -PrimCnlnn 

KJ ^s—r X X XXXL\*s\JX\JXX 


5630 


425455 


2090 L10 ez43 274062 


F 


M00043 186DB09 


T Tr*3-PrimPn1nn 

A 1 XJX1V_/\JXU11 


5631 


453354 


2090 HI 1 2z43 274074 


F 

X 


M0004^ 1 73 R-FO^ 


L7V_/*7 XT 1 Alll wUlftJll 


5632 


525315 


2090 111 ffz43 274075 


F 

X 


M0004 1 ? 1 76P- A 0 1 


KJ V-^ X X XXllV_xWlVJiX 


5633 


453890 


2090 Nl 1 ez43 274080 


F 


M00043 1 94 A *G06 


T TP^ -PHmPnlon 

\J V^— ' X X XX X X k7X V7XX 


5634 


45446O 


2090 A12 &z43 274083 


F 


M00043 1 5 1 R-D09 


T rP3-Primrn1nn 

VJ X 1 XiXXV^vJXVJlX 


5635 


577869 


2090C12ffz43 274085 

Vs A^r .g^<T«s / "TV/O-7 


F 

X 


M0004^1 58AF03 


L/ J X 1 XXlXVyUlL/xl 


5636 


454664 


2090 113 2z43 274107 


F 


M0004', 1 76C F 1 2 


T TC^ -PrirTiCnlnn 

v7 V^«7 X X XX X X V7XVJXX 


5637 


528369 


2090 L13 ffz43 274110 


F 


M00043 187CE06 


T TC3-PriTnCo1on 

UV/*/ X X XXXXV>^XV7XX 


5638 


596539 


2090 Jl 4 az43 9741 94 


F 

X 


M00043 1 8 1 CC 1 0 


T TP 3 -Prim Pnlnn 

U K^fJ X X XXX X^^k7XV7XX 


5639 


452182 

"t«j.^. A *J.£* 


2090 014 ez43 274129 

A.V/7 V/. V/ / " X ^ 7 


F 


M00043 199 A-F03 


T Tr*3-PrimPn1on 

V7 V-'^' X X XI XXV-/ UXIXLX 


5640 


590599 

.J^\J.J 77 


2090 HI 5^743 274138 


F 

X 


M00043 1 73rT)l 9 


T TP3-PrimPo1nn 

\J KsJ X X XXX X\w/ VJ xuxx 


5641 


530971 

.J J v/7 1 X 


2090 LI 6 az43 274158 

x>v7v<Ju1U.k£iT*/ .— • it J. JO 


F 

X 


M0004^ 1 87DH04 

1VXV7V/V/*T J AO/ X-r .A XV/*T 


T Tr*3-Pri mPnlfin 

X X XXXXV^/VJXvJXX 


564? 


98931 6 

A VJ 


9090 Nl 6 <xt43 9741 60 


F 


iVron04^ 1 94TV F 1 0 

lVXWv/VJ*T*? l7TJL/.Ayl v 


T Tr'3-PrimCnl nn 

L7 J x 1 XXllv^vJXkJll 


5643 


454731 

*T«7*T / %7 A 


9090 A 1 7 cr743 9741 63 


F 

Mr 


M0004^ 1 59 A -F1 0 


U V,/-? X 1 11 1 X Ul vJll 


5644 


453730 


2090 D17 ffz43 274166 


F 

X 


M00043 1 62DF06 


T TP 3 -Prim Colon 

VJ V— ' X XXXXXV-/V71V71X 


5645 


450784 


2090 G17 ez43 274169 


F 

X 


M00043 1 7 1 D -G08 


T TP3 -Prim Col on 

KJY^r^J X X JXXXVsVJlUXX 


5646 


404081 


2090 Ml 7 ffz43 274175 


F 

X 


M00043 1 9 1 C -G05 

AVXV/V/V/-T.J X 7 X V/i\JvJ 


T TC3 -Prim Col on 

V7 V^-7 X X XXXXV^UXUIA 


5647 


456723 


2090A18ez43 274179 


F 


M00043 152AG08 


T TC3-PrimColon 

\J V»^-7 X X AXXX\_y\JXVJXL 


5648 


526449 


2090 C18 ez43 274181 


F 


M00043 159 AC01 

AtXV/V/ V/T^«^ la77/\.v/V/l 


T IC3-PrimColon 

KJ V_v— ' X X XXXl\wsVJX\-UX 


564Q 


413693 


2090 HI 8 &z43 274186 


F 

X 


M00043 1 74R* 1306 


T TC ^ -Prim Col on 

VJ X^fJ X X XXAX\_sV7Xv7XX 


5650 


4543 1 1 

tj*tj x x 


2090 J18 ez43 274188 


F 

X 


M00043 1 82B-C02 


TJC3-PrimColon 

UV/J X X XXXXV>V7XV^XX 


5651 


596733 


2090 H90 cr743 9749 1 8 


F 


M00043 1 74r*P06 


T TC^ -Prim Col on 

VJ \sJ X 1 Alllv^V/lLUl 


5657 


439159 


2090 K20 a743 274221 

^Vs7\s.A\~£»VJ.££ZjT«s X«/*TZ»Z«A 


F 

X 


M00043 1 84D-G06 


T TC3-PrimCo1on 

VJ X X XXXXV_/VJXUXX 


5653 


453983 

TJJ70J 


2090 M20 ffz43 274223 


F 

X 


M00043 1 99RH06 


T TC3-PrimColon 

VJ A X XXXX V^VJXVJXl 


5654 

mJ \J*J 


518179 

•7 J. O X / ^ 


2090 N20 ffz43 274224 


F 

X 


M00043 1 95D- A05 


T TC3 -Prim Col on 

VJV_s^J XT 1 A111V^V/X V/XA 


5655 


454699 


2090 P99 &743 974258 


F 

A 


M0004^9 1 0PF08 


T TC3 -Prim Col on 

VJ \sJ X 1 Xlll VvUl VJll 


5656 


530939 


9090 A93 cr743 974259 


F 


M0004^ 1 59 13FT04 


T TC3 -Prim Col on 

VJ \s~s XT X XXXXV^UXVJil 


5657 

•J VX»7 / 


934970 


2090 G23 ff743 274265 


F 

A 


M0f)04^ 1 79CR05 


T TC3 -Prim Colon 

VJ V-/«7 A 1 XXXXV^VJXVJXX 


5658 

•7VJ.-7 0 


599742 


2090 T24 ffz43 274283 

^VJ7V/.A^*T.^fc»T»7 ,A./*T^.CJ«J 


F 

A 


M00043 1 78PG06 


T TC 3 -Prim Colon 

VJ V_/~7 X X XXXIV^ VJXVJ11 


5650 


404461 


9099 Ct04 <^43 974444 


F 


M0004^4381 : i*F1 0 

xVAvUV/TO'-r J OXJ .Aw X 1/ 


T TC^-ColonA/fpfT ftrpr 

VJ V_y«J V^ VJlVJlllVAC UL/1 VOl 


5660 


AS 1093 


9099 T04 W743 974446 


F 


M0004^448Ti-W09 


T TP Q _C ol r\rt \Af>iX ivf»r 

j-l^UlLJxxlVACLL/lVCl 


5661 


535436 


9099 00 5 ot43 974460 


F 

A^ 


M000434^ 9iC -D 1 9 


TTC3-Co1onMRtT ivf»r 

VJ -V^UlVJlAiVACLXwl VC1 


5669 


456001 


9099 "NT05 ot43 974467 


F 


M00043470P * A 0 1 


T TpQ _ColonA/fptT ivpr 

U V,/ -J - VJlVjlXlVAC U_#l V CI 


5663 


450794 


9099 O06 a743 974484 


F 


Mn0043478RTj0 1 


TTC3-Co1on"MV*rT ivf*r 

Uw V-^VJlVjlUVAC LI— (1 VOl 


5664 


539142 


2092.A07.gz43_274486 


F 


M00042592A:H02 


UC3-ColonMetLiver 


5665 


450914 


2092.K07.gz43_274496 


F 


M00043455C:G07 


UC3-ColonMetLiver 


5666 


455855 


2092.P07.gz43_274501 


F 


M00043484D:H08 


UC3-ColoiiMetLiver 


5667 


537451 


2092.F08.gz43_274507 


F 


M00043433A:F02 


UC3-ColonMetLiver 


5668 


535123 


2092.G08.gz43_274508 


F 


M00043439C:D05 


UC3-ColoiiMetLiver 


5669 


452830 


2092.J08.gz43_274511 


F 


M00043452D:D05 


UC3-ColonMetLiver 


5670 


534054 


2092.L08.gz43_2745 13 


F 


M00043460D:C03 


UC3-ColonMetLiver 
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SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORE 
NT 


CLONE ID 


LIBRARY 


5671 


162851 


2092 M08ez43 274514 


F 


M00043465B:G08 


UC3-ColonMetLiver 


567? 


455980 


2092 D09 ffz43 274521 


F 


M00043420C:C02 


UC3-ColonMetLiver 


5673 

JO / J 


456090 


2002 H09 ez43 274525 


F 


M00043446B:H01 


UC3-ColonMetLiver 


5674 


448946 


2092 109 cz43 274526 


F 


M00043449C:H09 


UC3-ColonMetLiver 


5675 


539353 

«j ^ y -j -~j ~j 


2092L09ez43 274529 


F 


M00043460D:H01 


UC3 -ColonMetLiver 


5676 


450658 


2092 P09 ez43 274533 


F 


M00043485A:C04 


UC3-ColoiiMetLiver 


5677 

— ' V7 / / 


544355 


2092 E10 ez43 274538 


F 


M00043427C:E11 


UC3 -ColonMetLiver 


5678 


452198 


2092 F10 ez43 274539 


F 


M00043434A:H02 


UC3-ColonMetLiver 


5679 


447534 


2092 110 ez43 274542 


F 


M00043449D:A06 


UC3-ColonMetLiver 


5680 


455256 


2092 P10 sz43 274549 


F 


M00043485B:C09 


UC3-ColonMetLiver 


5681 


455201 


2092 Cll sz43 274552 


F 


M00042640C:C10 


UC3-ColonMetLiver 


568? 


535190 

1^7 


2092Fllez43 274555 


F 


M00043434B:D06 


UC3-ColonMetLiver 1 


5683 


455808 


2092Hllffz43 274557 


F 


M00043446C:E04 


UC3-ColonMetLiver 


5684 


915366 


2002 Tl 1 &z43 274558 


F 


M00043449D:E09 


UC3-ColonMetLiver 


5685 


456954 


2002 Jl 1 ffz43 274559 


F j 


M00043453BC06 


UC3-ColonMetLiver 


5686 


536415 

JJUtlJ 


2092 Nil ez43 274563 


F 


M00043472A:E06 


UC3-ColonMetLiver 


5687 


459801 


9002F12az43 274571 


F 


M00043434BE04 


UC3-ColonMetLiver 


5688 


400958 


9002 HI 2 ez43 274573 


F 


M00043446C:E08 


UC3-ColonMetLiver 


5680 

JU07 


546191 


9002J12az43 274575 


F 


M00043453BF02 


UC3-ColonMetLiver 


5600 

JU7U 


4.54691 


9002K12&z43 274576 


F 


M00043457CB12 


UC3-ColonMetLiver 


5601 


4551 69 


?002D13oz43 274585 


F 


M00043422BC03 


UC3-ColonMetLiver 


JU7Z 


54^490 


?nO?P13^743 274507 


! F 


M00043486BD02 


UC3-CoIonMetLiver 




456940 


?00? T14 <*743 274606 


F 


M00043450BC1 1 


UC3-ColonMetLiver 


56Q4 


531461 


2092 E15 ez43 274618 


F 


M00043428C:A08 


UC3-ColonMetLiver 


5605 

J07J 


45557? 


?002H15ez43 274621 


F 


M00043447AF02 


UC3-ColonMetLiver 


5606 

JU7U 


456783 


2092 115 ez43 274622 


F 


M00043450BH05 


UC3-ColoiiMetLiver 


5607 


2544 


2092N15ez43 274627 


F 


M00043473C:B11 


UC3-ColonMetLiver 


5608 

JU70 


54^4^ 1 


2092 P15 ez43 274629 


F 


M00043486C:D02 


UC3-ColonMetLiver 


5600 

~J \J ^ 


402534 


2092 016 ez43 274644 


F 


M00043481A:G02 


UC3-ColonMetLiver 


5700 


454563 


2092 117 ffz43 274654 


F 


M00043450C:F11 


UC3-ColonMetLiver 


5701 


541001 


?00?K17p743 274656 


F 


M00043459AB08 


UC3-ColonMetLiver 


5709 
J / uz 


536995 


?00? T 17 ^743 ?74657 


F 


M00043462CE12 


UC3-ColonMetLiver 


570^ 


455370 


?00? Ml 7az43 ?74658 


F 


M00043468CD08 

1TXV *J^%\J\J v^.X«^ w 


UC3-ColonMetLiver 


57fY4 


45£5£7 


?O0? PI 7 0743 ?74661 


F 


M00043486CF07 


UC3 -ColonMetLiver 


^7fV5 
J /Uj 


^QRfl4 

JJ70U*f 


?O0? PI 8 0-743 ?74664 


F 

X 


M00043417AH12 


UC3-ColonMetLiver 


5706 


452618 


2092.N18.gz43_274675 


F 


M00043473D:D11 


UC3-ColonMetLiver 


5707 


543855 


2092J19.gz43_274687 


F 


M00043453D:D02 


UC3-ColonMetLiver 


5708 


415326 


2092X19.gz43_274689 


F 


M00043462D:C09 


UC3-ColonMetLiver 


5709 


455117 


2092.B20.gz43_274695 


F 


M00042600D:B08 


UC3-ColonMetLiver 


5710 


455601 


2092.C20.gz43_274696 


F 


M00043417B:F12 


UC3-ColonMetLiver 


5711 


455254 


2092.E20.gz43 274698 


F 


M00043430D:C01 


UC3 -ColonMetLiver 


5712 


402916 


2092.I20.gz43 274702 


F 


M00043451A:B02 


UC3-ColonMetLiver 
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SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


5713 


455864 


2092 N20 ez43 274707 


F 

X 




UvyJ -V^OlOIlLVlCLl-ilVcr 


5714 


27586 


2092 P20 ez43 274709 


F 


M00043489C-FT06 

1VX \J \J \J *T J *T O ^ . X XVI vy 


v^ Vs-J — V^vJlUIliVXC LLfl vcr 


5715 


452392 


2092B21ez43 274711 


p 


M00042600DE1 1 

XVXvvy v^^vyVyv/X-J.XwX X 


Uv>J V^vJiOIxiVlCtX-/l V CI 


5716 


447380 


2092 C21 ez43 274712 


p 


M000434 1 7C-F 1 2 

xvxv/V/vy*T x / v_>,x ixi 


KJ \sJ V^v7lvyIlXVxCLL/lVCr 


5717 


448250 


2092 121 ez43 274718 


F 


M0004345 1B-D08 


1 TP^-T*o1nnA/f<*fT ix/^r 
VJ «^ \^ vJlvJIXiVxC uui vcr 


5718 


536171 


2092 B22 2z43 274727 


F 


M0004263 7C*E 1 2 

±TXvVvy fc r^«vy»y / vv.Xwiz> 


vj j~v^\jiuiiiviciJL»ivcr 


5719 


455501 


2092 H22 ez43 274733 


F 

X 


M00043448B-E08 

XtXV/ \J \J*t -J f T OJJ . Xw vy O 


vJ V-/-7 V^vJlvylxiVlCLX-fl VCI 


5720 


455808 


2092 022 ez43 274740 


F 


M00043483B G0 1 

a »xvy vy \y*T jtu J u . vj vy x 


V7 v^-y "V^vJlUi-LLVxC LXjI VC1 


5721 


422223 


2092 C23 ez43 274744 


F 


M000434 1 8 A • A06 

xvxvyvy v/"«y~ x o/x./vvvj 


TTP3-Pn1nnlV/rf*tT ivpr 

KJ V-*«7 wVJllJlXlVXCULyl V CI 


5722 


455814 


2092 023 ez43 274756 


F 


M00043483B*G1 1 

j-» A\y vy v/~»^ ~u -j xj . \*j x x 


T JP3-Po1onMetT /tver 


5723 


538582 


2092 H24 ez43 274765 


F 


M00043448C-G07 

xTxvyvy vy i «y ■ i uv>>\iv / 


TJC3-PnlnnMetT iver 

Vy \_/V/1v7XX1VACUXj1 V t/X 


5724 


447238 


2092 124 sz43 274766 


F 


M0004345 1 C*H03 


TTP^-PolnniVfptT ivpr 

VJ V_/-7 V^ vJlvyXXiVXC ULyl V CI 


5725 


456577 


2092 P24 ez43 274773 


F 


M00043490A-F03 

j-TAvy vy vy^«y ~ vyy*..x 


T TP3-Pr»1nnMe*f*T Jver 

UV/J V_/V7XvJXXXVXwLXyXVd 


5726 


11576 


2176F01ez43 281685 

A* X / \y .X V/ X » ^> w X W 


F 


M00039503B-GO5 

iVXvyvy \J J 17 *J \J -JXJ. VJ \J-J 


X XXVJXw V-/ VyXXL 


5727 


401426 


2176P01ez43 281695 

a# x / v/.x vy x 'Jiyt iT«y ^«uiu7J 


F 

X 


M00039605DF02 

lVXvy Vvy *J ^\j\J-JU .Xu\JZ* 


FfMFP-TTtit 

X XiVXX_/V^ vJXlL 


5728 


409330 


2176 B02 ez43 281697 


F 


M00039363BH1 0 


FfMFP-TTtit 

X XXVXXJf V-/ V^XIL 


5729 


407195 


2176 C02 ez43 281698 


F 

X 


M00039369r>ni 1 


X XlVXJCr V^ UXXl 


5730 


411985 


2 1 76 D02 ez43 281 6QQ 


X 


M00039379 A DO^ 


XIXVXXJ/V^-"* vJill 


5731 


412621 


2176 F07 ^743 281701 


F 


1txvIUv7 J \7-J\JDxJ .Xivy O 


XXlVxX-fV^™' UI1L 


5732 


40771 1 

*TV //XX 


2176 HO? cr74^? 781 70^ 


F 


lVxvyULU^J 1 H-xJ ,xZ>vy / 


x^Vxxwv-/- , um 


5733 


411113 


?176G0^a743 281718 


i? 


IVxv/ \J \J D ^7 J v OJL/ . V/VJ 


XXlVxxZfV^"" U I1L 


5734 

/«y*T 


40241 1 


2 1 76 T04 074^ 28 1 736 




M00n^9S9^ A -IjOO 
i vi u w \/ d y j 2i ^? ^v. . xj \jy 


xxlvlxj/Vv"" uni 


5735 


147279 


2176F06Q743 98176S 

^X /vy.X V/vJ.g^i*T-y ZrO X / vJ_y 


F 

X 


Mn00^9S04RFnS 

lVXvy \3 \j D y J Vy*+J_> ,ruj 


XXXVl LJV,^"" um 


5736 


400628 


2176 L06 $yz43 281771 


F 

X 


1VXUUV/J7J vyvJX^.XwVSO 


XXlVI L_<V> UUL 


5737 


407443 


2176C07oz43 281778 

z- x / vy . v-^vy / ,gZi*T*y z#o x / / o 


F 

X 


M00039371 AF01 


FTN/rFP-TTnt 

XXlVI c«v_^- U JxL 


5738 


413346 


2176N07q:z43 281789 

a> x / vy .l ^ vy / .yrfj z#o x / o ^ 


F 

X 


M000^959?PF09 


X XiVJLx-*v^~ vJXXL 


5739 


125543 


2176 O07 ffz43 281790 


F 

X 


M000^9599 A F08 


XJUVJXyV^~ UI1L 


5740 


202308 


2176P07ff743 281791 

^ x / vy .x vy / .gLtJ ^.o x / 7 X 


F 

X 


MOOO'? 9606P R07 


HMFP-TTnt 

XxiYiDV/ L/XlL 


5741 


400428 


2176K08ffz43 281802 


F 


M00039562A-D1 0 


FfMFP-TTnt 

X XiVXJ_/Vx VyXXL 


5742 


400426 


2176H09ffz43 281815 

a»x / w.xxvy>' ♦^/-<~»y z»u 10 u 


F 

X 


M000395 16CH06 

±vx\j\j\j-j y *j x vjv^.xxvyvy 


FTMFP-TTnt 

X XXVXXwV^ ly XXL 


5743 


402147 


2176 P09 ffz43 281823 


F 

X 


M00039607AF05 

1VAv/vvJ7v\/ /xx.i. \J-J 


HMFP-TTnt 

X XLVXX_#Vv VyXXL 


5744 


400233 


2176 K10 0^43 281834 


F 

X 


M0003 9 5 62D Rfi9 


HMFP-TTnt 

X XXVXX-/V^ UXXL 


5745 


408986 


2176Kllffz43 281850 


F 


M00039562D-Gr01 

iviuvuj7^Ui-iy,vjui 


HMEP-Unt 

X XXVI 1 /V-> V7XXL 


5746 


400171 


2176B12az43 281857 


F 

X 


M00039366r"R10 

XTXV v vy J 17 «7 \J V/V^ »XJ X v 


HMFP-TTnt 

JTXXVXXiV-^ UXXL 


SIM 


403671 


2176ri2iyz43 281858 


F 

X 


M00039373RBO0 

±VX\J\J\S~J s~J 1 -JxJ ,llV/y 


HMFP-TTnt 

xxIyxxZi V-^ UxxL 


5748 


400047 


2176.A13.gz43_281872 


F 


M00039184A:A08 


HMEC-Unt 


5749 


404736 


2176.B13.gz43_281873 


F 


M00039366D:A02 


HMEC-Unt 


5750 


413627 


2176.F13.gz43_281877 


F 


M00039505B:G07 


HMEC-Unt 


5751 


407260 


2176.K13.gz43_281882 


F 


M00039563C:D01 


HMEC-Unt 


5752 


155374 


2176.M13.gz43_281884 


F 


M00039586A:C03 


HMEC-Unt 


5753 


409612 


2176.114.gz43_281896 


F 


M00039525B:H03 


HMEC-Unt 


5754 


402411 


2 176. Al 6.gz43_28 1 920 


F 


M00039184C:C05 


HMEC-Unt 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORE 
NT 


CLONE DD 


LIBRARY 


5755 


411124 


2176.C17.gz43 281938 


F 


M00039375A:C09 


HMEC-Unt 


5756 


412416 


2176 J17 ez43 281945 


F 


M00039534D:E07 


HMEC-Unt 


5757 


406734 


2176.M17.gz43 281948 


F 


M00039586D:D05 


HMEC-Unt ] 


5758 


408306 


2176 K18 ez43 281962 


F 


M00039564B:F08 


HMEC-Unt 1 


5759 


1054 


2176E19ez43 281972 


F 


M00039502D:H03 


HMEC-Unt 


5760 


406855 


2176 119 e:z43 281976 


F 


M00039527BD06 


HMEC-Unt 


5761 


400454 


2176 H20 ez43 281991 


F 


M00039521A:D02 


HMEC-Unt 


5762 


402147 


2176 J20 sz43 281993 


F 


M00039536B:E10 


HMEC-Unt 


5763 


202308 

X> VX, mJ \I \J 


2176 M20 ez43 281996 


F 


M00039588A:H12 


HMEC-Unt 


5764 


413997 


2176 J21 ez43 282009 

X* X / V/ • ttf X* X • f)f t » X* w X« \J\J 


F 


M00039536B:H03 


HMEC-Unt 


5765 


409589 


2176K21sz43 282010 

X* X # V/ • X^ r r X . r^.ttf «. ' X* UX* V v 


F 


M00039564C:H05 


HMEC-Unt 


5766 


409262 


2176.M21.gz43 282012 


F 


M00039588B:H02 


HMEC-Unt 


5767 


405932 


2176.H22.gz43 282023 


F 


M00039521C:B01 


HMEC-Unt 


5768 


20517 


2176 M22 ez43 282028 


F 


M00039588D:F10 


HMEC-Unt 


5769 


413915 


211 r 6.P23.gz43 282047 


F 


M00039610A:G11 


HMEC-Unt 


5770 


412483 


2176.F24.gz43 282053 


F 


M00039507D:E03 


HMEC-Unt 


5771 


402392 


2176 G24 ez43 282054 

X* X / v,V»**~.fct< i** x»u<t»v— »^ 


F 


M00039514AG03 


HMEC-Unt 


5772 


401510 


2176 J24 ez43 282057 


F 


M00039537B:F06 


HMEC-Unt 


5773 


408649 


2176L24ez43 282059 


F 


M00039583A:G09 


HMEC-Unt 


5774 


418682 


2178 K02 ez43 282160 


F 


M00040183B:C06 


HMEC-bFGF 


5775 

U I I -> 


417259 


2178E04ez43 282186 


F 


M00040141AG10 


HMEC-bFGF j 


5776 


20517 


2178 P04 ez43 282197 

x. x / v «x \y i . ~3 x»«jx» m. ^ * 


F 


M00040235C:D02 


HMEC-bFGF 


5777 


414821 


2178 G05 ez43 282204 


F 


M00040160C:A04 


HMEC-bFGF 


5778 

*s l l \J 


402534 


2178 K05 ez43 282208 


F 


M00040183D:C08 


HMEC-bFGF 


5779 


402411 

TV/i<" X X 


2178 D07 2z43 282233 


F 


M00040 1 34 A: A07 


HMEC-bFGF 


5780 


419255 


2178 L07ez43 282241 

X. X # *-» .Uv / X« VJ r > f r T X 


F 


M00040188B:E05 


HMEC-bFGF 


5781 

~7 / O X 


417426 


2178 Gil ffz43 282300 

X> JL / VJ . VJ X X i I — ' X« *J X. ^ W 


F 


M00040161C:H06 


HMEC-bFGF 


5787 

•J / OX, 


415527 
*tijj£> / 


2178 G12 ez43 282316 

X* X / U a VJ XX. . tij 1 - ' X. X V/ 


F 


M00040161D:C03 


HMEC-bFGF 


578^ 


418^40 


2178 J12 ez43 282319 

X. X / U> .*/ X X. ,gx~^ £>U«<J X 


F 


M00040181BB06 


HMEC-bFGF 


5784 


40*? 1 54 


2178 L12 ez43 282321 

Xy X / <J «XW Ifa.K/j^J XyUX««/X.X 


F 


M00040189D A06 


HMEC-bFGF 


5785 


40^837 


2178 J13 ffz43 282335 


F 


M00040181BC05 


HMEC-bFGF ! 


5786 


418482 


2178 L14 ez43 282353 


F 


M00040190BC02 


HMEC-bFGF 


5787 


419751 


2178 C15 ez43 282360 


F 


M00039760BF12 


HMEC-bFGF 


5788 


409070 


2178B16ez43 282375 


F 


M00039752DD07 


HMEC-bFGF 


57RQ 


409070 


2178N16ffz43 282387 

x. x / o.i^i x v7 -r. x, o x. u / 


F 


M00040227A-E07 


HMEC-bFGF 


5790 


420504 


2178.C18.gz43_282408 


F 


M00039760C:H07 


HMEC-bFGF 


5791 


147279 


2178.P19.gz43_282437 


F 


M00040248A.G09 


HMEC-bFGF 


5792 


402353 


2178.H20.gz43_282445 


F 


M00040171BH03 


HMEC-bFGF 


5793 


416914 


2178.L20.gz43_282449 


F 


M00040196B:F10 


HMEC-bFGF 


5794 


163970 


2178.E21.gz43_282458 


F 


M00040145B:C12 


HMEC-bFGF 


5795 


416762 


2178.021.gz43 282468 


F 


M00040233C:F09 


HMEC-bFGF 


5796 


57183 


2178.M22.gz43 282482 


F 


M00040222A:E06 


HMEC-bFGF 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


5797 


403306 

r \jj m j *j vv/ 


2184 DO 1 ez43 282670 


F 


M00039950CC05 


HMEC-VEGF i 

X XJ.VXX-fV-' V XjVJX 


5798 


402298 


2184 L01 ez43 282678 


F 


M00040328DA03 


HA4EC-VRGF 

XXXYI l-/V^ V XwVJX 


5799 


420958 


2184 K02 ez43 282693 


F 


M00040322BA08 


HMFC-VFGF 

■ iivii a y X— 'vJX 


5800 


423884 


2184A03ez43 282699 


F 

X 


M00039747AH06 


HMFC-VFGF 1 


5801 

JOU X 


268336 


71X4 T04 o^43 7X7774 


F 

X 


M000403 1 XCROO 


TnVTFP-VFGF 

XXLVXXwVw' V Xw VJJT 


5802 


422590 


?184O07o743 7X7777 


X 


M00040364BF1 1 


FIMFr-VFGF 

X XxVXXjV^ V X-/VJX 


5803 


402298 


2184A08oz43 282779 


F 


M00039749BG05 


HMEC-VFGF 

X XXYXXwV^ V X^VJX 


5R04 


473534 


71X4 Til o^43 7X7X36 


F 

X 


M000403 1 9DCt1 0 

JLVXv/V/V/*TV/«J X^X-/.VJ XV/ 


T4MFC-VFGF 

X XIV 1 1 V XJ/V_I X 


5805 


473008 


?184K13oz43 7X2X69 


F 

X 


M00040323DF04 

lVXV/V/V/*TV/«7^»-7X-'.X V/*T 


HMFP-VFGF 

X XIVXXjV^ V XJ^VJX 


5806 


474723 


21X4N13oz43 2X2872 


X 


M00040347BB1 1 

JLVXV/V7V/*TV/-7~T / XJ . X-> X X 


HMFC-VFGF 

X XXVXXww V XJ/VJX 


5807 


20517 


21X4D16tr743 2X2910 

Z-lO'T.L/JU.g^TJ Z.OZ/7 1U 


F 


M0003 995 8D D05 


HMFC-VFGF 

X XlVXXvV^ V X^VJX 


5808 


403306 

TV/.J-J V/V7 


21X4 A17 0743 7X2923 


F 

X 


M00039750C-F0X 

1V1vvV»/7 / w'V/V^.X v/O 


HMFC-VEGF 


5809 

JOV/7 


140224 

X *T V/ /» />"T 


21X4 A77 0743 7X3003 


F 

X 


M00039762CD1 1 

XVXV/V/V/«7^ / VJi«V^ .X-f X X 


HMFC-VFGF 

X XI Y.I l.j\*f V XJVJX 


5810 


407708 


71 X4 G74 0743 7X3041 


F 

X 


M000407 04 R • F00 

1VXV/ \J V/*T -7 *TX^ . X-»V7 J' 


HMFC-VFGF 
xxivxxwV>~ v X-/VJX i 


581 1 


468Q30 


1564 A01 0743 706552 


F 

X 


M00042709RG05 


T TC7-NnrmCr»lnn 

KJ — l>t V/lXIlv^V71V7XX 


mJ o xz# 


463 1 43 


1564R01 ov43 796553 


F 

X 


M0004271 1BTT04 

1VXV/V/V7*TZ. / X XXJ.XXWt 


T TC 9 -NInrm Cnl nn 

VJ V/i li 171 XllVsVJJLVJXX 


5813 

JO X-J 


46578S 


1564 CO 1 o743 7Q6554 

X JUt, v^V/ X .ei^t J ^.^VJ-JJ^ 


F 

X 


M0004771 3RP0X 

1YXV'\JV/*tZ» / 1 JD.V/UO 


T TC7JMnrmCo1on 

VJ Vw^ LN V7iXliV_/V7lV7Xt 


5814 
J o x*t 


453766 

TJJ / \J\J 


1564 T0 1 0743 7Q6560 


F 

X 


M00047774 A -Ct07 

lYXV/V/V/'T^L. / -tr"T^V. VJ V/^l» 


T TC7 .Norm Pol nn 

VJ V^Z» 1> V71 III V> V71UX1 


5815 


467398 


1564 T01 0743 706561 


F 

X 


M00042776D-fir1 1 


T TC7-NnrmCn1on 

KJ V>X> 1^1 V/lXXiV^V/XV/XX 


5816 


300563 


1564 O01 0743 706566 


F 

X 


M00047 73 7tt-f 1 07 

lVXV/V/V/*tA» / -7 / XJ.VxV / 


T TC9 -NnrmCnl on 

V7 V^xit 1^1 V71 Xliv^ V/1V/XX 


5817 

-/OX/ 


447189 

1 "1 / X 


1564 C03 oz43 296586 

X v/*T. V-^W«7 .fc^^T-? /,7UJUv 


F 

X 


M000427 1 3BF03 

IVXV/V/V/^Ta. / 1JJL/.1 V7J 


T TC9 -NormCnl nn 

UV/A X^IV/llI.IwV7XV7XX 


5818 


462348 


1564H03ffz43 296591 


F 

X 


M00042722A-G0X 

lVXV/V/V/*"rAr / iZ,A.\JuO 


T TC7-NnrmCnlnn 

VJ X^i 1711I1V_^V7XV/XX 


5819 


446740 


1564O04oz43 296614 

X.J UT.V/vt.gti"-' X»^VJV/X"T 


F 


M00042737CC07 


T TC? -Norm Colon 

VJ V^^» X i V71 XlXVsVXLVMX 


5820 


447268 


1564 C05 oz43 296618 

X^ V7T. V^V/i-7 .fa/iT-J ^>7WJLU 


F 

X 


M000427 1 3 C BOX 

1VXV/V/V/"TX» / 1 JV/.JJUO 


T TC7 -NormCol on 

UV>i> l^lV711Xiv^V7-iV7XX 


5821 


461316 


1564 K05 ez43 296626 


F 

X 


M00042729 A*F1 1 

1tXV/V/V/T\^» / Zi7rV,J. X X 


T TC2-Norm Colon 

UVyX, X^IV7X1X1V^V7XV7XX 


5X27 


4706X6 


1 564 N05 0743 706629 


F 

X 


M00047735rO07 

IVXV/V/ V/TA / JJV/.VJV//- 


T TC7 -Norm Col on 

VJ V^^> 1 1 \Ji. 1 11 V71V/XX 


5873 


447579 


1564 O05 oz43 296630 


F 

X 


M00042737CE03 


T TC7-NormCo1on 

VJ V^^-» 1 1 V71 lllV/Ui UXX 


5874 


447006 


1 564 F*06 0743 206633 

X *J WT .XJV/v/ .£^£jT+s Xr7V/V/«/«/ 


F 

X 


M0004771 1C-H10 

1VXV7 V/V/*T^, /XX Vy .XXXV/ 


T TC 7 -Norm Col on 

VJ V^X* JL~ V/i 1 1 1 V_^V71 V7XX 


5875 


463X74 


1 564 N06 0743 706645 

X^VJ*T.1^I V/V/.g-fciT-J ^,3'V/V/*T«J 


F 

X 


M00047735D-A07 

iVXV/V/V/*T^r / JJJL/.AV / 


T TC? -Norm Colon 

VJ X V/l X11\-^V71V/XX 


5876 


461 135 

*TV/ 1 iJJ 


1564 O07 0743 706654 

IJUt.VJU / ,giTJ ^»7VJVJ«J*T 


F 


lVf 0004777 OP F03 

1VXV7V/U*T^ / Z«V/V_y.XjV/-7 


T TC? -Norm Colon 

VJ V>/« 1> V/l 111 v> V/1UX1 


5877 


388688 


1 564 A08 oz43 296664 

X ^vI*T.^\,\/0.^*jT^ ^l*^ V_/V7V7*T 


F 

X 


M0004771 0A*O 1 0 

XVXV/V/V/'tX / XV/XTl.VJ XV/ 


T TC? -Norm Colon 

UV^< 1^IV/1111\_AJ1V/XX 


5878 


467 QX6 


1 564 K0X 0743 706674 


F 

X 


M00047770AH0X 

iVXV/V/V/*TX / ^sJt\. . X XV/ o 


T TC ? -Norm Col on 

VJ V^^> 1^1 V/l 111 V^V71V/XX 


5870 

JOZ7 


454875 


1564 M0X 0743 706676 


F 


M00047733rr , 05 


T TC ? -Norm Col on 

VJ V^Z, lNV/llllV^/VJlV/XX 


5830 

JojV 


466614 


1 564 O0X 0743 7 0667 X 


F 

Mr 


M00047737Tj-T>03 


T TC? -Norm Col on 

VJ ll V/i lll\_> V/lV/Xl 


5831 


457075 


1564 A 00 0743 7066X0 


F 


M0004771 013 BOO 

iVXV/V/V/*T^ / X KJD .XJ\J2s 


T TC? -Norm Colon 

VJ V/Zr 1" V/l lllV^vJlV/Xl 


5832 


446345 


1 564.109 .gz43_296688 


F 


M00042724D:B04 


UC2-NonnColon 


5833 


456530 


1564.Cll.gz43_296714 


i F 


M00042714A:G04 


UC2-NormColon 


5834 


462337 


1564.Ell.gz43_296716 


F 


M00042716D:G04 


UC2-NormColon 


5835 


447387 


1564.N1 l.gz43_296725 


F 


M00042736A:F03 


UC2-NormColon 


5836 


467780 


1564.B12.gz43_296729 


F 


M00042712A:E08 


UC2-NonnColoxi 


5837 


447561 


1564.C12.gz43_296730 


F 


M00042714A:H05 


UC2-NormColon 


5838 


461313 


1564.K12.gz43_296738 


F 


M00042729B.F10 


UC2-NormColon 
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Tabid 



SEQ 
ID 
NO 


2LUSTEE 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


5839 


453679 


1564.L12.gz43_296739 


F 


M00042732A:A11 


UC2-NormColon 


5840 


446964 


1564.G13.gz43_296750 


F 


M00042721A:G07 


UC2-NonnColon 


5841 


459158 


1 564.H1 3 .gz43_29675 1 


F 


M00042722D:C12 


UC2-NormColon i 


5842 


450723 


1564.M13.gz43_296756 


F 


M00042733D:G08 


UC2-NormColon 


5843 


447826 


1564.C15.gz43_296778 


F 


M00042714B:E01 


UC2-NormColon 


5844 


446741 


1564.D15.gz43_296779 


F 


M00042715D:E02 


UC2-NormColon 


5845 


452687 


1 564 .F 1 5 .gz43_29678 1 


F 


M00042719A:H06 


UC2-NoimColon 


5846 


446601 


1 564.H15 .gz43J296783 


F 


M00042723A:D09 


UC2-NormColon 


5847 


447649 


1564.B19.gz43_296841 


F 


M00042712C:E06 


UC2-NormColon 


5848 


465528 


1564.C19.gz43_296842 


F 


M00042714C:C10 


UC2-NormColon 


5849 


462865 


1 564.E 1 9.gz43_296844 


F 


M00042717B:G11 


UC2-NormColon 


5850 


405932 


1564.F19.gz43_296845 


F 


M00042719C:H05 


UC2-NormColon 


5851 


457922 


1564.G19.gz43_296846 


F 


M00042721D:B03 


UC2-NormColon 


5852 


463821 


1564 .N 1 9.gz43_296853 


F 


M00042737A:A07 


UC2-NormColon 


5853 


463217 


1564.019.gz43_296854 


F 


M00042738D:H12 


UC2-NormColon 


5854 


389425 


1564.A20.gz43_296856 


F 


M00042711A:F01 


UC2-NoimColon 


5855 


446242 


1564.O20.gz43_296870 


F 


M00042739A:C06 


UC2-NormColon 


5856 


449171 


1564.C21.gz43_296874 


F 


M00042714C:F12 


UC2-NormColon 


5857 


450723 


1564.G21.gz43_296878 


F 


M00042721D:D01 


UC2-NormColon 


5858 


469766 


1 564.M2 1 .gz43_296884 


F 


M00042735A-.G12 


UC2-NonnColon 


5859 


446866 


1564.D22.gz43_296891 


F 


M00042716A.F09 


UC2-NormColon | 


5860 


403949 


1564.M23.gz43_296916 


F 


M00042735B:A06 


UC2-NormColon 


5861 


446922 


1564.023.gz43_296918 


F 


M00042739A:G07 


UC2-NormColon 


5862 


235874 


1564.C24.gz43_296922 


F 


M00042714D:E11 


UC2-NormColon 


5863 


558301 


168 1 .AO 1 .gz43_296936 


F 


M00055739A:C09 


UC2-NonnColon 


5864 


426366 


1 68 1 .10 1 .gz43_296944 


F 


M00055749D:B07 


UC2-NonnColon 


5865 


638943 


168 1 JO 1 .gz43_296945 


F 


M00055750C:H10 


UC2-NonnColon I 


5866 


645603 


168 1 .00 1 .gz43_296950 


F 


M00055758C:H10 


UC2-NormColon 


5867 


642644 


1681.E02.gz43_296956 


F 


M00055744C:D02 


UC2-NormColon 


5868 


644105 


168 1 J02.gz43_296960 


F 


M00055749D:C01 


UC2-NormColon 


5869 


468613 


1681 . A03 .gz43_296968 


F 


M00055739B:B06 


UC2-NormColon 


5870 


648905 


168 1 .B03 .gz43_296969 


F 


M00055740CA07 


UC2-NonnColon 


5871 


446925 


1681 .H03 .gz43_296975 


F 


M00055748C:C07 


UC2-NormColon 


5872 


649356 


1681 .103 .gz43_296976 


F 


M00055749D:D06 


UC2-NormColon 


5873 


510724 


168 1 .K03.gz43_296978 


F 


M00055752C:C06 


UC2-NonnColon 


5874 


446676 


1681.L03.gz43_296979 


F 


M00055753D:C06 


UC2-NormColon 


5875 


644937 


1681.B04.gz43_296985 


F 


M00055740C:E06 


UC2-NormColon 


5876 


644342 


1681.104.gz43_296992 


F 


M00055749D:F12 


UC2-NormColon 


5877 


640264 


168 1 .004.gz43_296998 


F 


M00055758D:E02 


UC2-NormColon 


5878 


463951 


1681.D05.gz43_297003 


F 


M00055743C:D12 


UC2-NormColon 


5879 


644345 


1 68 1 .K05 .gz43_2970 10 


F 


M00055752C.F06 


UC2-NormColon 


5880 


646552 


1681.N06.gz43_297029 


F 


M00055757B:B07 


UC2-NormColon 
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Table 2 



SEQ 
ID 
NO 


pLUSTEli 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


5881 


611927 


168 1 ,K07.gz43_297042 


F 


M00055752CH07 


T JC2-NormCo1on 


5882 


639420 


1681 .N07.gz43 J297045 


F 


M00055757BC04 


I JC2-NorniColon 


5883 


568467 


1681.A08.gz43 297048 


F 


M00055739B-H08 


I JC2-NormCol on 


5884 


645147 


1681.C08.gz43 297050 


F 


M00055742BH06 

AVX.Y/V/ W«/«/ /TLJJ.llvU 


I TCl-NormCoInn 


5885 


447936 


1681.E08.gz43 297052 


F 


M00055744DA1 1 

i»lvv V/ w' # 11 X^ »X X X X 


IIC2-NormCo1on 


5886 


492917 


168 1 .008.gz43_297062 


F 


M00055759AB02 


UC2-NormColon 


5887 


649427 


1681.D09.gz43 297067 


F 


M00055743DD04 

lTXV/V/Vv'*/ / W mm* Mm** m m\mf 


UC2-NormColon 


5888 


556654 


1681.F09.gz43 297069 


F 


M00055745DF11 

ATX WW m*r+*r J T —J JK~J .X X X 


UC2-NormColon 


5889 


594040 


168LH09.gz43_297071 


F 


M00055748DC03 


UC2-NormColon 


5890 


488432 


1681 . J09 .gz43_297073 


F 


M00055751A:F06 


UC2-NormColon 


5891 


641144 


168 1 .K09.gz43_297074 


F 


M00055752D:C01 


UC2-NormColon 


5892 


550223 


168 1 .L09.gz43_297075 


F 


M00055754A:E07 


UC2-NormColOxi 


5893 


491544 


1681.M09.gz43 297076 


F 


M00055755DC09 


UC2-NormColon 


5894 


648481 


1681A10ez43 297080 


F 


M00055739CD11 


UC2-NormColon 

A ^ V/A AAA\—»V/AWAA 


5895 


150839 


1681.B10.gz43 297081 


F 


M00055740DG12 


UC2-NormColnn 

A ^ V/A AAAV^VAVJAA 


5896 


644053 


1681 F 10 az43 297085 


F 


M0005 5746A- C09 


V-/ V^^« A ^1 V/A AAAV^v/AwAx 


5897 


644342 


1681K10ez43 297090 


F 


M0005 5752DF0 1 


T TP 9 -NnrmPnlon 

V/ V^^r A ~ V/A AUV/UlUli 


5898 


419479 


1681.Cll.gz43 297098 


F 


M00055742C-C01 


UC2-NormColon 

V A l V/A J A A V-xV/AV/AA 


5899 


645000 


1681 Ell ez43 297100 


F 


M0005 5 744D F03 


I JC2-NonnCnloii 

»J V/A. A> V/l AAAV-^V^lvJAA 


5900 


478833 


1681 Lll ez43 297107 


F 


M00055754AH06 


VJ V_xZ» A ^ V/A AAA V_/ V/AV/AA 


5901 


644684 


1681 A13 ez43 297128 


F 


M00055739DB12 


T TC? -NTntTiiPnl nn 

V/ V/X 1> V/A AA1V_/V/Av/AA 


5902 


640504 


1681.B13.gz43 297129 


F 


M00055741AD09 


UC2-NormColfin 

V** V-x w A 1 V/A A A A V/ V/A V/AA 


5903 


463815 


1681 A14 ez43 297144 


F 


M00055739DC03 


11(^2^01111001011 

W V/X A N V/A A A A V>U A\J A A 


5904 


640997 


1681 B15 ez43 297161 


F 


M00055741BB12 


UC2-NormColon 

KJ V/A A ^ V/AA AAVx V/A V/AA 


5905 


642263 

V~ mm* *m* \S 


1681 D15 sz43 297163 


F 


M00055744A-B04 


T JP^-NnrmPnlon 

V/V/Z* A >V/A AAAV_/V/Av/AA 


5906 


451508 


1681.P15.gz43 297175 


F 


M00055761AE06 


UC2-NormPolon 

\«/^r A ^1 V/A AAA Va^V/A V/AA 


5907 


644919 


1681 A16 ez43 297176 

X W V* X .X X X V/ »tj f *W *J X« / X / V/ 


F 


M00055739DE04 


UP2-NoTmPolon 

w V^X^ A^l V/A AAA V_^ V/A V/AA 


5908 


639703 


1681 K16 ez43 297186 


F 


M00055753AD04 


UP2-NormColon 

w V^x» A ^1 V/A A A lv^ V/A V/AA 


5909 


639750 


1681M16ez43 297188 

X V# V X tlTX X a CT*m\m\ Y mm* *T § X *J V* 


F 


M00055756CC03 


UP2-NormPolon 

V-/ V/X A ~ V/AA A A V^ V/A V/A A 


5910 


587696 


1681.F17.gz43 297197 


F 


M00055746CF06 


UC2-NormColon 

V-> V/i- A ^ w A lliV/UlVJU 


5911 


461917 


1681.H17.gz43 297199 


F 


M00055749BC10 


UC2-NormColon 

V V/X XiV/llllV/VlUJUl 


5912 


644510 


1681.I17.gz43 297200 


F 


M00055750BH01 


UC2-NoiAAiColon 

V-/ ^_/X^ X ^ V/A X UV/ v/xv/xx 


5913 


526459 


1681.B18.gz43 297209 


F 


M00055741CA09 


UC2-NormColon 

V-/ VX A ™ V/A AA A V-^ V/A V/AA 


5914 


471277 


1 68 1 .F 1 8 .gz43_2972 1 3 


F 


M00055746CF10 


UC2-NormColoii 

W/ V^X> X ^ wX ltlV/VlVJU 


5915 


483042 


1681 K18 ez43 297218 

X X • X B» X l/^CMT*/ 4mW *r i mmmr X 


F 


M00055753BA02 


UC2-NormColoii 

W V_>X< A ^ V/A AAA Vv V/A V/AA 


5916 


514838 


1681.L18.gz43__297219 


F 


M00055755A-.B11 


UC2-NormColon 


5917 


643984 


1681.N18.gz43_297221 


F 


M00055757D:B05 


UC2-NormColon 


5918 


462245 


1681.A19.gz43_297224 


F 


M00055740A:B03 


UC2-NormColon 


5919 


640017 


1681.D19.gz43_297227 


F 


M00055744B:B02 


UC2-NormColon 


5920 


453726 


1681.E19.gz43_297228 


F 


M00055745A:H02 


UC2-NormColon 


5921 


644468 


1681.F19.gz43__297229 


F 


M00055746C-.G06 


UC2-NormColon 


5922 


644105 


1681.H19.gz43_297231 


F 


M00055749C:B03 


UC2-NormColon 
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Table 2 



SEQ 
ID 
NO 


XUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


5923 


447485 


1681.K19.gz43_297234 


F 


M00055753B:A06 


UC2-NormColon 


5924 


218416 


1681.D20.gz43 297243 


F 


M00055744B:C08 


UC2^ormColon 1 


5925 


489040 


1681.H20.gz43 297247 


F 


M00055749C:C04 


UC2-NoraiColon 


5926 


649702 


1 68 1 . J20.gz43_297249 


F 


M00055752A:E10 


UC2-NormColon 


5927 

•J Z? 1 


643142 


1681.A21.gz43 297256 


F 


M00055740B:C06 


UC2-NormColon 


5928 


448251 


1681 G21 ez43 297262 


F 


M00055748A:D07 


UC2-NormColoii 


5929 


644461 


1681.N21.gz43 297269 


F 


M00055758A:G02 


UC2-NormColon 


5930 


641968 


1681.P22.gz43 297287 


F 


M00055761B:F09 


UC2-NonriColon 


5931 

*J S ~s X 


470667 


1681 .E23 .gz43_297292 


F 


M00055745B:H02 


UC2-NormColon 


5932 

•J Zs -J 


644479 


1681.J23.gz43 297297 


F 


M00055752A:G10 


UC2-NormColon 


5933 


463824 


1681.A24.gz43 297304 


F 


M00055740B:F09 


UC2-NormColon 


5934 


389377 


1681.C24.gz43 297306 


F 


M00055743B:E01 


UC2-NormColon 


5935 


468959 


1681.E24.gz43 297308 


F 


M00055745C:A06 


UC2-NormColon 


5936 


553675 


1681 .M24.gz43_2973 1 6 


F 


M00055757A:A07 


UC2-NormColon 


5937 


449978 


1561 .CO 1 .gz43_3 14395 


F 


M00042344D:F03 


UC2-NonnColon 


5938 

yj Z7 J t» 


463920 


1561 .GO 1 .gz43_3 14399 


F 


M00042526D:A05 


UC2-NonnColon \ 


5939 


456267 


1 5 6 1 JO 1 .gz43_3 14402 


F 


M00042531B:G12 


UC2-NormColon 


5940 


459997 


1561 M01 ez43 314405 


F 


M00042537A:D12 


UC2-NormColon 


5941 


447386 


1561.I02.gz43 314417 


F 


M00042529D:D07 


UC2-NormColon 


5942 


468257 


1561.K02.gz43 314419 


F 


M00042533B:F11 


UC2-NormColon 


5943 


18786 


1561 E03 sz43 314429 


F 


M00042523C:H06 


UC2-NormColon 


5944 

*J Z? T I 


447494 


1561A04ez43 314441 


F 


M00042341A:D08 


UC2-NormColon 


5945 


459893 


1561 B04 ez43 314442 


F 


M00042343B:D05 


UC2-NormColon 


5946 


462558 


1561 M04 ez43 314453 


F 


M00042537A:G09 


UC2-NormColon 


5Q47 


458425 


1561 O04 ez43 314455 


F 


M00042541A:B07 


UC2-NormColon 


5Q48 


249178 


1561 C05 ez43 314459 


F 


M00042345B:A05 


UC2-NonnColon 


5949 


462815 


1561 J05 ez43 314466 


F 


M00042531D:G08 


UC2-NormColon 


5950 


417291 


1561K05ez43 314467 


F 


M00042533C:F04 


UC2-NormColon 


5951 


463341 


1561 M05 ez43 314469 


F 


M00042537A:H04 


UC2-NormColon 


5952 


447868 


1561 A06 ez43 314473 


F 


M00042341A.H04 


UC2-NonnColon 


5953 


461653 


1561 B06 ez43 314474 


F 


M00042343B:F12 


UC2-NormColon 


5954 


465127 


1561 K06 ez43 314483 


F 


M00042533D:B05 


UC2-NormColon 


5955 


460246 


1561 N06 ez43 314486 


F 


M00042539B:D09 


UC2-NonnCoIon 


5956 


33139 


1561 006 az43 314487 


F 


M00042541A:E06 


UC2-NormColon 


5957 


457508 


1561N07ez43 314502 


F 


M00042539C:A04 


UC2-NormColon 


5958 


461917 


1561. B09.gz43_3 14522 


F 


M00042343D:F08 


UC2-NormColon 


5959 


447568 


1561.H09.gz43_314528 


F 


M00042528D:D09 


UC2-NormColon 


5960 


467094 


1561. J09.gz43_3 14530 


F 


M00042532A:D08 


UC2-NormColon 


5961 


447087 


156 l.N09.gz43_3 14534 


F 


M00042539C:H08 


UC2-NoimColon 


5962 


458419 


1561.O09.gz43_314535 


F 


M00042541B:B05 


UC2-NormColon 


5963 


446952 


1561.D10.gz43__314540 


F 


M00042522C:H10 


UC2-NonnColon 


5964 


470513 


1561.H10.gz43 314544 


F 


M00042528D:H03 


UC2-NonnColon 
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Table 2 



SEQ 
ID 
NO 


CLUSTEB 


SEQ NAME | 


ORIE 
NT 


CLONE ID 


LIBRARY 


5965 


459974 


1561.M10.gz43_314549 


F 


M00042537C:D04 


UC2-NormColon 


5966 


1454 


1561.N10.gz43 314550 


F 


M00042539DA04 


UC2-NormC'olon 


5967 


464510 


1561 .Dl Lgz43_3 14556 


F 


M00042522D:A08 


UC2-NormColon 


5968 


457846 


156LGll.gz43_3 14559 


F 


M00042527B:B01 


UC2-NormColon 


5969 


15296 ; 


1561.11 l.gz43 314561 


F 


M00042530OH02 


UC2-NormColon 

»— ' V_^^ A ^VyAAll«^sVSAV/lX 


5970 


446578 


1561.C12.gz43 314571 


F 


M00042345D:D04 


UC2-NormColon 


5971 


463086 


1561.F12.gz43 314574 


F 


M00042525C:H07 


UC2-NormColon 


5972 


464287 


1561.I12.gz43 314577 


F 


M00042530D:A02 


UC2-NoimColoiA 


5973 


447805 


1 56 l.A14.gz43_3 14601 


F 


M00042341D:G11 


UC2-NormColon 


5974 


446188 


156 1 .G14.gz43_3 14607 


F 


M00042527C:A10 


UC2-NormColon 


5975 


468565 


1561.I14.gz43 314609 


F 


M00042530DF09 


UC2-NormColon S 


5976 


470600 


1561.J14.gz43 314610 


F 


M00042532A:H03 


UC2-NormColon 

V_>^-r A 1 Villi WVlVil 


5977 


457772 


1561.A15.gz43 314617 


F 


M00042342A*B04 


UC2-NormColon 

' A ^ V/A AIA^^V^AVaIA 


5978 


446383 


1561.E15.gz43 314621 


F 


M00042524CC12 


UC2-NormColon 

x 1 wixix v< vivix 


5979 


446673 


I561.F15.gz43 314622 


F 


M00042525D*E01 


UC2-NormColon 

x^^j a ^ uimvuivxi 


5980 


462779 


1561.L15.gz43 314628 


F 


M00042536BG08 


UC2-NormColon 

V-' V^^# A 1 V/llllVyvlvll 


5981 


446289 


1561.C16.gz43 314635 


F 


M00042521A:E10 


UC2-NonriColon 

V-^^* 1 i \ J1 III. Vy vl VIA 1 


5982 


458146 


1561.K16.gz43 314643 


F 


M00042534B:B08 


UC2-NormColon 

V-/ V>^rf A ^ V/X 111 V/V/Xv/lX 


5983 


447690 


1561.L16.gz43 314644 


F 


M00042536C:A06 


UC2-NormColon 

V> A 1 V* 11X V/V1VU 


5984 


452888 


1561.N16.gz43 314646 


F 


M00042540B-B03 


UC2-NormColon 

V*' V— A ™ vtXIXWIVll 


5985 


461327 


1561.B17.gz43 314650' 


F 


M00042344B :F02 


UC2-NormColon 

A ^ V/X 11 A Wl V/Xl 


5986 


456958 


1561.E17.gz43 314653 


F 


M00042524D:A10 


UC2-NormColon 


5987 


467901 


1561.F17.gz43 314654 


F 


M00042526AE10 


UC2-NormColon 

V*' A 1 YA1 lllv/VXV/XX 


5988 


459981 


1561.K17.gz43 314659 


F 


M00042534B-D06 


UC2-NonnColoAi 


5989 


457218 


1561N17ez43 314662 


F 


M00042540C*A12 


UC2-NormColon 


5990 


463008 


1561A18sz43 314665 


F 


M00042342CH03 


UC2-NormColon 

V-* V^A» A ^1 V/l lllVvvlVll 


5991 


446341 


1561 LI 8 ez43 314676 


F 


M00042536CB03 


UC2-NormColon 


5992 


446389 


1561.018.gz43 314679 


F 


M00042542Ad 1 


S UC2-NormColon 

V-/ V_> X«r A 1 V/A llXV/vlvU 


5993 


447096 


1561 P18 ez43 314680 

A»^V7A..M. X Kf »f_j' ■ \J VJ V 


F 


M00042543B-H12 


UC2-NormColon 


5994 


446984 


1561 B19 ez43 314682 


F 


M00042344B HI 2 


UC2-NormColon 


5995 


468767 


1561 F19 ez43 314686 

a.-'va.a x *y. — / X isj U \-» 


F 


M00042526A-F 1 1 


1 JC^-NnrtnCnl on 


5996 


467884 


1561 H19 ffz43 314688 


F 


M00042529B-E03 


T JC2-NormCo1 on 


5997 


459881 


1561 A20 ffz43 314697 


F 


M00042342D-D03 


UC2-NormColon 


5998 


447027 


1561 D20 ez43 314700 


F 

X 


M00042523B-H01 

iViUUUTl,JI,JJJ .A AV/ A 


T IC^-NormColon 


5999 


447380 


1561 F20 p?43 314707 


F 

X 


M000425 26B • C 1 2 

1VXV/ W \JT£. U Z*\JXJ . X Z» 


T TC ? -IMorm Col on 

a ^ L/l liiv^ V-Jlvll 


6000 


446304 


1561.021.gz43_314727 


F 


M00042542B:B01 


UC2-NormColon 


6001 


447072 


156 LC22.gz43_3 14731 


F 


M00042521D:G09 


UC2-NormColon 


6002 


379335 


1561.122.gz43_314737 


F 


M00042531B:D12 


UC2-NonnColon 


6003 


228873 


1561. L22.gz43_3 14740 


F 


M00042537A:A07 


UC2-NormColon 


6004 


446663 


1561.D23.gz43_314748 


F 


M00042523C:E08 


UC2-NormColon 


6005 


464791 


1561.F23.gz43_314750 


F 


M00042526C:B12 


UC2-NormColon 


6006 


446595 


1561.K23.gz43_314755 


F 


M00042534D:D10 


UC2-NoimColoii 
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Table 2 



SEQ 
ID 
NO 


CLUSTEF 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


6007 


446439 


1561. 023.gz43_3 14759 


F 


M00042542B:C11 


UC2-NormColon 


6008 


458979 


1561. A24.gz43_3 14761 


F 


M00042343A.C05 


1 UC2-NormColon 


6009 


446620 


1561.C24.gz43_314763 


F 


M00042522A:A05 


UC2-NormColon 


6010 


452507 


1561. F24.gz43_3 14766 


F 


M00042526D:A02 


UC2-NormColon 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRBP 


P VALUE 


1 


AJ251957 


Ciona intestinalis mRNA for nuclear lamin (lamin L2 gene) 


2.2 


2 


AF133812 


Strongylocentrotus pallidus isolate pR31 bindin gene, partial cds 


0.88 | 


3 


AF051651 


Homo sapiens squalene synthase gene, intron 3, 5' end 


0.0001 


4 


NM 016701 


Mus musculus nestin (Nes), mRNA 


0.24 


5 


Y09000 


Rnorvegicus mRNA for dendrin 


0.11 


6 


U67519 


Methanococcus jannaschii section 61 of 150 of the complete genome 


6.1 


7 


AK005889 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone: 17000 12B08, full insert sequence 


e-139 


8 


AB029343 


Homo sapiens HCR (a-helix coiled-coil rod homologue) gene, 

complete cds ; 


0.023 


9 


AB017505 


Homo sapiens mRNA for Nori-2p, complete cds 


0 


10 


AY024246 


Oryza sativa microsatellite MRG6571 containing (TCTA)X6, 
genomic sequence 


6.3 


11 


U32309 


Acipenser fulvescens Afl4 cytochrome b gene, partial cds, tRNA-Thr, 
tRNA-Pro and partial D-loop, mitochondrial genes encoding 
mitochondrial products 


6.3 


12 


XM 046070 


Homo sapiens KIAA0853 protein (KIAA0853), mRNA 


3E-48 


13 


AK001945 


Homo sapiens cDNA FLJ1 1083 fis, clone PLACE1005232 


2.2 


14 


AF1 11457 


Meleagris gallopavo clone TUCA1141 microsatellite sequence 


0.076 


15 


AK018499 


Mus musculus adult male colon cDNA, RIKEN full-length enriched 
library, clone:9030409O13, full insert sequence 


0 


16 


AE002648 


Drosophila melanogaster genomic scaffold 142000013385436, 
complete sequence 


2.2 


17 


AC001011 


Homo sapiens (subclone l_e5 from PI H43) DNA sequence, complete 

sequence 


0.003 


18 


XMJ)39942 


Homo sapiens Meis (mouse) homolog 3 (MEIS3), mRNA 


0.75 


19 


AF099003 


Caenorhabditis elegans cosmid Y59C2A 


0.75 


20 


U92972 


Mus musculus protease-activated receptor 3 (PAR3) mRNA, complete 

cds 


0.08 


21 


AE003935 


Xylella fastidiosa 9a5c, section 81 of 229 of the complete genome 


0.067 


22 


AB031009 


Homo sapiens DNA, MHC class I CL region, 7. 1 ancestral haplorype 


0.019 


23 


X94207 


H.sapiens TPR gene (1365bp) 


4.8 


24 


AK025678 


Homo sapiens cDNA: FLJ22025 fis, clone HEP08518 


' 2.2 


25 


AKO 14552 


Mus musculus 0 day neonate skin cDNA, RIKEN full-length enriched 
Hbrary, clone:4632404O06, full insert sequence 


e-176 


26 


XMJ)44013 


Homo sapiens kinesin-like protein 2 (hklp2), mRNA 


0 


27 


NM_032993 


Homo sapiens nucleolar protein family A, member 1 (H/ACA small 
nucleolar RNPs) (NOLA1), transcript variant 2, mRNA 


4E-50 


28 


NM_032315 


Homo sapiens hypothetical protein MGC4399 (MGC4399), mRNA 


0 


29 


AK020701 


Mus musculus 6 days neonate skin cDNA, RIKEN full-length 
enriched library, clone: A030009B12, full insert sequence 


0.0000002 


30 


! AK021717 


Homo sapiens cDNA FLJ11655 fis, clone HEMBA1004554 


0 
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Table 3A Nearest Neighbor (BlastN vs. Geribank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


31 


AL5 12453 


Human DNA sequence from clone CTC-1516K22 on chromosome 6, 
complete sequence [Homo sapiens] 


0.00004 


32 


XM 005034 


Homo sapiens hypothetical protein FLJ10511 (FLJ10511), mRNA 


2E-22 


33 


AF3 18896 


Cricetulus griseus beta-l,4-galactosyltransferase 1 (beta4GalT-l) 
mRNA, complete cds 


0.78 


34 


D29757 


Human gene for cytochrome P-450 aromatase, exon 1 (brain specific) 


3E-24 


35 


AJ3 11520 


Ostreopsis ovata 5.8S rRNA gene, internal transcribed spacer 1 (ITS1) 
and internal transcribed spacer 2 (ITS2), strain Al > 


0.79 


36 


XM 038521 


Homo sapiens phosphatidylserine decarboxylase (PISD), mRNA 


0 


37 


AK023043 


Homo sapiens cDNA FLJ12981 fis, clone NT2RP2006454 


3E-96 


38 


XM_045520 


Homo sapiens KIAA1595 protein (KIAA1595), mRNA 


4E-16 


39 


AB017505 


Homo sapiens mRNA for Nori-2p, complete cds 


0 


40 


AK002682 


Mus musculus adult male kidney cDNA, RDCEN full-length enriched 
library, clone:0610027B03, full insert sequence 


e-138 


41 


NC_001712 


Locusta migratoria mitochondrion, complete genome 


0.009 


42 


AL512453 


Human DNA sequence from clone CTC-1516K22 on chromosome 6, 
complete sequence [Homo sapiens] 


0.00001 


43 


AK002682 


Mus musculus adult male kidney cDNA, RIKEN full-length enriched 
library, clone:0610027B03, full insert sequence 


e-125 


44 


XM_039088 


Homo sapiens KIAA0914 gene product (KIAA0914), mRNA 


0.76 


45 


U30291 


Schistosoma mansoni synaptobrevin-like protein gene, exon 2 and 

partial cds 


0.69 


46 


XM 052543 


Homo sapiens mitochondrial ribosomal protein L42 (MRPL42), 

mRNA 


1E-26 


47 


AK026225 


Homo sapiens cDNA: FLJ22572 fis, clone HSI023 13 


0.0008 


48 


XM_043007 


Homo sapiens hypothetical gene supported by AK021969 
(LOC92132), mRNA 


1E-99 


49 


M18824 


P.faciparum S antigen gene, complete cds 


0.003 


50 


XM_009311 


Homo sapiens zinc finger protein 304 (ZNF304), mRNA 


0.003 


51 


AK026697 


Homo sapiens cDNA: FLJ23044 fis, clone LNG02454 


0 


52 


XM_035597 


Homo sapiens hypothetical protein AF140225 (AF140225), mRNA 


1E-98 


53 


U81366 


Plasmid Rtsl killer protein and antidote protein genes, complete cds 


0.26 | 


54 


U28921 


Phaseolus vulgaris ATPase gamma subunit mRNA, nuclear gene 
encoding mitochondrial protein, partial cds 


0.27 


55 


AF305083 


Homo sapiens alpha(l,3)-fucosyltransferase IV (FUTIV) gene, 3* UTR 


0.031 


56 


D29757 


Human gene for cytochrome P-450 aromatase, exon 1 (brain specific) 


7E-20 


57 


XM 038048 


Homo sapiens similar to APICAL ENDOSOMAL GLYCOPROTEIN 
PRECURSOR (R. norvegicus) (LOC91372), mRNA 


0.0004 


58 


AK026860 


Homo sapiens cDNA: FLJ23207 fis, clone ADSE00968 


8E-12 
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Table 3A Nearest Neighbor (BlastN vs. Geribank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


59 


BC010849 


Homo sapiens, Similar to hypothetical protein FLJ20561, clone 
MGC:8851 IMAGE:3878776, mRNA, complete cds 


0 


60 


XM_036092 


Homo sapiens signal recognition particle 68kD (SRP68), raRNA 


0 


61 


XM 003184 


Homo sapiens zinc finger protein ANCJ2H01 (LOC51193), mRNA 


0 


63 ; 


Z49806 


B.taums mRNA for adenylyl cyclase type VII 


6.8 


64 


XMJH8858 


Homo sapiens hypothetical protein FLJ23384 (FLJ23384), mRNA 


0 


65 


NM 032315 


Homo sapiens hypothetical protein MGC4399 (MGC4399), mRNA 


0 


66 


AB057594 


Homo sapiens GSTT1 gene for glutathione S-transferase TT1, 
complete cds 


3E-69 


67 


XM 035597 


Homo sapiens hypothetical protein AF140225 (AF140225), mRNA 


2E-75 


68 \ 


XM 035597 


Homo sapiens hypothetical protein AF140225 (AF140225), mRNA 


1E-71 


69 


AB017505 


Homo sapiens mRNA for Nori-2p, complete cds 


0 


70 


AK010143 I 


Mus musculus adult male tongue cDNA, RIKEN full-length enriched 
library, clone:2310074B19, full insert sequence 


1.7 


71 


AK008666 


Mus musculus adult male stomach cDNA, RIKEN full-length 
enriched library, clone:22 10008 A03, full insert sequence 


e-173 


72 


L18785 


Plasmodium falciparum DNA polymerase alpha gene, complete cds 


0.007 


73 


XMJ)03327 


Homo sapiens hypothetical protein FLJ10858 (FLJ10858), mRNA 


0 


74 


AK004786 


Mus musculus adult male lung cDNA, RIKEN full-length enriched 
library, clone: 12000 15F23, full insert sequence 


2E-11 


75 


Z29969 


E. histolytica plasmid genes for ribosomal RNA and hemolysins 
HLY1, HLY5mcl HLY5mc2 HLY4 


0.028 


76 


AK024945 


Homo sapiens cDNA: FU21292 fis, clone COLO 1969 


0 


77 


Y15931 


Homo sapiens CTNS gene, exon 10 and flanking intronie regions 


0.78 


78 


BC000610 


Homo sapiens, hypothetical protein FLJ10986, clone MGC:908 
IMAGE:3347108, mRNA, complete cds 


0 


79 


XM 050424 


Homo sapiens KIAA1244 protein (KIAA1244), mRNA 


0.75 


80 


NC 001807 


Human mitochondrion, complete genome 


1E-36 


81 


AF360266 


Arabidopsis thaliana unknown protein (F28C11.8) mRNA, complete 
cds 


6.8 


82 


BC000225 


Homo sapiens, Similar to hypothetical protein FLJ10656, clone 
1MAGE:3350951, mRNA 


0 


83 


XMJ)34107 


Homo sapiens hypothetical protein FLJ20758 (FLJ20758), mRNA 


7E-32 


84 


AF305816 


Homo sapiens PRO0633 mRNA, complete cds 


0.67 


85 


AF196678 


HIV-1 isolate 96RW34 from Rwanda, gag protein (gag) gene, partial 
cds 


2.1 


86 


AK024927 


Homo sapiens cDNA: FU21274 fis, clone COLO 1781 


6E-75 


87 


X77394 


Rphilodendra mitochondrial DNA repeat unit 


0.23 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN : 


DESCRIP 


P VALUE 


88 


U41203 


Human short tandem repeat polymorphism UM32, D17S1333 


0.0001 


89 


AC015937 


Hfomo sapiens chromosome 17, clone CTD-2542F12, complete 
sequence 


0.65 


90 


AF245117 


Mus musculus leukocyte cell-surface molecule (Ly9) gene, exon 1 


0.74 


91 


XM 001412 


Homo sapiens metal-regulatory transcription factor 1 (MTF1), mRNA 


2E-50 


92 


AE006690 


Sulfolobus solfataricus section 49 of 272 of the complete genome 


0.26 


93 


AJ292466 


Homo sapiens mRNA for WDR9 protein (WDR9 gene), form B | 


9E-44 


94 


NM 013983 


Homo sapiens neuregulin 2 (NRG2), transcript variant 4, mRNA 


0.27 


95 


J01390 


Emericella nidulans mtDNA between h2/h5 and bh2/b2 junctions, 
genes for ATPase subunit 6, cytochrome oxidase subunit 3, seven, 
unidentified proteins, twentyfour tRNA's and L-rRNA 


2.2 


96 


AP000384 


Arabidopsis tMianfl genomic DNA, chromosome 3, PI clone:MCE21 


0.22 


97 


AE001397 


Plasmodium falciparum chromosome 2, section 34 of 73 of the 
complete sequence 


2.4 


98 


AE000192 


Escherichia coli K12 MG1655 section 82 of 400 of the complete 
genome 


0.003 


99 


AF3 16828 


Schistosoma mansoni zinc finger protein SmZFl gene, complete cds 


1.7 


100 


BC007942 


Homo saoiens nucleolar autoantisen (55kD) similar to rat 
synaptonemal complex protein, clone MGC: 14267 IMAGE:4130726, 
mRNA, complete cds 


2E-59 


101 


L35301 


Homo sapiens nucleoside diphsophate kinase A (nm23-Hl) gene 
fragment 


0.003 


102 


AF1 14927 


Saccharomyces pastorianus CBS1538 small subunit ribosomal RNA 
gene, mitochondrial gene for mitochondrial RNA, complete sequence 


0.27 


103 


| AF299248 


Drosophila melanogaster talin mRNA, complete cds 


1.8 | 


104 


BC003191 


Homo sapiens, GL004 protein, clone MGC:895 IMAGE:3502929, 
mRNA, complete cds 


2E-10 


105 


U29926 


Human AMP deaminase (AMPD3) gene, exon 14 and 15, and 
complete cds 


0.06 


106 


BC003191 


Homo sapiens, GL004 protein, clone MGC:895 IMAGE:3 502929, 
mRNA, complete cds 


0 


107 


AK022127 


Homo sapiens cDNA FLJ12065 fis, clone HEMBB1002249 


0.0008 


108 


D16360 


Human DNA for plasma glutathione peroxidase, exon 1 


0.091 


109 


AK014138 


Mus musculus 13 days embryo head cDNA, RIKEN full-length 
enriched library, clone:3110038All, full insert sequence 


6 


110 


XM 006914 


Homo sapiens cold shock domain protein A (CSDA), mRNA 


0.079 


111 


U31283 


Mesotaenium caldariorum clone mesphyla photochrome gene, partial 
cds, exons 4 through 10 


0.56 


112 


XM 049935 


Homo sapiens hypothetical protein FU14950 (FLJ14950), mRNA 


0 
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113 


AE001329 


Chlamydia trachomatis section 56 of 87 of the complete genome 


0.75 


114 


XM 037830 


Homo sapiens OLF-1/EBF associated zinc finger gene (KIAA0760), 
mRNA 


0.25 


115 


AE006368 


Lactococcus lactis subsp. lactis EL1403 section 130 of 218 of the 
complete genome 


0.0003 


116 


XM_049935 


Homo sapiens hypothetical protein FLJ14950 (FLJ14950), mRNA 


0 


117 


XM 032533 


Homo sapiens hypothetical protein FLJ12787 (FLJ12787), mRNA 


0 


118 


AJ309861 


Homo sapiens partial mRNA for putative protein kinase WNK4 
(PRKWNK4 gene) 


9E-76 


119 


BC005173 


Homo sapiens, KIAA0255 gene product, clone IMAGE:3507918, 
mRNA 


2E-38 


120 


AF068758 


Drosophila melanogaster SIR2 (Sir2) mRNA, complete cds 


0.083 


121 


AK017307 


Mus musculus 6 days neonate head cDNA, RIKEN full-length 
enriched library, clone:5430414B19, full insert sequence 


0.003 


122 


XM_036208 


Homo sapiens sphingosine kinase 1 (SPHK1), mRNA 


2.1 


123 


XM_039960 


Homo sapiens PC3-96 protein (PC3-96), mRNA 


0 


124 


XM_O39306 


Homo sapiens hypothetical protein FLJ20400 (FLJ20400), mRNA 


0.08 


125 


XM 049935 


Homo sapiens hypothetical protein FLJ14950 (FLJ14950), mRNA 


0 


126 


AE006080 


Pasteurella multocida PM70 section 47 of 204 of the complete genome 


2.1 


127 


XM_009518 


Homo sapiens WNT1 inducible signaling pathway protein 2 (WISP2), 
mRNA 


0.16 


128 


U36263 


Homo sapiens beta-prime-adaptm (AP1B1) gene, exon 15 


0.022 


129 


AF267996 


Homo sapiens muscle-specific protein (C4orf5) gene, intron 3 


0.34 


130 


AF131859 


Homo sapiens clone 24923 mRNA sequence 


1.8 


131 


L48479 


Homo sapiens (subclone 6_hl from PI H21) DNA sequence 


0.01 


132 


XM_018317 


Homo sapiens ubiquitination factor E4B (homologous to yeast UFD2) 
(UBE4B), mRNA 


1.6 


133 


XM 008267 


Homo sapiens solute carrier family 16 (monocarboxylic acid 
transporters), member 6 (SLC16A6), mRNA 


0.23 


134 


AF162923 


Beet soil-borne virus RNA3 isolate 6b 22K protein gene, partial cds 


6.6 


135 


NM 010053 


Mus musculus distal-less homeobox 1 (Dlxl), mRNA 


2.5 


136 


AF257303 


Mus musculus synaptotagmin II (Syt2) gene, complete cds 


0.85 


137 


Z92837 


Caenorhabditis elegans cosmid R03E1, complete sequence 


0.027 


138 


AF258528 


Loligo pealei phospholipase C mRNA, complete cds 


0.74 


139 


M65252 


Bacillus thuringiensis alesti delta endotoxin gene, complete cds 


0.075 


140 


M14954 


D.melanogaster (W-IR1 mutation) I factor DNA, complete cds 


0.053 


141 


Z48636 


C.novyi gene for alpha-toxin 


0.026 


142 


XM_036491 


Homo sapiens hypothetical protein MGC3771 (MGC3771), mRNA 


0.002 
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143 


AK020198 


Mus musculus 15 days embryo male testis cDNA, RIKEN fiill-length 
enriched library, clone:8030442B05, full insert sequence 


0.53 


144 


AJ404228 


Saccharomyces douglasii mitochondrial tRNA gene cluster 


0.022 


145 


AJ297708 


Rattus norvegicus RT6 gene for T cell differentiation marker RT6.2, 
exons 1-8 


0.024 


146 


AK026697 


Homo sapiens cDNA: FLJ23044 fis, clone LNG02454 


1.9 


147 


AE001370 


Plasmodium falciparum chromosome 2, section 7 of 73 of the 
complete sequence 


0.017 


148 


AF325208 


Acetabularia acetabulum nicotinamide nucleotide transhydrogenase 
(Nnt) gene, exons 6, 7, 8, and complete cds 


5.2 


149 


L78770 


Homo sapiens (subclone 1JF7 from PI H43) DNA sequence 


0.068 


150 


AF182214 


Glycine max glutamine synthetase gamma 2 subunit gene, 5 f -flanking 
region and partial cds 


0.2 


151 


AF359513 


Lolium rigidum clone LR1 acetyl-CoA carboxylase mRNA, partial 
cds; nuclear gene for plastid product 


0.2 


152 


AE004798 


Pseudomonas aeruginosa PA01, section 359 of 529 of the complete 
genome 


2.6 


153 


NM 021699 


Rattus norvegicus serine/threonine kinase (LOC60328), mRNA 


1.8 


154 


XM 040539 


Homo sapiens interphotoreceptor matrix proteoglycan 200 
(SPACRCAN), mRNA 


2.3 


155 


AE006894 


Sulfolobus solfataricus section 253 of 272 of the complete genome 


0.027 


156 


Y14603 


Erwinia amylovora srlA, srlE, srlB, srlD, srlM and srlR genes 


0.8 


157 


AF181720 


Homo sapiens RU2AS (RU2) gene, complete cds; and RU2S (RU2) 
gene, partial cds 


0.051 


158 


XM_043815 


Homo sapiens hypothetical protein FLJ21736 (FLJ21736), mRNA 


1.1 


159 


AE006059 


Pasteurella multocida PM70 section 26 of 204 of the complete genome 


1.6 


160 


AF151097 


Homo sapiens voltage-dependent anion channel (VDAC1) gene, 
exons 8 and 9 and complete cds 


0.074 


161 


AF165136 


Mycoplasma mycoides mycoides LC hypothetical surface located 
membrane protein and lipoprotein B precursor (lppB) genes, partial 
cds 


0.23 


162 


XM_016392 


Homo sapiens hypothetical protein FLJ12768 (FLJ12768), mRNA 


2.1 


163 


AF170544 


Magnaporthe grisea vacuolar-ATPase (VATP) mRNA, complete cds 


v. id 


164 


AY045680 


Arabidopsis thaliana Atlg08350/T27G7_4 mRNA complete cds 


0.009 


165 


AF285100 


Bos taurus lysophosphatidic acid acyltransfesrase gene, complete cds 


1.6 


166 


AF194170 


Dictyostelium discoideum cudA protein gene, promotor sequence and 
partial cds 


0.039 


167 


AF242446 


Homo sapiens LINE-1 insertion dimorphism LID-3 empty site 
sequence 


0.043 



222 



INSDOCID: <WO 021 4500A2_I_> 



WO 02/14500 PCT7US01/25840 





Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


168 


AF084196 


Pan troglodytes gamma-aminobutyric acid receptor A5 subunit 
duplicated gene, 5UTR region 


0.88 


169 


AK021754 


Homo sapiens cDNA FLJ11692 fis, clone HEMBA1004983 


0 


170 


AE006002 


Caulobacter crescentus section 328 of 359 of the complete genome 


1.6 


171 


AL096734 


Homo sapiens mRNA; cDNA DKFZp434M01 1 (from clone 
DKFZp434M011) 


0.015 


172 


AF251351 


Homo sapiens 2 f -5'oligoadenylate synthetase 3 (OAS3) gene, promoter 
region 


2.7 


173 


AF061244 


Agrocybe aegerita B type DNA polymerase (Mtpol) gene, complete 
cds; tRNA-Asn gene, complete sequence; and unknown genes, 
mitochondrial genes for mitochondrial products 


0.0006 


174 


X95275 


P.falciparum complete gene map of plastid-like DNA (IR-A) 


0.1 


176 


M88599 


Entamoeba histolytica P-glycoprotein-1 (pgpl) gene, complete cds 


0.12 


177 


AY019624 


Oiyza sativa microsatellite MRG1949 containing (AT)X40, closest to 
marker R1167, genomic sequence 


0.017 


178 


NM 026040 


Mus musculus RIKEN cDNA 2810036K01 gene (2810036K01Rik), 
mRNA 


e-101 


179 


AL162040 


Homo sapiens mRNA; cDNA DKFZp434N199 (from clone j 
DKFZp434N199) | 


e-152 


180 


AJ295190 


Human coxsackievirus Al 1 genomic RNA for partial polyprotein 
gene, isolate VR-169, ATCC 


1.5 


181 


AJ298684 


Phylloxera sp. MBLM2 mitochondrial ATP6 gene for ATP synthase A 
chain subunit 6 


0.51 


183 


AF268064 


Candidatus Carsonella ruddii natural-host Calophya schini RNA 
polymerase beta subunit (rpoB) and RNA polymerase beta-prime 
subunit (rpoC) genes, partial cds 


0.012 


184 


U27374 


Human Menkes disease gene (ATP7A), exon 16 


0.51 


185 


D10767 


Epizootic haemorrhagic disease virus serotype 1 (EHDV-1) gene 
encoding VP2 protein 


0.049 


186 


AJ002256 


Mus musculus minisatellite tandem repeat (MMS73) 


0.058 


187 


AB007546 


Homo sapiens gene for LECT2, complete cds 


2E-11 


188 


AF254573 . 


Cricetulus griseus origin recognition complex subunit 2 mRNA, 
complete cds 


0.57 


189 


AK024903 


Homo saoiens cDNA* FLJ21250 fis clone COLO 1253 hiehlv similar 
to AB020527 Homo sapiens mRNA for Na/P04 cotransporter 
homolog 


e-135 


190 


AF359513 


Lolium rigidum clone LR1 acetyl-CoA carboxylase mRNA, partial 
cds; nuclear gene for plastid product 


\ 0.15 


191 


AJ237585 


Mus musculus mRNA for hypothetical protein expressed in 
thymocytes (clone MFT.M05.13/MTAB10.066), partial 


4.4 


192 


AF233591 


Mus musculus son-of-sevenless 1 (Sosl) gene, partial cds 


0.52 


193 


X81190 


M. musculus DNA flanking site of transgene insertion 


4.8 


194 


AF232246 


Tetrahymena thennophila clone Tlrl Int integrase-like protein gene, 
complete cds 


0.53 
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195 


AB034969 


Thermococcus sp. B1001 cgtB, cgtA, cgtC, cgtD, cgtE genes for 
cyclomaltodextrinase, cyclomaltodextrin glucanotransferase, 
cyclomaltodextrin binding protein, cyclomaltodextrin transport 
membrane protein, complete cds 


1.8 


196 


U95885 


Ixodes granulatus 16S ribosomal RNA gene, mitochondrial gene for 
mitochondria] RNA, partial sequence 


0.048 


198 ! 


XM 016002 


Homo sapiens LOC88174 (LOC88174), mRNA 


4 


199 


AC091698 


Homo sapiens clone RP1 1-596 J6, complete sequence 


0.000003 


200 


Z99763 


Flaveria pringlei gdcsH gene 


0.02 


201 


AB048873 


Macaca fascicularis brain cDNA, clone:QnpA-10763 


0.053 


202 


AF286367 


Homo sapiens HMGIY gene, promoter 


0.007 


203 


AB029073 


Mus musculus Kkm gene for Ser/Thr kinase KKIAMRE, exon 13, 14, 
and complete cds 


3.9 


204 


AF262582 


Dolichorhinotermes sp. Manaus' 16S mitochondrial ribosomal RNA 
gene, partial sequence; mitochondrial gene for mitochondrial product 


0.045 


205 


AL360005 


Human DNA sequence from clone RP1 1-436H17 on chromosome 
Xq26. 1-27.1. complete sequence [Homo sapiens] 


0.16 


206 


NM_019054 


Homo sapiens hypothetical protein MGC5560 (MGC5560), mRNA 


9E-35 


207 


AK014626 


Mus musculus 10 days neonate skin cDNA, RIKEN Mi-length 
enriched library, clone:4731413G05, full insert sequence 


0.82 


208 


AK027128 


Homo sapiens cDNA: FLJ23475 fis, clone HSI13659 


0.021 


209 


J00778 


Rat pancreatic trypsin I gene, complete cds 


0.016 


210 


D49412 


Human gene for interleukin 3 receptor alpha subunit, exon 11 


0.067 


211 


AY045578 


Arabidopsis thaliana AT4g37000/C7A10_J360 mRNA, complete cds 


2 


213 


AJ011013 


Cicer arietinum epicotyl EST, clone Canl33 


4.7 


214 


XM 008731 


Homo sapiens meprin A, beta (MEP1B), mRNA 


5.6 


215 


AL023830 


Caenorhabditis elegans cosmid Y26E6 A, complete sequence 


0.084 


216 


AK027136 


Homo sapiens cDNA: FLJ23483 fis, clone KAIA04052 


6.6 


217 


U03554 


Bacillus thuringiensis morrisoni EG2158 transposon Tn5401 site- 
specific recombinase (tnpl) and transposase (tpnA) genes, complete 
cds 


0.46 


218 


XM 043584 


Homo sapiens ankyrin repeat and SOCS box-containing 1 (ASB1), 
mRNA 


2.2 


219 


AE001177 


Borrelia burgdorferi (section 63 of 70) of the complete genome 


0.094 


220 


AK018148 


Mus musculus adult male medulla oblongata cDNA, RIKEN full- 
length enriched library, clone:6330408Pll, full insert sequence 


1.6 


221 


AF271148 


Botryosphaeria ribis isolate 96-131 small subunit ribosomal RNA 
gene, partial sequence; mitochondrial gene for mitochondrial product 


0.65 


222 


XM 008090 


Homo sapiens KIAA0419 gene product (KIAA0419), mRNA 


0.25 


223 


Z54541 


Rsapiens CpG island DNA genomic Msel fragment, clone 132cll, 
forward read cpgl32cl 1 .ftlb 


6.4 
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225 


XMJ)42689 


Homo sapiens fasciculation and elongation protein zeta 2 (zygin II) 
(FEZ2), mRNA 


0.69 


226 


AF3 17083 


Anoplostoma viviparum isolate BrhlO large subunit ribosomal RNA 
gene, partial sequence; mitochondrial gene for mitochondrial product 


0.22 


227 


AE006436 


Lactococcus lactis subsp. lactis IL1403 section 198 of 218 of the 
complete genome 


0.029 


228 


AK027535 


Homo sapiens cDNA FLJ14629 fis, clone NT2RP2000448, weakly 
similar to KES1 PROTEIN 


7E-76 


229 


BC004209 


Homo sapiens, NIMA (never in mitosis gene a)-related kinase 6, clone 
MGC:4434 IMAGE:2958695, mRNA, complete cds 


3E-36 


230 


AC002183 


Homo sapiens (subclone 2Jh8 from BAC Hill) DNA sequence, 
complete sequence 


0.0000004 


231 


AF242489 


Prochlorococcus marinus UreC (ureC), UreB (ureB), UreA (ureA), 
UreD (ureD), UreE (ureE), UreF (ureF), and UreG (ureG) genes, 
complete cds 


0.0003 


232 


AF385396 


Homo sapiens psoriasis susceptibility gene, partial sequence 


0.027 


233 


AE006749 


Sulfolobus solfataricus section 108 of 272 of the complete genome 


2.1 


234 | 


X73501 


ELsapiens gene for cytokeratin 20 


0.25 


235 


AJ296801 


Glyptotendipes pallens 18S rRNA gene (partial), 2S rRNA gene, 28S 
rRNA gene (partial), internal transcribed spacer 1 (TTSl) and internal 
transcribed spacer 2 (TTS2), clone Gpaits9 


0.49 


236 


X17627 


C. thummi DNA transposable element TECthl 


0.59 | 


237 


AF157924 


Eutamias dorsalis cytochrome b (cytb) gene, complete cds; 
mitochondrial gene for mitochondrial product 


0.19 


238 


XM 040836 


Homo sapiens similar to calcium channel, voltage-dependent, alpha 
1H subunit (H. sapiens) (LOC91809), mRNA 


0.38 


239 


AF097103 


Oreina melanocephala 16S large subunit ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


0.23 


240 


AK013974 


Mus musculus 13 days embryo head cDNA, RJKEN full-length 
enriched library, clone:3110002D09, full insert sequence 


0.73 


241 


AK022073 


Homo sapiens cDNA FLJ12011 fis, clone HEMBB1001653 


0.26 


242 


XM_012392 


Homo sapiens ubiquitin specific protease 8 (USP8), mRNA 


2 


243 


AB048604 


Mus musculus Neu-2 mRNA for sialidase, complete cds 


0.54 


244 


AF275235 


Drepanaphis utahensis 12S small subunit ribosomal RNA gene, 
partial sequence; fRNA-Val gene, complete sequence; and 16S large 
subunit ribosomal RNA gene, partial sequence; mitochondrial genes 
for mitochondrial products 


0.076 


245 


XM 032956 


Homo sapiens cat eye syndrome chromosome region, candidate 2 
(CECR2), mRNA 


2.2 


246 


XM.050246 


Homo sapiens secretory protein SEC8; KIAA1699 protein (SEC8), 
mRNA 


1 o I 


247 


X69200 


D.tigrina Dth-1 gene, exon 2 


0.49 


248 


M33582 


O cuniculus beta-casein gene, complete cds 


0.006 


250 


AB048604 


Mus musculus Neu-2 mRNA for sialidase, complete cds 


0.46 
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251 


U48594 


Heliothis virescens insect storage protein (SP4) mRNA, partial cds 


0.022 


252 


AL162583 


Human DNA sequence from clone RP1-250J21 on chromosome 
Xpl 1.22-1 1.4, complete sequence [Homo sapiens] 


0.2 


i 253 


AF284038 


Cucurbita maxima phloem serpin-1 mRNA complete cds 


0.19 | 


254 


AK026699 


Homo sapiens cDNA: FLJ23046 fis, clone LNG02491 


0.6 


255 


AF069199 


Trioxys betulae NADH dehydrogenase 1 gene, mitochondrial gene 
encoding mitochondrial protein, partial cds 


0.19 


256 


Z12028 


L.pimpinellifolium gene encoding vacuolar invertase 


0.7 


257 


AB041256 


Panonychus mori mitochondrial COl gene for cytochrome oxidase 
subunit 1, partial cds, strain:TtDe 


0.21 


258 


AK022820 


Homo sapiens cDNA FLJ12758 fis, clone NT2RP2001328 


0.25 


259 


AE006782 


Sulfolobus solfataricus section 141 of 272 of the complete genome 


1.4 


260 


AF027656 


Homo sapiens cholesteryl ester transfer protein gene, promoter region 


0.00004 


261 


Z57957 


H.sapiens CpG island DNA genomic Msel fragment, clone 22b3, 
reverse read cpg22b3.rtla 


0.00003 


262 


AF266097 


Orthetrum cancellatum large subunit ribosomal RNA gene, partial 
sequence; tRNA- Valine gene, complete sequence; and small subunit 
ribosomal RNA gene, partial sequence; mitochondrial genes for 
mitochondrial products 


0.077 


263 


XM 003119 


Homo sapiens dynein light chain- A (LOC51143), mRNA 


5.6 


264 


AF099990 


Rattus norvegicus Ste-20 related kinase SPAK mRNA complete cds 


0.54 


265 


AE007394 


Streptococcus pneumoniae section 77 of 194 of the complete genome 


2.2 


266 


L07545 


Leishmania tarentolae kinetoplast mitochondrial MURF2 edited 
mRNA, complete cds 


0.002 


267 


L09706 


Homo sapiens complement component 2 (C2) gene allele b, exons 1 
through 8 


0.073 


268 


AF174326 


Pauesia unilachni 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


0.063 


269 


Z79794 


Mmusculus dystrobrevin gene, exon 8 


1.8 


270 


AB029325 


Oryza sativa gene for water channel protein RWC3, promoter region 
and complete cds 


1.7 


271 


U42461 


Xenopus laevis neural specific DNA binding protein (Xgli3) mRNA 
complete cds 


1.6 


272 


XM_029784 


Homo sapiens CGI-53 protein (LOC51098), mRNA 


6E-23 


273 


D14011 


Mouse reg II gene for regenerating protein II, complete cds 


0.071 


274 


AF179235 


Homo sapiens short-chain dehydrogenase/reductase 1 (SDR1) gene, 
exons 2, 3, and 4 


2.1 
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275 


AF264911 


Campylobacter jejuni putative UDP-N-acetylmuraimte-alanine ligase 
(murC) gene, partial cds; putative integral membrane protein, putative 
MutS (mutS), restriction and modification enzyme Cjel (cjel), putative 
transferase, and putative integral membrane > 


0.22 


276 


AF266074 


Gomphus exilis large subunit ribosomal RNA gene, partial sequence; 
tRNA-Valine gene, complete sequence; and small subunit ribosomal 
RNA gene, partial sequence; mitochondrial genes for mitochondrial 
products 1 


0.021 


277 


AF229988 


Cryptosporidium parvum P-type ATPase3 gene, complete cds 


1.9 


278 


AF250346 


Xenopus laevis early growth response protein 1 (egrl) gene, partial 
cds 


0.25 


279 


AL157448 


Homo sapiens mRNA; cDNA DKFZp761E0311 (from clone 
DKFZp761E0311) 


0.00001 


280 


AF386078 


Homo sapiens serine-cysteine proteinase inhibitor clade C member 1 
(SERPINC1) gene, complete cds 


2.1 


281 


S80932 


MVAT5-RX2 VSG=variant surface glycoprotein {promoter} 
Trypanosoma brucei= African trypanosomes, Genomic, 6583 nt] 


0.003 


282 


U09478 


Dictyostelium discoideum KAx-3 LagC protein (lagC) mRNA, 
complete cds 


0.008 


283 


U63580 


Oxytricha fallax micronuclear actin I gene, partial sequence 


0.022 


284 \ 


AF348411 


Macaca mulatto Per4 pseudogene sequence 


0.0008 


285 


AF053645 


Homo sapiens cellular apoptosis susceptibility protein (CSE1) gene, 
exons 3 through 10 


0.0002 


287 


U90928 


Arabidopsis thaliana glyoxalase II mitochondrial isozyme (Glx2-1) 
mRNA, nuclear gene encoding mitochondrial protein, complete cds 


2.2 


288 


AB050751 


Carabus pseudopusio mitochondrial ND5 gene for NADH 
dehydrogenase subunit 5, partial cds 


1.6 


289 


AF275220 


Prociphilus fraxinifolii 12S small subunit ribosomal RNA gene, 
partial sequence; tRNA-Val gene, complete sequence; and 16S large 
subunit ribosomal RNA gene, partial sequence; mitochondrial genes 
for mitochondrial products 


0.003 


290 


NM_014358 


Homo sapiens C-type (calcium dependent, carbohydrate-recognition 
domain) lectin, superfamily member 9 (CLECSF9), mRNA 


0.009 


291 


D25274 


Homo sapiens mRNA, clone:P02ST9 


1E-38 


292 


AF1 11936 


Callithrix jacchus isolate MJ50037 alpha (1,2) fucosyltransferase 
(FUT1) gene, partial cds 


1.8 


293 


AL592159 


Human DNA sequence from clone RP13-237P14 oh chromosome X, 
complete sequence [Homo sapiens] 


0.18 


294 


AK013928 


Mus musculus 13 days embryo head cDNA, RIKEN full-length 
enriched library, clone:3100003M19, full insert sequence 


1.9 


295 


AF289199 


Mus musculus junctin and aspartyl beta-hydroxylase (Asph) genes, 
partial cds 


0.079 
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296 


AE001425 


Plasmodium falciparum chromosome 2, section 62 of 73 of the 
complete sequence 


0.008 


297 


XM011029 


Homo sapiens L1M domain-containing preferred translocation partner 
in lipoma (LPP), mRNA 


0.23 


298 


AF105034 


Arabidopsis thaliana delta7 sterol C-5 desaturase (STE1) gene, 
complete cds 


0.009 


299 


L77040 


Homo sapiens (subclone 8 cl 1 from PI H22) DNA sequence j 


9E-14 


300 


AF278536 


Parastrongyloides trichosuri heat shock 70 protein gene, complete cds 


0.61 


302 


Z28216 


S.cerevisiae chromosome XI reading frame ORF YKL216w 


1.7 


303 


AJ008049 


Chrysolina colasi 16S rRNA gene 


0.0007 


304 


AF280812 


Glycine max putative Hslpro-1-like receptor mRNA, complete cds 


0.67 


305 


AK007831 


Mus musculus 10 day old male pancreas cDNA, RIKEN full-length 
enriched library, clone:1810048N21, full insert sequence 


0.047 


306 


AL163543 


Human DNA sequence from clone RP1 1-424E21 on chromosome 13, 
complete sequence [Homo sapiens] 


0.64 


307 


AF045022 


Bos taurus phosphatide acid-preferring phospholipase Al mRNA, 
complete cds 


5 ! 


1 308 


AB052731 


Oikopleura longicauda OilBra mRNA for brachyury protein, complete 
cds 


1.8 


309 


AF194275 


Holbrookia maculata clone HMFL 12S ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


6.3 


310 


AJ400814 


Dictyostelium discoideum srfA gene, alternative promoters 


0.024 


311 


AK002854 


Mus musculus adult male kidney cDNA, RIKEN full-length enriched 
library, clone:0610039P13, full insert sequence 


0.68 


312 


U25177 


Helobdella robusta cyclin A mRNA, partial cds 


0.023 


313 


AK011102 


Mus musculus 13 days embryo liver cDNA, RIKEN full-length 
enriched library, clone:2510044F14, full insert sequence 


0.51 


314 


AK002181 


Homo sapiens cDNAFLJ11319 fis, clone PLACE10 10293 


0.0006 


315 


U67282 


Human mis5 homolog (MCM6) gene, exon x 


0.014 1 


316 


XMJH0357 


Homo sapiens chromosome 6 open reading frame 5 (C6orf5), mRNA 


1.5 


317 


AK006073 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone: 1700017M07, full insert sequence 


6.7 


318 


AE001381 


Plasmodium falciparum chromosome 2, section 18 of 73 of the 
complete sequence 


0.18 


319 


NM 017678 


Homo sapiens hypothetical protein FLJ20 127 (FLJ20127), mRNA 




320 


BC004680 


Mus musculus, Similar to coxsackievirus and adenovirus receptor, 
clone MGC:5878 IMAGE:3500491, mRNA, complete cds 


0.24 


321 


AK002181 


Homo sapiens cDNA FLJ11319 fis, clone PLACE1010293 


0.007 


322 


U90946 


Dictyostelium discoideum myosin heavy chain kinase B (MHCK-B) 
mRNA, complete cds 


0.003 


323 


Y14277 


Drosophila melanogaster mRNA for nuclear protein S A 


0.062 
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324 


Z17272 


H. sapiens DNA segment containing (CA) repeat; clone AFM248wcl; 
single read 


0.25 


325 


M37526 


E.gracilis chloroplast P-700 chlorophyll alpha apoprotein (psaA and 
psaB) genes, complete cds 


0.025 


326 


Y14952 


Mus musculus gene encoding immunoglobulin J chain precursor 


0.24 


327 


AE006596 


Streptococcus pyogenes Ml GAS strain SF370, section 125 of 167 of 
the complete genome 


6.8 


328 


AF079804 


Candida albicans N-acetyl-glucosamine-6-phosphate deacetylase 
(DAC1) and glucosamine-6-phosphate deaminase (NAG1) genes, 
complete cds; and hexokinase (HXK1) pseudogene, complete 
sequence 


2 


329 


XM 029315 


Homo sapiens Machado-Joseph disease (spinocerebellar ataxia 3, 
olivopontocerebellar ataxia 3, autosomal dominant, ataxin 3) (MJD), 
mRNA 


7.8 


330 


AK022820 


Homo sapiens cDNA FLJ12758 fis, clone NT2RP2001328 


0.24 


331 


AF059275 


Mus musculus heat shock transcription factor 1 (Hsfl) gene, partial 
cds 


0.085 


332 


AL049263 


Homo sapiens mRNA; cDNADKFZp564F133 (from clone 
DKFZp564F133) 


0.059 


333 


AP001233 


Homo sapiens genomic DNA, chromosome 2pll.2, clone:cos607/2 ? 


0.003 


334 


AC017021 


Homo sapiens BAC clone RP11-208M4 from 7, complete sequence 


0.0000005 


! 335 


U03248 


Human chromosome 21 clone pVC1.23c 


3E-56 


336 


XM 046758 


Homo sapiens tensin (TNS), mRNA 


2 


337 


AJ279150 


Homo sapiens partial TFNR gene for transcription factor-like nuclear 
regulator, exon 31 


0 


338 


AF282896 


Rattus norvegicus chemokine RANTES gene, promoter sequence 


0.001 


i 339 


AK000939 


Homo sapiens cDNA FLJ10077 fis, clone HEMBA1001864 


0.21 


340 


AF188517 


Staphylococcus aureus Bit-like protein SbtA (sbtA) gene, complete cds 


0.008 


341 


AF202562 


Homo sapiens DNA methyltransferase (DNMT1) gene, exon 13 


0.22 


342 


XM 017967 


Homo sapiens hypothetical protein FLJ13732 similar to tensin 
(FLJ13732), mRNA 


2.7 


343 


AF051685 


Homo sapiens clone SBS-12 SATB1 binding sequence in vivo 


0.21 


344 


M77188 


Campylobacter jejuni argininosuccinate lyase (argH) gene, complete 
cds 


0.22 


345 


U95137 


Rattus norvegicus Na+-Ca+ exchanger (NCX1) gene, exon 1-Ht 


0.24 


346 


AK019943 


Mus musculus 6 days neonate head cDNA, RIKEN full-length 
enriched library, clone:5430403G16, full insert sequence 


0.74 


347 


BC005571 


Mus musculus, ubiquitin specific protease 14, clone MGC:7106 
IMAGE:3 157723, mRNA, complete cds 


0.41 


348 


XM 026968 


Homo sapiens hypothetical protein MGC2668 (MGC2668), mRNA 


4E-16 
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349 1 


AK024903 


Homo sapiens cDNA: FLJ21250 fis, clone COL01253, highly similar 
to AB020527 Homo sapiens mRNA for Na/P04 cotransporter 
hoinolog 


7E-28 


350 


U85645 


Oryctolagus cuniculus fructose 1,6, bisphosphate aldolase (AldB) 
gene, complete cds 


0.075 


351 


AF388026 


Homo sapiens fibrinogen, B beta polypeptide (FGB) gene, complete 
cds 


5.8 


352 


AY028321 


Homo sapiens pro-melanin-concentrating hormone-like 2 protein 
(PMCHL2) gene, exons 4, 5a, and 5b, alternatively spliced 


0.0003 


353 


AL513491 


Human DNA sequence from clone RP1 1-79A21 on chromosome X, 
complete sequence [Homo sapiens] 


1.9 


354 


M92068 


Chimpanzee retrovirus-like sequence-isoleucine b (RTVL-Ib) gene, 5* 
and 3' LTR 


1.5 


355 


XM 002120 


Homo sapiens zinc finger protein 262 (ZNF262), mRNA 


0.007 


356 


X02435 


Tetrahymena mitochondrial gene for tRNA-Phe (GAA) 


0.028 


357 


XM_033459 


Homo sapiens syntaxin binding protein 1 (STXBP1), mRNA 


2.1 


358 


NM 015770 


Mus musculus non-agouti (a), mRNA 


0.64 


359 ! 


AJ296103 


Staphylococcus aureus repNVH99 gene for replication protein and 
smr gene 


0.15 


360 


Z12840 


O.cuniculus mRNA for protein of unknown function 


e-131 


361 1 


XM 038370 


Homo sapiens hypothetical gene supported by AK001938; AK001941 
(LOC91424), mRNA 


0.0002 


362 


AF108684 


Rhipidomys mastacalis cytochrome B (cytB) gene, mitochondrial gene 
encoding mitochondrial protein, partial cds 


0.009 


363 


NM_020045 


Mus musculus HIRA-interacting protein 5 (HIRIP5), mRNA 


1.8 


364 


AK024439 


Homo sapiens mRNA for FLJ00029 protein, partial cds 


0.73 


365 


[ AK025445 


Homo sapiens cDNA: FLJ21792 fis, clone HEP00441 


2.1 


366 


AP000307 


Homo sapiens genomic DNA, chromosome 21q22.1, D21S226-AML 
region, clone:f6C7, complete sequence 


0.00004 


367 


AE000818 


Methanobacterium thermoautotrophicum from bases 264585 to 
276866 (section 24 of 148) of the complete genome 


2.3 


368 


AK001164 


Homo sapiens ct)NAFLJ10302 fis, clone NT2RM2000042 


0.13 


369 


XM_029101 


Homo sapiens KIAA0947 protein (KIAA0947), mRNA 


0.95 


370 


AK022191 


Homo sapiens cDNA FLJ12129 fis, clone MAMMA1000198 


0.029 


371 


Z54280 


S.scrofa gene for skeletal muscle ryanodine receptor 


1.9 j 


372 


AY037259 


Arabidopsis thaliana AT3g52120/F4F15 230 mRNA, complete cds 


9.6 


373 


XM 034599 


Homo sapiens acid sphingomyelinase-like phosphodiesterase 
(ASM3A), mRNA 


O 


374 


XM 046914 


Homo sapiens SWI/SNF related, matrix associated, actin dependent 
regulator of chromatin, subfamily f, member 1 (SMARCF1), mRNA 


3E-16 


375 


M86340 


Mycoplasma-like sp. 23 S and 16S ribosomal RNAs, Tyr-transfer RNA 
and He-transfer RNA, complete cds 


0.24 


376 


AB019507 


Drosophila kanekoi gene for glycerol-3-phosphate dehydrogenase, 
complete cds 


1.3 
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377 


AF106915 


Phodopus sungorus luteinizing hormone beta subunit mRNA, partial 
cds 


0.61 


378 


AL354800 


Human DNA sequence from clone RP1-3 1615 on chromosome 20 
Contains the 3' end of the KIF3B gene encoding a kinesin family 
member 3B (KIAA0359) and ESTs, complete sequence [Homo 
sapiens] 


0.0001 


379 


AL133430 


Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 
233360 


0.66 


380 


AF085894 


Homo sapiens full length insert cDNA clone YP97D 1 1 


0.0002 


381 


AJ286750 


Sesbania rostrata mRNA for phosphoenolpyravate carboxylase (pepc 
gene) 


0.068 


382 


XM_040415 


Homo sapiens region containing hypothetical protein; mutL (E. coli) 
homolog 3 (LOC82389), mRNA 


e-128 


383 


Y09794 


A.rabiei microsatellite DNA 


1.2 


384 


AB047905 


Macaca fascicularis brain cDNA, clone:QnpA-1591 1 


1.8 


385 


AK001451 


Homo sapiens cDNA FLJ10589 fis, clone NT2RP2004389, weakly 
similar to PROBABLE MITOCHONDRIAL 40S RIBOSOMAL 
PROTEIN S9 PRECURSOR 


1.5 


386 


AF010473 


Schizosaccharomyces pombe myosin-like protein Sp8 (sp8) mRNA, 
partial cds 


0.19 


388 


AL133604 


Homo sapiens mRNA; cDNA DKFZp434G1615 (from clone 
DKFZp434G1615) 


0.00006 


389 


XM_009252 


Homo sapiens methyl-CpG binding domain protein 3 (MBD3), 
mRNA 


0.055 


390 


AB053116 


Polyandrocarpa misakiensis mRNA for putative eukaryotic petide 
chain release factor subunit 1, complete cds 


1.5 


391 


AL138666 


S. pombe chromosome I cosmid c694 


1.3 


392 


AF091933 


Pleurotus ostreatus small subunit ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


0.078 


393 


Z37964 


Staphylococcus sp. genes encoding QacC and replication protein 
(rep827) 


0.073 


394 


AB020712 


Homo sapiens mRNA for KIAA0905 protein, complete cds 


2E-64 


395 


BC0O9O23 


Mus musculus, Similar to prostaglandin E receptor 4 (subtype EP4), 
clone MGC:7183 IMAGE:3481696, mRNA, complete cds 


1.5 


396 


XM 010128 


Homo sapiens odz (odd Oz/ten-m, Drosophila) homolog 1 (ODZ1), 
mRNA 


0.55 


397 


XM011386 


Homo sapiens hypothetical protein PRO0813 (PRO0813), mRNA 


5.3 


398 


XM 040694 


Homo sapiens AD031 protein (AD031), mRNA 


1.3 


399 


AE006177 


Pasteurella midtocida PM70 section 144 of 204 of the complete 
genome 


5 


400 


AL1 13079 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen 
deprivation 


1.2 


401 


U39694 


Mycoplasma genitalium section 16 of 51 of the complete genome 


0.84 


402 


XM 016314 


Homo sapiens ankyrin 2, neuronal (ANK2), mRNA 


4.6 
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403 


XM 045362 


Homo sapiens likely homolog of rat kinase D-interacting substance of 
220 kDa; KIAA1250 protein (KIAA1250), mRNA 


7E-77 


404 


NM 028513 


Mus musculus RIKEN cDNA 1700052K15 gene (1700052K15Rik), 
mRNA 


9 


405 


U47416 


Solanum allophyllum NADH dehydrogenase subunit (ndhF) gene, 
chloroplast gene encoding chloroplast protein, partial cds 


0.2 


406 


NC 002502 


Lactococcus Iactis subsp. lactis plasmid pCI305, complete sequence 


5.3 


407 


AF269561 


Staphylococcus epidennidis strain SRI clone step.l010e06 genomic 
sequence 


0.19 


409 


AF336128 


Lumpy skin disease virus strain Neethling, 5 f partial sequence 


1.5 


410 


AB058397 


Oiyza sativa CHS gene for chalcone synthase, complete cds 


0.16 


411 ! 


AE006830 


Sulfolobus solfataricus section 189 of 272 of the complete genome 


0.21 


412 


AY035064 


Arabidopsis thaliana unknown protein (T20O10_100/AT3g63000) 
mRNA, complete cds 


0.22 


413 


AF158049 


Platybrachys decemmacula 16S ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


0.063 


414 


NM 026444 


Mus musculus citrate synthase (Cs), mRNA 


1.7 


415 


BC007820 


Homo sapiens, clone 1MAGE:4303165, mRNA 


0 


416 


AJ295228 


Caenorhabditis elegans mRNA for MAGUK protein DLG-1 (dlg-1 
gene) 


0.2 


417 


AB028633 


Flammulina velutipes mitochondrial gene for DNA polymerase, RNA 
polymerase, complete and partial cds 


0.6 


418 


Y09794 


A.rabiei microsatellite DNA 


1.2 


419 


AB054063 


Pagrus major lpl gene for lipoprotein lipase, complete cds 


5.9 


420 


AF323928 


Plasmodium falciparum GcpE (gcpE) gene, complete cds 


0.14 


421 


M31012 


Chicken MHC class I B-FTV-B12 alpha-chain gene, complete cds 


1.2 


422 


Z49806 


B.taurus mRNA for adenylyl cyclase type VII 


6.1 


423 


AB009907 


Luciola kuroiwae mitochondrial DNA for 16S rRNA, partial sequence 


0.003 


424 


X76693 


C.coli (UA585) sodB gene for superoxide dismutase 


0.45 


425 


AF284942 


Carex capitata tRNA-Thr, partial sequence; trnT-trnL intergenic 
spacer, tRNA-Leu, and trnL-trnF intergenic spacer, complete 
sequence; and tRNA-Phe, partial sequence; chloroplast genes for 
chloroplast products 


0.29 


426 


AK017986 


Mus musculus adult male thymus cDNA, RIKEN full-length enriched 
library, clone:5830443G21, full insert sequence 


^ i 


s 427 


AL5 12549 


S.pombe chromosome I BAC pB2B4 


1.6 




AF283669 


Homo sapiens kaUikrein 14 (KLK14) gene, complete cds 


2.3 


429 


Z12842 


O.cuniculus mRNA for protein of unknown function 


5E-73 


430 


XM_016894 


Homo sapiens LOC86241 (LOC86241), mRNA 


3E-65 


431 


X96616 


P.primaurelia gene encoding 156D surface antigen 


0.23 


432 


X02175 


Schizosaccharomyces pombe cdclO start gene 


0.027 


433 


AK021807 


Homo sapiens cDNA FLJ11745 fis, clone HEMBA1005526 


0.001 
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434 


XM_005233 


Homo sapiens protein phosphatase 2 (formerly 2 A), regulatory subunit 
B (PR 52), alpha isoform (PPP2R2A), mRNA 


5E-09 


435 


X52623 


Rice 4-CL gene for 4-coumarate-CoA ligase (EC 6.2. 1. 12) 


0.63 


436 


AK005294 


Mus musculus adult male cerebellum cDNA, RIKEN full-length 
enriched library, clone: 1500019016, full insert sequence 


1E-16 


437 


AE002337 


Chlamydia muridarum, section 65 of 85 of the complete genome 


0.049 


438 


AF106574 


Caenorhabditis elegans cosmid E02D9, complete sequence 


0.7 


439 


X56866 


S.alba 5S rRNA gene 


0.74 


440 


AF127158 


Drosophila mauritiana Cu-Zn superoxide dismutase (Sod) gene, exons 
1 and 2 and complete cds 


1.7 


441 


U08466 


Nephila clavipes alanine tRNA gene, complete sequence 


2.1 


442 


AY021236 


Oryza sativa microsatellite MRG3561 containing (TA)X22, genomic 
sequence 


1.5 


443 


L09262 


Arabidopsis thaliana phytochrome b gene, exons 2 and 3 


0.079 


444 


AC007095 


Homo sapiens BAC clone RP1 1-303 A22 from 7, complete sequence 


0.23 


445 


AKO 12607 


Mus musculus 1 1 days embryo cDNA, RIKEN full-length enriched 
library, clone:2700094F01, full insert sequence 


0.19 


446 


AF309415 


Cyprinus carpio ovarian fibroin-like substance-2 mRNA, partial cds 


0.008 


447 


X03956 


Potato (tetraploid variety Maris piper) patatin gene 


0.003 


448 


BC008617 


Mus musculus, clone IMAGE:3588380, mRNA, partial cds 


0.07 


449 


AJ242146 


Mauritiella armata 18S rRNa gene (partial), 5.8S rRNA gene, 26S 
rRNA gene (partial) and internal transcribed spacers 1 and 2 (TTS1, 
ITS2), clone 1 


0.07 


450 


M67489 


Bovine desmocollin mRNA, complete cds 


0.066 


451 


AB048954 


Macaca fascicularis brain cDNA, clone:QnpA- 10509 


0.48 


452 


AE0O1136 


Borrelia burgdorferi (section 22 of 70) of the complete genome 


0.18 


453 


AE001398 


Plasmodium falciparum chromosome 2, section 35 of 73 of the 
complete sequence 


0.025 


454 


AF218904 


Homo sapiens attractin precursor (ATRN) gene, exon 19 


1.6 


455 


AK025078 


Homo sapiens cDNA: FLJ21425 fis, clone COL04162 


1E-71 


456 


Ml 1449 


Yeast mitochondrial ori2-ori7 region DNA with putative peptide 


0.34 


457 


XM 031156 


Homo sapiens hypothetical gene supported by AB007970 
(LOC90360), mRNA 


0.006 


458 


AK015180 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4930422I22, full insert sequence 


0.007 


459 


AC090422 


Homo sapiens clone RP1 1-691 A12, complete sequence 


0.013 j 


460 


U06154 


Human clone 2004V-I-1 from chromosome lq terminal region 


1E-19 


461 


AE001123 


Borrelia burgdorferi (section 9 of 70) of the complete genome 


0.83 


462 


Y17797 


Enterococcus feecalis gph, ydjH, ydjG, ydjl, pbp4 and ydiC, ORF2 
and ORF3 genes, partial 


0.22 


463 


AB026906 


Homo sapiens SDHD gene for small subunit of cytochrome b of 
succinate dehydrogenase, complete cds 


0.042 


464 


L13942 


Human (clone Cos35) glycerol kinase (GK) gene, exons 1-3 


0.042 


465 


U81510 


Spodoptera frugiperda caspase-1 mRNA, complete cds 


0.041 
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466 


AJ003222 


Borrelia burgdorferi flgK, flbF, thdF, gidA, gidB, moxR, orfl, orf2, 
or£3, orf4 and orf5 genes 


0.58 


467 


AY034379 


Capsicum annuum branched-chain amino acid aminotransferase 
mRNA, complete cds 


1.8 


468 


AF120317 


Mus musculus mahogany (mg) gene, exons 


0.56 


469 


AE007451 


Streptococcus pneumoniae section 134 of 194 of the complete genome 


0.005 


470 


AF305819 


Homo sapiens PRO0777 mRNA, complete cds 


0.055 


471 


AF045950 


Mus musculus D16Jhu5 YAC 183G11 acentric end, partial sequence 


0.22 


473 


XM 010416 


Homo sapiens GTP-binding protein ragB (RAGB), mRNA 


0.14 


474 


XM 028004 


Homo sapiens amyotrophic lateral sclerosis 2 (juvenile) chromosome 
region, candidate 3 (ALS2CR3), mRNA 


0.00005 


475 


AK021820 


Homo sapiens cDNA FLJ11758 fis, clone HEMBA1005609, highly 
similar to Homo sapiens mRNA; cDNA DKFZp564K133 (from clone 
DKFZp564K133) 


0.004 


476 


AE002579 


Drosophila melanogaster genomic scaffold 142000013385638, 
complete sequence 


6.7 


477 


AF047677 


Homo sapiens dystrophin (DMD) gene, deletion breakpoints 1-3 in 
intron5 


0.64 


478 


AL589203 


Human DNA sequence from clone RP11-391F23 on chromosome 6, 
complete sequence [Homo sapiens] 


0.00009 


479 


AF265343 


Danio rerio protein kinase Npk mRNA, complete cds 


0.14 


480 


XM_031879 


Homo sapiens CREB/ATF family transcription factor (CREB-H), 
mRNA 


0.44 


481 


XM 050760 


Homo sapiens ATP-binding cassette, sub-family C (CFTR/MRP), 
member 2 (ABCC2), mRNA 


0.077 


482 


AF069508 


Homo sapiens human endogenous retrovirus HML6.28 gag andpol 
pseudogenes, complete sequence 


0.23 


483 


| AK022914 


Homo sapiens cDNA F1J12852 fis, clone NT2RP2003445 


2 


484 


XM 032753 


Homo sapiens hypothetical gene supported by AK022412 
(LOC90584), mRNA 


1.9 


485 


XM 018436 


Homo sapiens hypothetical protein FLJ14327 (FU14327), mRNA 


0.66 


486 


AF130764 


Equus caballus FN1 gene, partial sequence 




487 


AK024286 


Homo sapiens cDNA FLJ14224 fis, clone NT2RP3004028 


2E-18 


488 


AF348342 


Plasmodium malariae caseinolytic protease C (clpC) gene, partial cds; 
apicoplast gene for apicoplast product 


0.23 


489 


XM 029336 


Homo sapiens similar to potassium voltage-gated channel, subfamily 
H (eag-related), member 2; ether a go-go related (M. musculus) 
(LOC90134), mRNA 


6.2 


490 


BC005951 


Homo sapiens, clone MGC: 14588 IMAGE:4249174, mRNA, 
complete cds 


5E-35 


491 


AJ391749 


Vicia melanops genomic repetitive element, clone M29 


0.13 


492 


AF154170 


Sceloporus dugesii 12S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


0.21 



234 



INSDOCID: <WO 021 4500A2_I_> 



WO 02/14500 



PCT/US01/25840 





Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


493 


X84060 


H.sapiens TCF11 gene, exon 3-6 


0.21 


494 


AP000418 


Arabidopsis thaliana genomic DNA, chromosome 3, PI clone:MPK17 


0.003 


495 


AJ277428 


Scenedesmus obliquus mitochondrial tRNA-Ser, tRNA-Asp, 16S 
rRNA, 23S rRNA genes, strain KS3-2 


0.65 


496 


LI 1447 


Mycoplasma hyorhinis repeat regions in potential metal binding 
protein gene region 


0.046 


497 


AY029764 


Mus musculus TRAM1 mRNA, complete cds 


0.23 


498 


XM_031173 


Homo sapiens parathyroid hormone (PTH), mRNA 


0.22 


499 


AF270168 


Staphylococcus epidermidis strain SRI clone step. 1051f02 genomic 
sequence 


0.24 


500 


AF336131 


Lumpy skin disease virus strain Neethling, 3* partial sequence 


3.5 


I 501 


AF195272 


Mus musculus serine/threonine kinase A3E1 gene, complete cds 


6.7 


502 


NC_002322 


Laqueus rubellus mitochondrion, complete genome 


0.26 


503 


AK007179 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone: 17001 12 J16, foil insert sequence 


0.24 


504 


U32706 


Haemophilus influenzae Rd section 21 of 163 of the complete genome 


0.68 


505 


AF247555 


Drosophila mauritiana heat shock protein 68 gene, partial cds 


1.9 


506 


NM_011146 


Mus musculus peroxisome proliferator activated receptor gamma 
(Pparg), mRNA 


0.024 


507 


M26940 


Mouse beta-casein gene, complete cds 


0.002 | 


508 


AB003043 


Phycomyces blakesleeanus classll chitin synthase (PbCHSl) gene, 
complete cds 


0.68 


509 


U89348 


Human papillomavirus type 16 variant, complete sequence 


0.62 


511 


AF270160 


Staphylococcus epidermidis strain SRI clone step. 105 le03 genomic 
sequence 


0.077 


512 


AF1 19554 


Plasmodium falciparum para-aminobenzoic acid synthetase gene, 
complete cds 


0.078 


513 


AY014991 


Waterstoniella sp. CAM-2000 cytochrome oxidase subunit I (COI) 
gene, partial cds; mitochondrial gene for mitochondrial product 


0.68 


514 


NMJ 13037 


Rattus norvegicus Fos-responsive gene 1 (St2), mRNA 


0.085 


515 


AF169003 


Hepatitis C virus isolate G2aKl polyprotein gene, complete cds 


0.028 


516 


| U23442 


Tetrahymena thermophila RR internal deletion sequence 


0.25 


517 


AF381638 


Homo sapiens SNP ECU psoriasis susceptibility gene candidate 
interval, partial sequence 


0.0003 


518 


NM 023755 


Mus musculus Tcfcp2-related transcriptional repressor 1 (Crtrl- 
pending), mRNA 


0.009 


519 


NMJ)04706 


Homo sapiens Rho guanine nucleotide exchange factor (GEF) 1 
(ARHGEF1), mRNA 


3.4 


520 


X67164 


P.hybrida T-DNA integration region 


0.063 


521 


AB042275 


Canine herpesvirus CICP0 gene for infected cell protein 0, complete 
cds 


0.66 


522 


AB019942 


Arabidopsis thaliana gene for sigma factor SigA, complete cds 


1.4 
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523 


AF270382 


Staphylococcus epidermidis strain SRI clone step.4041f02 genomic j 
sequence 


0.23 


524 


AF284897 


Carex supina tRNA-Thr, partial sequence; trnT-trnL intergenic 
spacer, tRNA-Leu, and trnL-trnF intergenic spacer, complete 
sequence; and tRNA-Phe, partial sequence; chloroplast genes for 
chloroplast products 


0.23 


525 


AK000493 


Homo sapiens cDNA FLJ20486 fis, clone KAT08039 


0.17 


526 


U67535 


Methanococcus jannaschii section 77 of 150 of the complete genome 


0.6 


527 


AE002754 


Drosophila melanogaster genomic scaffold 142000013385431, 
complete sequence 


1.8 


528 


AF241656 


Caenorhabditis elegans SR protein specfic kinase SPK-1 mRNA, 
complete cds 


1.7 


529 


AF173828 


Caenorhabditis elegans putative potassium channel Slo-2 mRNA, 
complete cds 


0.21 


530 


AE005382 


Escherichia coli 0157.H7 EDL933 genome, contig 3 of 3, section 1 of 
290 


2.3 


531 


Z36154 


S.cerevisiae chromosome II reading frame ORF YBR285w 


0.68 


532 


AF054126 


Rattus norvegicus polymorphic marker D6UIA6 sequence 


0.061 


533 


AK004780 


Mus musculus adult male lung cDNA, RIKEN full-length enriched 
library, clone: 12000 15C19, full insert sequence 


0.7 


534 


AF352555 


Staphylococcus aureus subsp. aureus strain ATCC31889 leukocidin 
LukS component gene, partial cds 


0.2 


535 


AF352555 


Staphylococcus aureus subsp. aureus strain ATCC3 1889 leukocidin 
LukS component gene, partial cds 


0.1 


536 


J05246 


Mouse alpha-fetoprotein (AFP) gene, 5' flank 


0.024 


537 


AF021935 


Rattus norvegicus mytonic dystrophy kinase-related Cdc42-binding 
kinase (MRCK) mRNA, complete cds 


e-141 


538 


Y12573 


D. melanogaster Jun and 14-3-3 zeta gene 


0.5 


539 


AF248584 


Moraxella catarrhalis UDP-glucose dehydrogenase gene, partial cds; 
and UDP-glucose 4-epimerase (galE) and L-glutamine:D-fructose-6- 
phosphate aminotransferase (glmS) genes, complete cds 


0.026 


540 


AL136554 


Homo sapiens mRNA; cDNA DKFZp761D0712 (firom clone 
DKFZp761D0712) 


0.089 


541 


AF158049 


Platybrachys decemmacula 16S ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


0.082 


542 


AY023618 


Oryza sativa microsatellite MRG5943 containing (TTA)X55, closest 
to marker RG908, genomic sequence 


0.00009 


543 


AL591498 


Human DNA sequence firom clone RP11-113L12 on chromosome 13, 
complete sequence [Homo sapiens] 


3E-14 


544 


AF183908 


Danio rerio P450 aromatase (cypl9b) mRNA, complete cds 


0.069 


545 


AK016658 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone: 493 3405 A14, full insert sequence 


0.53 


546 


AB014084 


Homo sapiens genomic DNA, chromosome 6p21.3, HLA class I 
region, Cosmid clone :TY7A5, complete sequence 


0.011 
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547 


AF285076 


Scleria reticularis tRNA-Leu, partial sequence; trnL-trriF intergenic 
spacer, complete sequence; and tRNA-Phe, partial sequence; 
chloroplast genes for chloroplast products 


0.19 


548 


XM_0476O4 


Homo sapiens copropoiphyrinogen oxidase (coproporphyria, 
harderoporphyria) (CPO), mRNA 


0.17 


549 


Z81538 


Caenorhabditis elegans cosmid F45H10, complete sequence 


0.24 


550 


AF026800 


Fundulus heterochtus cytochrome P4501A (CYP1A) mRNA, 
complete cds 


1.9 


551 


XM_051897 


Homo sapiens phospholipase A2, group IVA (cytosolic, calcium- 
dependent) (PLA2G4A), mRNA 


9E-08 


552 


XMJ)15200 


Homo sapiens hypothetical protein FLJ22724 (FLJ22724), mRNA 


2.1 


553 


AF266244 


Gillichthys mirabilis adenylate cyclase type v-like protein mRNA, 
complete cds 


6 


554 


Z21818 


H.sapiens carcinoembiyonic antigen gene 


0.23 


555 


AE002100 


Ureaplasma urealyticum section 1 of 59 of the complete genome 


0.023 


556 1 


XM__045179 


Homo sapiens hypothetical gene supported by AL 137724 
(LOC92454), mRNA 


0.19 


557 


X16509 


Rice aipha-amylase gene 


1.8 


558 


M33753 


D.melanogaster crumbs protein mRNA, complete cds 


5.4 


559 


D32022 


Human mRNA for T cell receptor V beta 8 CDR3, parcial sequence 


1.5 


560 


AF188893 


Homo sapiens guanidinoacetate N-methyltransferase (GAMT) gene, 
complete cds 


0.18 


561 


AF367310 


Arabidopsis thaliana At2g36880/Tl J8.6 gene, complete cds 


0.11 


562 


AF207699 


Elaeis guineensis agamous-like MADS box protein OPMADS1 
mRNA, complete cds 


0.45 


564 


AB014586 


Homo sapiens mRNA for KIAA0686 protein, partial cds 


0.17 


565 


L77040 


Homo sapiens (subclone 8_cll from PI H22) DNA sequence 


8E-14 


566 


NM_024942 


Homo sapiens hypothetical protein FLJ13490 (FLJ13490), mRNA 


0.013 


567 


U88158 


Tetrahymena thennophila micronuclear developmentally eliminated 
sequence region 


0.18 


568 


AF210744 


Nannochloropsis oculata large subunit ribosomal RNA gene, partial 
sequence 


0.057 


569 


AL590384 


Human DNA sequence from clone RP11-349A16 on chromosome 
Xq22.3-24, complete sequence [Homo sapiens] 


2E-15 


570 


AY034379 


Capsicum annuum branched-chain amino acid aminotransferase 
mRNA, complete cds 


1.2 


571 


AF038283 


Campylobacter jejuni cytolethal distending toxin A (cdtA), cytolethal 
distending toxin B (cdtB), and cytolethal distending toxin C (cdtC) 
genes, complete cds 


0.4 


572 


X98807 


A.thaliana mRNA for peroxidase ATP21a 


1.8 


573 


AF111941 


Dictyostelium discoideum AX4 development protein DG1148 
(DG1148) gene, complete cds 


0.0009 
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574 


AE002247 


Chlamydophila pneumoniae AR39, section 74 of 94 of the complete 
genome 


1.7 


575 


AF247555 


Drosophila mauritiana heat shock protein 68 gene, partial cds 


1.4 


576 


AJ237599 


Gallus gallus mRNA for iroquois homologue-2 


0,056 


577 


NMJ)30995 


Rattus norvegicus Microtubule-associated protein la (Mapla), mRNA 


0.21 


578 


XM 027638 


Homo sapiens KIAA0240 protein (KIAA0240), mRNA 


1.5 


579 


AF3 10886 


Dictyostelium discoideum RacA (racA) gene, complete cds 


0.065 


580 


AB001090 


Homo sapiens gene for H-cadherin, exon 1 


0.2 


581 


BC009101 


Mus musculus, Similar to hypothetical protein DKFZp761C121, clone 
MGC: 12146 IMAGE.3710846, mRNA, complete cds 


0.0008 


582 


XM 011473 


Homo sapiens serine (or cysteine) proteinase inhibitor, clade B 
(ovalbumin), member 9 (SERPINB9), mRNA 


1.8 


583 


AF004051 


Galium hypocarpium rpsl6 gene, chloroplast gene, partial intron 
sequence 


0.068 


584 


AE007440 


Streptococcus pneumoniae section 123 of 194 of the complete genome 


0.066 


585 


L25647 


Homo sapiens fibroblast growth factor receptor gene (located in the 
central MHC) signal peptide and consecutive exon 


0.069 


586 


AF191079 


Stealth virus 1 clone 3B632, genomic sequence 


0.009 


587 


L20465 


Onchocerca volvulus nicotinic acetylcholine receptor mRNA, partial 
cds 


0.72 


588 


NM 008713 


Mus musculus nitric oxide synthase 3, endothelial cell (Nos3), mRNA 


2 


589 


NM 004052 


Homo sapiens BCL2/adenovirus E1B 19kD-interacting protein 3 
(BNIP3), nuclear gene encoding mitochondrial protein, mRNA 


4E-26 


590 


U40228 


Plasmodium falciparum ADP-ribosylation factor 1 (pARFl) mRNA, 
complete cds 


0.009 


591 


AL049435 


Homo sapiens mRNA; cDNA DKFZp586B0220 (from clone 
DKFZp586B0220) 


0.008 


592 


AC000981 


Homo sapiens (subclone 2_c6 from PI H31) DNA sequence, complete 
sequence 


0.68 


593 


AF304319 


Neospora caninum small subunit ribosomal RNA and large subunit 
ribosomal RNA genes, partial sequences and intergenic region, plastid 
genes for plastid products 


0.009 


594 


BC006517 


Homo sapiens, Similar to PP AR binding protein, clone 
IMAGE:3546031, mRNA 


1 7E-43 


595 


AJ249162 


Homo sapiens promoter enhancer from ISG20 gene 


2.1 


596 


AJ297708 


Rattus norvegicus RT6 gene for T cell differentiation marker RT6.2, 
exons 1-8 


0.024 


J? I 


Y11779 


Kprowazekii ygjT-like gene and 4 open reading frames 


0.078 


598 


AE005660 


Escherichia coli 0157:H7 EDL933 genome, contig 3 of 3, section 279 
of 290 


0.53 


599 


XM 041617 


Homo sapiens RAB11B, member RAS oncogene family (RAB11B), 
mRNA 


4.8 


600 


AK023165 


Homo sapiens cDNA FLJ13 103 fis, clone NT2RP3002304 


0.25 
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601 


] 

Z69388 


ftuman DNA sequence from cosmid L60G9A, Huntington's Disease 
Region, chromosome 4pl6.3 contains ESTs and a CpG island 


0.084 


602 


U84760 


Chromatium vinosum dsr locus, complete sequence 


0.022 


603 


AF122050 


Hordeum vulgare limit dextrinase gene, complete cds 


1.8 


604 


AJ308471 


Mus musculus partial gene for acetyl-CoA carboxylase-alpha, 
promoter I 


0.022 


605 


AF158032 


Cixius inflatus 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


0.008 


607 


XMJ)12733 


Homo sapiens KIAA1012 protein (KIAA1012), mRNA 


0.005 


608 


D37781 


Human mRNA for protein-tyrosine phosphatase HPTPeta, complete 
cds 


0.059 


609 


AC002252 


Homo sapiens (subclone l_g7 from BAC H76) DNA sequence, 
complete sequence 


7E-11 


610 


AF 160964 


Peanut witches'-broom phytoplasma RNA polymerase sigma factor 
(rpoD) gene, complete cds 


0.63 


611 


U67528 


Methanococcus jannaschii section 70 of 150 of the complete genome 


0.67 


612 


AF063134 


Justicia caudata tRNA-Leu (trnL) gene, chloroplast gene for 
chloroplast RNA, partial sequence 


1.2 


613 


U67527 


Methanococcus jannaschii section 69 of 150 of the complete genome 


1.5 


614 


AK022455 


Homo sapiens cDNA FLJ12393 fis, clone MAMMA1002711 


0.000001 


615 


AF272082 


Poiretia punctata trnK gene, intron, 3* partial sequence; and maturase 
(matK) gene, partial cds; chloroplast genes for chloroplast products 


0.075 


616 


AF1 12461 


Homo sapiens G protein-coupled receptor 57 (GPR57) gene, complete 
cds 


0.025 


617 


AF310884 


Dictyostelium discoideum RaclB (raclB) gene, complete cds; racK 
pseudogene, complete sequence; and unknown genes 


5.6 


618 


AF348342 


Plasmodium malariae caseinolytic protease C (clpC) gene, partial cds; 
apicoplast gene for apicoplast product 


0.22 


619 


AB043965 


Danio rerio gene for dharma, promoter and partial cds 


0.054 


620 


NC 001839 


Petunia vein clearing virus, complete genome 


0.61 


621 


AK023959 


Homo sapiens cDNA HJ13897 fis, clone THYRO1001706 


0.67 


622 


AF028835 


Drosophila pallidosa vermilion (v) gene, complete cds 


0.13 j 


623 


U67598 


Methanococcus jannaschii section 140 of 150 of the complete genome 


0.072 


624 


AF283103 


Pineapple mealybug wilt associated virus-2 polyprotein (ORFla) and 
RNA-dependent RNA polymerase genes, partial cds; and hydrophobic 
protein p5, heat shock protein 70, p46, coat protein, diverged coat 
protein, p20, p22, and p6 genes, complete cds 


1.5 


625 


XM 008161 


Homo sapiens hypothetical protein FLJ20345 (FLJ20345), mRNA 


1E-15 


626 


XM 035599 


Homo sapiens adaptor-related protein complex 4, epsilon 1 subunit 
(AP4E1), mRNA 


6E-22 
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627 


AF143066 


Lepidopilum scabrisetum ribosomal protein system 4 (rps4) gene, 
partial cds 


0.61 


628 


U89283 | 


Biomphalaria glabrata myoglobin gene, complete cds 


0.46 


629 


U44949 


Xenopus laevis zona pellucida A glycoprotein homolog (xlZPA) 
mRNA, complete cds 


1.6 


630 


AE007395 


Streptococcus pneumoniae section 78 of 194 of the complete genome 


0.15 


631 


BC006963 ! 


Mus musculus, clone IMAGE:3708410, mRNA, partial cds 


1.6 


632 


XM 031120 


Homo sapiens peptidylglycine alpha-amidating monooxygenase 
(PAM), mRNA 


4E-40 


633 


AF316122 


Kinosternon arizonense isolate KariAZ2 mitochondrial control region, 
complete sequence 


0.69 


634 


U67561 


Methanococcus jannaschii section 103 of 150 of the complete genome 


0.22 


635 


D88010 


Human DNA for ribosomal protein S13, complete cds, U14 small 
nucleolar RN A, complete sequence 


0.24 


636 


AE006321 


Lactococcus lactis subsp. lactis IL1403 section 83 of 218 of the 
complete genome 


2.2 


637 


AF045595 


Carassius auratus clone gf-40 glutamic acid decarboxylase isoform 67 
(GAD67) mRNA, complete cds 


0.74 


638 


D38378 


Human DNA, novel tandem repeat sequence 


0.21 


639 


AF260819 


Plasmodium falciparum CGI protein gene, complete cds 


0.003 


640 


NC 000857 


Ceratitis capitata complete mitochondrial genome 


0.23 


641 


AK015383 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4930444M15, full insert sequence 


2.1 


642 


AF286367 


Homo sapiens HMGIY gene, promoter 


1.6 


643 


AK019143 


Mus musculus 10 days embryo cDNA, RIKEN full-length enriched 
library, clone:2600001P13, full insert sequence 


0.69 


644 


AE001421 


Plasmodium falciparum chromosome 2, section 58 of 73 of the 
complete sequence 


2 


645 


D16904 


Human HepG2 3' region cDNA, clone hmd3a02 


0.21 


646 


XM 003799 


Homo sapiens protocadherin beta 10 (PCDHB10), mRNA 


0.69 


647 


NC_001438 


Bean golden mosaic virus A component DNA, complete sequence 


0.66 


648 


AK013971 


Mus musculus 13 days embryo head cDNA, RIKEN full-length 
enriched library, clone:3110001P07, full insert sequence 


2 


649 


AF078880 


Prosimulium formosum dopa decarboxylase (DDC) gene, partial cds 


0.076 


ODU 


AJ009600 


Begonia taipeiensis rbcL promoter, clone 16320 


0.22 


651 


AF042834 


Homo sapiens phosphodiesterase delta subunit gene, exons 2, 3 and 4 


5.8 


652 


AF244214 


Dysdera ratonensis 16S ribosomal RNA gene, partial sequence 


0.075 


653 


XM 043252 


Homo sapiens PCAF associated factor 65 alpha (PAF65 A), mRNA 


2E-33 
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654 


i 
j 

1 

AF105116 


Streptococcus pneumoniae type 19C Cpsl9CR (cpsl9CR) gene, 
partial cds; putative oligosaccharide repeat unit transporter (cpsl9CJ), 
JDP-N-acetyl glucosamine-2-epimerase (cpsl9CK), and putative 
glucosyl transferase (cpsl9CS) genes, complete cds; and gluO 


1.9 


655 


AF181720 


Homo sapiens RU2AS (RU2) gene, complete cds; and RU2S (RU2) 
gene, partial cds 


0.23 


656 


AL392164 


Human DNA sequence from clone RP1 1-269L6 on chromosome X, 
complete sequence [Homo sapiens] 


0.0003 


657 


AF189720 


Oiyctolagus cuniculus chloride channel (CFTR) mRNA, complete cds 


0.18 


658 


AF077821 


Canis familiaris inducible nitric oxide synthase mRNA, complete cds 


0.077 


659 


NC 001327 


Ascaris suum mitochondrion, complete genome 


0.074 


660 


XM 003164 


Homo sapiens KIAA0218 gene product (KIAA0218), mRNA 


5.8 


661 


M22207 


T.gratilla 217g gene encoding 217g protein, complete cds 


0.071 


662 


BC004062 


Mus musculus, Similar to cofactor required for Spl transcriptional 
activation, subunit 6 (77kD), clone IMAGE:3590309, mRNA, partial 
cds 


0.22 


663 


AF007132 


Homo sapiens clone 23551 mRNA sequence 


0.023 


666 


XM 007847 


Homo sapiens hypothetical protein from clone 24796 (LOC57146), 
mRNA 


5E-32 


667 


AE002755 


Drosophila melanogaster genomic scaffold 142000013385531, 
complete sequence 


5.8 


668 


U70780 


Clostridium botulinum unidentified protein P-48 gene, complete cds 
and neurotoxin binding protein gene, partial cds 


0.075 


669 


XM 029131 


Homo sapiens similar to KIAA0773 gene product (EL sapiens) 
(LOC90115),mRNA 


2 


670 


XM 003091 


Homo sapiens KIAA0001 gene product; putative G-protein-coupled 
receptor, G protein coupled receptor for UDP-giucose (KIAA0001), 
mRNA 


1.7 


671 


AF284881 


Carex pauciflora tRNA-Thr, partial sequence; trnT-trnL intergenic 
spacer, tRNA-Leu, and trnL-trnF intergenic spacer, complete 
sequence; and tRNA-Phe, partial sequence; chloroplast genes for 
chloroplast products 


1.6 


672 


NC 001381 


Plasmid pAJL5000, complete sequence 


6.1 


673 


AF081201 


Arabidopsis thaliana villin 1 (VLN1) mRNA, complete cds 


0.21 


674 


j AK023674 


Homo sapiens cDNA FU13612 fis, clone PLACE1010833, weakly 
similar to C ALTRACTTN 


0.54 


675 


X60097 


Streptococcus sp. (group C) emm gene for M protein 


0.063 


676 


AF026087 


Schizosaccharomyces pombe Satl gene, partial cds 


0.22 


677 


X00117 


Galago Alu repeat type I, GAL9 


0.0001 


678 


AJ276629 


Rattus norvegicus Sacm21/RT1-A intergenic region, haplotype RTln 
and partial RT1-A gene for MHC Class I antigen 


1.9 


679 


AK021058 


Mus musculus adult male corpus striatum cDNA, RIKEN full-length 
enriched library, clone:C030009O12, full insert sequence 


0.23 
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680 [ 


: 

BC003727 


Mus musculus, Similar to Interleukin 1 alpha, clone MGC:5780 
IMAGE:3599550, mRNA, complete cds 


1.9 


681 j 


XM 048261 


Homo sapiens similar to ring finger protein 23; RING-B box-coiled 
coil-B30.2 (M. musculus) (LOC92947), mRNA 


5E-18 


682 


XM 038154 


Homo sapiens KIAA1016 protein (KIAA1016), mRNA 


0.23 


683 


XM 049918 


Homo sapiens similar to hypothetical protein MGC10940 (H. sapiens) 
(LOC93230), mRNA 


0 


684 


X14727 


Sheep beta-B globin gene 


0.68 


i 685 I 


AB052187 


Macaca fascicularis brain cDNA, clone:QnpA-12170 


0.024 | 


686 


AK004783 


Mus musculus adult male lung cDNA, RIKEN full-length enriched 
library, clone:1200015E14, full insert sequence 


5.8 


687 


AE002799 


Drosophila melanogaster genomic scaffold 142000013385566, 
complete sequence 


0.003 


688 


AF190395 


Corymbia calophylla tRNA-Leu and tRNA-Phe genes, partial 
sequence; chloroplast genes for chloroplast products 


1.9 


689 


U57896 


Candida albicans cytochrome C gene (CYC1) gene, complete cds 


0.22 


690 


AE006710 


Sulfolobus solfataricus section 69 of 272 of the complete genome 


2 


691 


AL392164 


Human DNA sequence from clone RP1 1-269L6 on chromosome X, 
complete sequence [Homo sapiens] 


0.0003 


692 


D87001 


Homo sapiens immunoglobulin lambda gene locus DNA, clone:47H9 
downstream contig 


0.008 


693 


Y13853 


Drosophila erecta clone Erl inactive Bari-1 family transposon 


0.16 


694 


AB022927 


Oiyctolagus cuniculus HAC4 mRNA for hyperpolarization activated 
cation channel, complete cds 


0.66 


695 


AF307327 


Araraquara virus medium RNA segment, G1/G2 glycoprotein 
precursor gene, partial cds 


2 


696 


AF269369 


Staphylococcus epidennidis strain SRI clone step.l002g02 genomic 
sequence 


0.57 


697 


XM 043252 


Homo sapiens PCAF associated factor 65 alpha (PAF65 A), mRNA 


2E-33 


698 


XM 008640 


Homo sapiens lethal giant larvae (Drosophila) homolog 1 (LLGL1), 
mRNA 


0.22 


699 


AC024829 


Caenorhabditis elegans cosmid Y55F3BM, complete sequence 


0.023 


700 


M83660 


Dictyostelium discoideum clathrin heavy chain (chcA) mRNA, 
complete cds 


1.9 


701 


X95481 


S.cerevisiae ARC1 gene 


1.9 


702 


AJ408897 


Homo sapiens chromosomal breakpoint fragment derivative 4, 
AF4/MLL fusion, patient UPN006 


A O 


/Uj 


AF1 10007 


Molgula oculata p68 RNA helicase (bobcat) and transcription factor 
(manx) genes, complete cds 


0.68 


704 


AL133339 


Human DNA sequence from clone RP4-807J1 on chromosome 20. 
Contains a GSS and a CpG island, complete sequence [Homo sapiens; 


2 


705 


U10116 


Human superoxide dismutase (SOD3) gene, complete cds 


0.23 


706 


AB037771 


Homo sapiens mRNA for KIAA1350 protein, partial cds 


0 
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707 


AF023840 


Homo sapiens natural killer group protein 2-A (NKG2-A) gene, 
complete cds 


0.68 


708 


U49023 


Tegeticula maculata extranea cytochrome oxidase subunit I, 
cytochrome oxidase subunit II genes, mitochondrial genes encoding 
mitochondrial proteins, partial cds, and tRNA-Leu gene, 
mitochondrial gene 


0.001 


709 


XM_044768 


Homo sapiens similar to death receptor 6 (H. sapiens) (LOC92385), 
mRNA 


e-126 


710 


D50006 


Human DNA for alpha-platelet-derived growth factor receptor, exon 6- 
10 


2.1 


711 | 


Y14524 


Tetrahymena thermophila tRNA-Tyr gene, exons 1 and 2, clone 
pTetY2 


0.057 


712 


AL592159 


Human DNA sequence from clone RP13-237P14 on chromosome X, 
complete sequence [Homo sapiens] 


0.52 


713 


U88825 


HIV-1 isolate 92NG003 from Nigeria complete genome 


3.1 


714 


U05813 


Blastocrithidia culicis ATCC 30268 kinetoplast (MURF1) and 
apocytochrome B (cytB) genes, partial cds, and ATPase subunit 6 
(MURF4) gene, complete pre-RNA-edited DNA sequence 


0.23 


715 


AF048990 


Homo sapiens MutS homolog 5 (MSH5) gene, exons 11 and 12 


0.0009 


716 


AF3 17082 


Adoncholaimus thalassophygas isolate Sou39 large subunit ribosomal 
RNA gene, partial sequence; mitochondrial gene for mitochondrial 
product 


0.0008 


717 


AK013269 


Mus musculus 10, 1 1 days embryo cDNA, RIKEN full-length 
enriched library, clone:2810439K08, full insert sequence 


0.024 


718 


L14769 


Physarum polycephalum cytochrome oxidase subunit 1 mRNA, 
complete cds; mitochondrial gene for mitochondrial product 


1.6 


720 


U45453 


Xenopus laevis transcription factor xGATA-4 mRNA, complete cds 


0.61 


721 


AF090432 


Danio rerio serrateB mRNA, complete cds 


0.075 


722 


M62504 


Spiroplasma citri fibril protein gene, complete cds 


0.024 


1 723 


AK000939 


Homo sapiens cDNA FLJ10077 fis, clone HEMBA1001864 


0 


724 


XM_032587 


Homo sapiens 2127 (KIAA1754), mRNA 


0.22 


726 


AF012899 


Sambucus nigra ribosome inactivating protein precursor mRNA, 
complete cds 


0.000001 


727 


S82175 


RAR-gamma 2.2=retinoic acid receptor gamma isoform 2.2 
{alternatively spliced} [Xenopus laevis, embryos, stage 24, mRNA, 
2154 nt] 


0.008 


728 


AK002856 


Mus musculus adult male kidney cDNA, RIKEN full-length enriched 
library, clone:0610040A09, full insert sequence 


0.44 


729 


AF077828 


Oryctolagus cuniculus indolethylamine N-methyltransferase (INMT) 
gene, exon 3 and complete cds 


1.4 


730 


BC006633 


Mus musculus, Flt3 interacting zinc finger protein 1, clone 
MGC7201 IMAGE:3482191, mRNA, complete cds 


1.3 


731 


AK024670 


Homo sapiens cDNA: FU21017 fis, clone CAE05907 


0.5 


732 


Z35823 


S.cerevisiae chromosome II reading frame ORF YBL062w 


1.1 


733 


Z48636 


C.novyi gene for alpha-toxin 


0.065 
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734 


BC008580 


Homo sapiens, clone IMAGE:4 179986, mRNA, partial cds 


0.71 | 


735 


L37746 


Homo sapiens lamin Bl gene, exon 10 


0.11 


736 


AF142372 


Anolis roquet cytochrome b gene, partial cds; mitochondrial gene for 
mitochondrial product 


0.95 


737 


AF107454 


Homo sapiens clone cD622 mRNA sequence 


0.48 


738 


XM_010112 


Homo sapiens solute carrier family 6 (neurotransmitter transporter), 
member 14 (SLC6A14), mRNA 


0.5 


739 


AF188067 


Arabidopsis thaliana clone AtDP5 isopentenyl 
pyrophosphaterdimethyllallyl pyrophosphate isomerase (ipiAt2) 
mRNA, partial cds 


1.6 


740 


X90383 


A.thaliana DNA for Y13 gene 


0.034 i 


741 


AE004309 


Vibrio cholerae chromosome I, section 217 of 251 of the complete 
chromosome 


0.18 


742 


AF236642 


Calothrix parietina clone 102-2A 16S-23S internal transcribed spacer, 
complete sequence; and tRNA-Ile and tRNA-Ala genes, complete 
sequence 


1.1 


744 


AF107699 


Homo sapiens RING3 protein (RING3) gene, partial cds 


0.027 


745 


XM 035624 


Homo sapiens KIAA0876 protein (KIAA0876), mRNA 


3.4 | 


746 


XM_001795 


Homo sapiens lamin B receptor (LBR), mRNA 


0.15 


747 


AF395180 


Panteliella bicolor cytochrome oxidase subunit I gene, partial cds;. 
mitochondrial gene for mitochondrial product 


0.093 


748 


AF262633 


Homo sapiens clone NIGMS NA11418 chromosome 15 paralogous 
sequence variant, genomic sequence 


0.27 


749 


AF086375 


Homo sapiens full length insert cDNA clone ZD68B12 


0.000003 


750 


AJ271002 


Mus musculus TFFl/pS2 gene for Trefoil Factorl/pS2, exons 1-3 


0.8 


751 


AF063681 


Mus musculus type Xm collagen (coll3al) gene, exons 22-27 


0.53 


752 


AL031634 


Caenorhabditis elegans cosmid Y32B12C, complete sequence 


0.053 


753 


Y00165 


Sus scrofa mRNA for link protein 


0.065 


754 


AP000693 


Homo sapiens genomic DNA, chromosome 21q22.2, PCR iragment 
from BAC clone:KB739Cll, CBR1-HLCS region 


0.022 


755 


AK021535 


Homo sapiens cDNA FLJ11473 fis, clone HEMBA1001712 


0.064 


756 


AF305539 


Mus musculus B lymphocyte induced maturation protein 1 (Prdml) 
gene, exon 8 and complete cds 


4E-12 


757 


J05493 


S.pombe no message in thiamine protein (mntl) gene, complete cds 


0.55 


758 


AE001480 


Helicobacter pylori, strain J99 section 41 of 132 of the complete 
genome 


0.51 


1 759 


AF072685 


Bos taurus matrix metalloproteinase 13 (MMP13) mRNA, complete 
cds 


0.072 


760 


XM_038408 


Homo sapiens hypothetical protein FLJ11331 (FLJ11331), mRNA 


0.6 


761 


XM041148 


Homo sapiens hypothetical gene supported by AF037219 
(LOC91856), mRNA 


1.7 


762 


AF234897 


Haemulon plumieri Hpl224 NADH dehydrogenase I (ND-1) gene, 
complete cds; mitochondrial gene for mitochondrial product 


3 
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763 


NM 008399 ] 


Mus musculus integrin, alpha E, epithelial-associated (Itgae), mRNA 


1.4 


764 


Z74898 


S.cerevisiae chromosome XV reading frame ORF YOL156w 


1.8 


765 


AF280812 


Glycine max putative Hslpro-1-like receptor mRNA, complete cds 


0.59 


766 


AB013615 


Broad bean wilt virus 2 genomic RNA,* complete sequence of RNA1 


0.68 


768 


AF079169 


Nicotiana tabacum feedback-insensitive anthranilate synthase alpha-2 
chain (ASA2) gene, promoter region and 5UTR 


5.9 


769 


ABO 14079 


Homo sapiens genomic DNA, chromosome 6p21.3, HLA class I 
region, Cosmidclone:TYlEll, complete sequence 


0.007 


770 


XM 012336 


Homo sapiens MBIP protein (MBIP), mRNA 


0.0003 


111 


U41509 


Caenorhabditis elegans cosmid W03H1, complete sequence j 


0.18 


772 


U23478 


Dictyostelium discoideum phosphatidylinositoW^-diphosphate 3- 
kinase (PIK3) mRNA, partial cds 


0.00002 


773 


AP000380 


Arabidopsis thaliana genomic DNA, chromosome 5, genomic PCR 
product GA469 


0.39 


774 


AF1 15772 


Danio rerio basic helix-loop-helix transcription factor Ndrla (ndrla) 
mRNA, complete cds 


0.2 


775 


XM 007458 


Homo sapiens butyrate response factor 1 (EGF-response factor 1) 
(BRFl), mRNA 


0.42 


776 


L31955 


Human (clone 1NIB-2) normalized cDNA library sequence 


1.6 


111 


D49711 


Populus kitakamiensis (P. sieboldii X P. grandidentata) homt3 gene 
for caffeic acid O-methyltransferase, complete cds (exonl-4) 


0.046 


IIS 


L49234 


Manduca sexta chittnase precursor gene, complete cds 


0.05 


119 


AY008848 


Ictalurus punctatus clone Icpu-UA/3 MHC class I antigen gene, 
complete cds 


0.015 


780 


AB048375 


Oserya coulteriana chloroplast matK gene for maturase K, complete 
cds 


1.9 


781 


XM 010323 


Homo sapiens toll-like receptor 7 (TLR7), mRNA 


0.21 


782 


X69121 


P.falciparum Pfgrp mRNA for heat shock protein 


0.18 


783 


AE001570 


Helicobacter pylori, strain J99 section 131 of 132 of the complete 
genome 


0.05 


784 


M31756 


Human t(9;22) chromosome 9 breakpoint DNA 


0.77 


785 


M37889 


Staphylococcus aureus replication (rep), control of replication (cop), 
and resistance protein (QacC) genes, complete cds 


0.2 


786 


AF091326 


Plasmodium falciparum histone deacetylase (HDAC1) mRNA, 
complete cds 


0.19 


787 


Z74851 


S.cerevisiae chromosome XV reading frame ORF YOL109w 


0.2 


788 


Z85396 


Rsapiens Ig lambda light chain variable region gene (34-34S , WlJUb"i2) 
rearranged; Ig-Light-Lambda; VLambda 


0.56 


789 


AF075681 


Hemorrhagic enteritis virus hexon protein gene, partial cds 


2.1 


790 


AY029613 


Mus musculus GTP binding protein 3 gene, complete cds; nuclear 
gene for mitochondrial product 


5.8 


791 


NC 001338 


Sulfolobus virus 1, complete genome 


0.63 
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792 


U67563 


Vlethanococcus jannaschii section 105 of 150 of the complete genome 


0.11 


794 


U67549 


Methanococcus jannaschii section 91 of 150 of the complete genome 


0.21 


795 


AF293450 


Canis familiaris BPAGl-like gene sequence 


0.53 


797 


XM 029942 


Homo sapiens KIAA0877 protein (KIAA0877), mRNA 


1.4 ; 


798 


BC009775 


Homo sapiens, Similar to hypothetical protein FLJ10842, clone 
MGC: 13398 IMAGE:4092662, mRNA, complete cds 1 


0.061 


799 


NM 008399 


Mus musculus integrin, alpha E, epithelial-associated (Itgae), mRNA 


0.24 


800 


NC 001733 


Methanococcus jannaschii small extra-chromosomal element, 
complete sequence 


2 


801 ! 


Z98532 


S.pombe chromosome I cosmid clBl 


0.66 


802 


AL158132 


Human DNA sequence from clone RP3-323A24 on chromosome 4. 
Contains GSSs and a putative CpG island, complete sequence [Homo 
sapiens] 


0.003 


803 


XM 007847 


Homo sapiens hypothetical protein from clone 24796 (LOC57146), 
mRNA 


5E-32 


804 


XM 015759 


Homo sapiens hypothetical protein MGC2615 (MGC2615), mRNA 


1E-48 


805 


XM 003799 


Homo sapiens protocadherin beta 10 (PCDHB10), mRNA 


0.71 


806 


AF098697 


Mandarina mandflrina from Anijima population 16S ribosomal RNA 
gene, mitochondrial gene encoding mitochondrial product, partial 
sequence 


0.086 


807 


AE007509 


Streptococcus pneumoniae section 192 of 194 of the complete genome 


1.6 


808 


AY035064 


Arabidopsis thaliana unknown protein (T20O10_100/AT3g63000) 
mRNA, complete cds 


0.2 


809 


AE000595 


Helicobacter pylori 26695 section 73 of 134 of the complete genome 


0.046 


810 


Z75093 


S.cerevisiae chromosome XV reading frame ORF YOR185c 


0.11 


811 


M30825 


Drosophila melanogaster ovarian tumor protein isoforms (otu) gene, 
complete cds, alternatively spliced 


0.11 


812 


Y17004 


Suberites domuncula mRNA for monoubiquitin 


0.005 


813 


BC005104 


Homo sapiens, Similar to RIKEN cDNA 5830420C20 gene, clone 
IMAGE: 3633379, mRNA, partial cds 


1.7 


814 


XMJ)29941 


Homo sapiens cadherin-like 22 (CDH22), mRNA 


0.0006 


815 


NM 003246 


Homo sapiens thrombospondin 1 (THBS1), mRNA 


0.23 


816 


U38414 


Canis familiaris junctional sarcoplasmic reticulum protein mRNA, 
complete cds 


0.21 


817 


XM 004000 


Homo sapiens chromodomain helicase DNA binding protein 1 
(CHD1), mRNA 


0.63 


818 


AF006062 


Cryptococcus neoformans var. grubii strain H99 urease (URE1) gene, 
complete cds 


1.7 


819 


XM 018303 


Homo sapiens hypothetical protein FLJ11565 (FLJ11565), mRNA 


0.07 . 



246 



INSDOCID: <WO 021 4500A2J_> 



WO 02/14500 



PCT/US01/25840 





Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


820 


AE001164 ] 


Borrelia burgdorferi (section 50 of 70) of the complete genome 


1.6 


821 


AF022969 < 


Caenorhabditis elegans cosmid C29G2 


0.2 


822 


AK021462 1 


Homo sapiens cDNA FLJ11400 fis, clone HEMBA1000673 


1.6 


823 


Y10908 1 


B.cereus cysA and wapA genes 


0.0003 


824 


U36892 


Triticum aestivum clone AC29 microsatellite sequence 


3.4 


825 


AF258615 


Dictyostelium discoideum LagC protein (lagC) gene, complete cds 


0.011 


826 


Z99763 


Flaveria pringlei gdcsH gene 


0.023 


827 


X71543 


B.taurus microsatellite sequence INRA1 18 | 


7E-15 


828 


AF349526 


Arabidopsis thaliana putative 60S ribosomal protein (T6C23. 18) 
mRNA, complete cds 


1.7 


829 


D87001 


Homo sapiens immunoglobulin lambda gene locus DNA, clone:47H9 
downstream contig 


0.007 


830 


XM 006470 


Homo sapiens radixin (RDX), mRNA 


2.4 


831 


AF191609 


Homo sapiens filamin (FLNB) gene, exon 21 


e-140 


832 


AF064754 


Homo sapiens nuclear inhibitor of protein phosphatase-1 (PPP1R8) 
gene, exon 4 


5.5 


833 


NC 001438 


Bean golden mosaic virus A component DNA, complete sequence 


0.57 


834 


D14567 


Penicillium urticae mitochondrial 1-rRNA (large rRNA) gene and its 
flanking region 


0.23 


835 


AP000301 


Homo sapiens genomic DNA, chromosome 21q22.1, D21S226-AML 
region, clone:D16B8, complete sequence 


0.5 


836 


NM_014147 


Homo sapiens HSPC047 protein (HSPC047), mRNA 


1E-90 


837 


AF296100 


Porcine teschovirus strain DS 562/91 polyprotein gene, complete cds 


1.3 


838 


AK020740 


Mus musculus adult male spinal cord cDNA, RIKEN full-length 
enriched library, clone: A330106M24, full insert sequence 


1.6 


839 


XM 045585 


Homo sapiens hypothetical protein FLJ11085 (KIAA1434), mRNA 


3E-16 


840 


AF115104 


Homo sapiens glutaredoxin (GLRX) gene, exon 1 


1.3 


841 


AF094673 


Mus musculus Sosl gene, intron 


3E-26 


842 


AK025101 


Homo sapiens cDNA: FLJ21448 fis, clone COL04473 


5E-93 


843 


M85276 


Homo sapiens NKG5 gene, complete cds 


0.039 


844 


AK022390 


Homo sapiens cDNA FLJ12328 fis, clone MAMMA1002145 


0.018 


845 


AF269767 


Staphylococcus epidennidis strain SRI clone step.l023b06 genomic 
sequence 


0.47 


846 


NM 005916 


Homo sapiens minichromosome maintenance deficient (S. cerevisiae) 
7 (MCM7), mRNA 


4.5 


847 


i AF209976 


Rattus norvegicus stromal cell-derived factor 1 (SDF-1) mRNA, 
complete cds 


1.3 


848 


NM 017078 


Rattus norvegicus Acetylcholine receptor alpha 5 (Chrna5), mRNA 


0.95 


849 


AF269324 


Staphylococcus epidennidis strain SRI clone step.l002al2 genomic 
sequence 


0.16 
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850 


AK008669 


Mus musculus adult male stomach cDNA, RJKEN full-length 
enriched library, clone:2210008F06, full insert sequence 


4.3 


851 


XM 044217 


Homo sapiens Ran binding protein 11 (LOC51194), mRNA 


0.44 


852 


XM 011510 


Homo sapiens CDC5 (cell division cycle 5, S. pombe, homolog)-like 
(CDC5L), mRNA 


0.056 


853 


AF068574 


Arabidopsis thaliana ferulate-5-hydroxylase (F5H) gene, complete cds 


0.042 


854 


AE003072 


Drosophila melanogaster genomic scaffold 142000013386025, 
complete sequence 


1.4 


855 


AF159462 


Escherichia coli EHEC factor for adherence (efal) gene, complete cds 


0.42 


856 


AF1 10797 


Rattus norvegicus GABAB1 receptor (GABABR1) gene, exons 10 
through 20 and alternatively spliced products, complete cds 


L6 


857 


AF319172 


Homo sapiens cGMP-specific phosphodiesterase type 5 A (PDE5A) 
gene, partial cds 


0.6 


858 


U90299 


Human chromosome 10 duplicated adrenoleukodystrophy (ALD) 
gene, 3' breakpoint sequence 


0.0000003 


859 


AL050065 


Homo sapiens mRNA; cDNA DKFZp566M043 (from clone 
DKFZp566M043) 


0.006 


860 


U32802 


Haemophilus influenzae Rd section 117 of 163 of the complete 
genome i 


1.4 


861 


AK001127 


Homo sapiens cDNA FLJ10265 fis, clone HEMBB 1001014 


0.17 


• 862 


AB042995 


Plecoglossus altivelis DNA, microsatellite, clone:Pal-3* 


0.45 


863 


NM_004668 


Homo sapiens maltase-glucoamylase (alpha-glucosidase) (MGAM), 
mRNA 


0.001 


864 


NM 021697 


Rattus norvegicus neuronal potassium channel alpha subunit (Kv8.1), 
mRNA 


0.19 


865 


Z28173 


S.cerevisiae chromosome XI reading frame ORF YKL173w 


0.035 


866 


XM 051783 


Homo sapiens hypothetical protein FLJ11712 (FLJ11712), mRNA 


1.1 


867 


X92498 


M.musculus fkh-6 gene 


0.99 


868 


XM 041339 


Homo sapiens similar to phosphodiesterase 4D interacting protein 
(myomegalin) (H. sapiens) (LOC91892), mRNA 


1.5 


869 


AL590401 


Human DNA sequence from clone RP1 1-466P12 on chromosome 6, 
complete sequence [Homo sapiens] 


0.16 


870 


Z68758 


Human DNA sequence from clone LL22NC03-85E10 on chromosome 
22 Contais GSSs, complete sequence [Homo sapiens] 


0.021 


871 


Z34916 


O.aries DNA for natural resistance associated macrophage protein 


4.6 


872 


NM 009253 


Mus musculus serine protease inhibitor-2 related sequence 1 (Spi2- 
rsl), mRNA 


1.5 


873 


AY024283 


Oryza sativa microsatellite MRG6608 containing (TGTA)X6, 
genomic sequence 


3.8 


874 


Z36108 


S.cerevisiae chromosome II reading frame ORF YBR239c 


1.6 


875 


XM 008791 


Homo sapiens hypothetical protein (LOC51320), mRNA 


0.14 
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876 


X89383 | 


R.norvegicus mRNA for SNF1 -related kinase 


0.18 ! 


877 


XM 002378 


Homo sapiens uncharacterized hypothalamus protein HT010 (HT010), 
mRNA 


0 


879 


XM 047537 


Homo sapiens DKFZP434C153 protein (DKFZP434C153), mRNA 


0.003 


880 


XM_006700 


Homo sapiens checkpoint with forkhead and ring finger domains 
(CHFR), mRNA 


4E-27 


881 


NC_002756 


Mertensiella luschani mitochondrion, complete genome 


2.1 


882 


M90915 


Human immunodeficiency virus type 1, viral sample LC01.DA11, V3 
region 


0.019 


883 


AB011534 


Rattus norvegicus mRNA for MEGF8, partial cds 


0.19 


885 


AF364468 


Simian-Human immunodeficiency virus clone 201/26w 1 envelope 
glycoprotein (env) gene, partial cds 


L4 


886 


XM_005320 


Homo sapiens cerberus 1 (Xenopus laevis) homolog (cysteine knot 
superfamily) (CER1), mRNA 


0.63 


887 


AC079875 


Homo sapiens clone CTD-2333B13, complete sequence 


0.024 | 


888 


NM 032394 


Mus musculus myosin VTTb (Myo7b), mRNA 


6.6 


889 


AF209448 


Borrelia hermsii plasmid cp32-like EcoRI-cut fragment #3 


1.8 


890 


AK008242 


Mus musculus adult male small intestine cDNA, RIKEN full-length 
enriched library, clone:2010015A19, full insert sequence 


0.065 


891 


AF305851 


Homo sapiens clone 16 ribosomal RNA intergenic spacer region, j 
partial sequence 


2 


892 


XM 047604 


Homo sapiens coproporphyrinogen oxidase (coproporphyria, 
harderoporphyria) (CPO), mRNA 


0.22 


893 


AF211133 


Carsonella ruddii natural-host Ctenarytaina eucalypti tryptophanyl- 
tRNA synthetase (trpS) gene, partial cds; 16S ribosomal RNA 23 S 
ribosomal RNA, and 5S ribosomal RNA genes, complete sequence; 
transaldolase (tal) and alkyl hydroperoxide reductase smal> 


0.2 


894 


AY047513 


Drosophila melanogaster GH01880 full length cDNA 


8.8 


896 


AL158132 


Human DNA sequence from clone RP3-323A24 on chromosome 4. 
Contains GSSs and a putative CpG island, complete sequence (Homo 
sapiens] 


0.0001 


1 897 


Z21978 


B.napus Myr2.Bnl gene encoding myrosinase, thioglucoside 
glucohydrolase 


1.4 


898 


X95904 


L.esculentum mRNA for 14-3-3 protein, TFT6 


0.81 


899 


Z48930 


C.caldarium mitochondrial genes (lOkb) 


0.27 


900 


XMJ)38535 


Homo sapiens KIAA1074 protein (KIAA1074), mRNA 




901 


AF016898 


Homo sapiens B-ATF gene, complete cds 


0.002 


902 


XM_051951 


Homo sapiens similar to PRO0097 protein (H. sapiens) (LOC93544), 
mRNA 


0.003 


903 


X89383 


R.norvegicus mRNA for SNF1 -related kinase 


0.23 


904 


AF056336 


Danio rerio bone morphogenetic protein 4 precursor (BMP4) gene, 
complete cds 


0.002 


905 


U34339 


Protopterus aethiopicus 28S ribosomal RNA gene, partial sequence 


1E-09 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


906 


XM 041974 


Homo sapiens hypothetical gene supported by AK025102 
(LOC91974), mRNA 


0.06 


907 


D 14567 


Penicillium urticae mitochondrial 1-rRNA (large rRNA) gene and its 
flanking region 


0.26 


908 


AL035632 


Drosophila melanogaster BAC clone BACN32G11 


0.52 


909 


AF220354 


Drosophila melanogaster mitotic kinesin-like motor protein CENP- 
ana (cana) mRNA, complete cds 


0.86 


910 


U67484 


Methanococcus jannaschii section 26 of 150 of the complete genome 


2.6 


911 


Y13853 


Drosophila erecta clone Erl inactive Bari-1 family transposon 


0.17 


912 


AL1 37643 


Homo sapiens mRNA; cDNA DKFZp586A2324 (from clone 
DKFZp586A2324) 


1.3 


913 


AE006618 


Streptococcus pyogenes Ml GAS strain SF370, section 147 of 167 of 
the complete genome 


1.4 J 


914 


Z46663 


M.musculus DNA for growth hormone gene and promoter 


0.005 


915 


AF197225 | 


Emericella nidulans TUPA (tupA) gene, complete cds 


0.65 


916 


L20594 


Human annexin in (ANX3) gene, exon 4 


0.028 


917 | 


AB040730 


Clostridium haemolyticum genes for 16S rRNA, tRNA-Ala, tRNA-De, 
23 S rRNA, partial and complete sequence 


0.31 


918 


L10162 


Human variable number tandem repeat (VNTR) region, allele 13R4 3* 
to collagen type n (COL2A1) gene 


0.19 


919 


L35061 


Bacteriophage phi-41 orfs Ll-12, complete cds's, orf 13, 5' end 


0.75 


920 


AK018618 


Mus musculus adult male cecum cDNA, RIKEN full-length enriched 
library, clone:9130014E20, full insert sequence 


0.72 


921 


AC024865 


Caenorhabditis elegans cosmid Y74C10AL, complete sequence 


0.083 


922 


Z65965 


H.sapiens CpG island DNA genomic Msel fragment, clone 69g7, 
reverse read cpg69g7.rtlb 


0.46 


923 


AF364335 


Influenza A virus (A/Goose/Guangdong/3/97(H5Nl)) segment 6 
neuraminidase (NA) gene, complete cds 


0.46 


924 


M83534 


Athaliana isocitrate lyase gene, 5' end 


0.003 


925 


AE0O1390 


Plasmodium falciparum chromosome 2, section 27 of 73 of the 
complete sequence 


0.0001 


926 


AK023864 


Homo sapiens cDNA FLJ13802 fis, clone IHYRO1000186 


0 


927 


XM 010575 


Homo sapiens LOC87729 (LOC87729), mRNA 


1.3 


928 


L16771 


Drosophila simulans suppressor of forked (su(f)) gene sequence 


0.17 


931 


BC001815 


Homo sapiens, clone IMAGE:2989261, mRNA 


2E-33 


932 


BC008580 


Homo sapiens, clone IMAGE:4179986, mRNA, partial cds 


1.4 


933 


D32063 


Human gene for 2-oxoglutarate dehydrogenase, exon 14, 15, 16, 17, 
18 and 19 sequence 


0.058 


934 


XM 028551 


Homo sapiens hypothetical gene supported by AK026706 
(LOC90063), mRNA 


0.003 


935 


M87298 


Chicken neuropeptide Y gene, exons 1-3 


2.1 


936 


AE006100 


Pasteurella multocida PM70 section 67 of 204 of the complete genome 


1.5 


S 937 


L77040 


Homo sapiens (subclone 8 ell from PI H22) DNA sequence 


9E-14 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


938 


AF016349 


Homo sapiens hexokinase I (HK1) gene, exon 1R and exon 1, 
alternatively spliced 


2 


939 


BC001393 


Homo sapiens, hypothetical protein, clone MGC:782 
EMAGE:3051397, mRNA, complete cds 


0.054 


j 940 


AK024092 


Homo sapiens cDNAFLJ14030 fis, clone HEMBA1004086 


0.025 


941 


AJ244013 


Ectromelia virus proviral spi-3 gene for serine proteinase inhibitor, 
strain Hampstead (egg) 


4.3 


942 


AK000184 


Homo sapiens cDNAFLJ20177 fis, clone COL09966, highly similar 
to Y08136 H 


0 


943 


U67528 


Methanococcus jannaschii section 70 of 150 of the complete genome 


0.22 


944 


AF065599 


Schistosoma mansoni annexin mRNA, complete cds 


0.64 j 


945 


AK020851 


Mus musculus adult retina cDNA, RIKEN full-length enriched 
library, clone: A930013F10, full insert sequence 


2.7 


946 


AF067228 


Buchnera aphidicola bacterioferritin comigratory protein (bcp) gene, 
partial cds; chorismate synthase (aroC), ATP 
phosphoribosyltransferase (hisG), histidinol dehydrogenase (hisD), 
histidinolphosphate aminotransferase (hisC), 
imidazoleglycerolphosphate> 


0.021 


947 


XM 049755 


Homo sapiens hypothetical gene supported by AL 137578 
(LOC93 193), mRNA 


0.19 


948 


M14657 


Bovine thrombomodulin mRNA, 3* end 


0.004 


949 


AF3 12917 


Plasmodium falciparum reticulocyte binding protein 2 homolog B ! 
gene, complete cds 


1 4.6 


950 


U13067 


Drosophila melanogaster Twin of m4 (Tom) gene, complete cds; and 
Brd gene, promoter region and 5TJTR, partial sequence 


0.025 


951 


i AL1 13476 


Botiytis cinerea strain T4 cDNA library under conditions of nitrogen 
deprivation 


4.1 


952 


XMJH8492 


Homo sapiens talin 1 (TLN1), mRNA 


5E-50 


953 


M19686 


Bovine interstitial retinol binding protein (IRBP) mRNA, 3* flank 


0.0003 


954 


M69188 


Cotton legumin A D-genome alloallele gene, complete cds 


0.072 


955 


AK016519 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4932411G06, full insert sequence 


0.65 


956 


X53545 


T. cruzi mRNA for SA85-1.1 protein 


0.023 


957 


U67557 


Methanococcus jannaschii section 99 of 150 of the complete genome 


0.015 


958 


AL136165 


Human DNA sequence from clone RP5-863M11 on chromosome 6, 
complete sequence [Homo sapiens] 


7E-12 


959 


AE006697 


Sulfolobus solfataricus section 56 of 272 of the complete genome 


2 


960 


AB056323 


Macaca fascicularis brain cDNA, clone:QflA-10392 


0.56 


961 


XM 015216 


Homo sapiens cytochrome b5 outer mitochondrial membrane 
precursor (CYB5-M), mRNA 


0.25 


962 


U88513 


Flexamia pyrops NADH dehydrogenase 1 gene, mitochondrial gene 
encoding mitochondrial protein, complete cds 


0.025 


963 


NM 007869 


Mus musculus DnaJ-like protein 1 (Dnajll), mRNA 


1.5 


964 


AE002349 


Chlamydia muridarum, section 76 of 85 of the complete genome 


| 0.51 
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SEQ1D 
NO 


ACCESSN 


DESCRIP 


P VALUE 


965 


AF039714 


Caenorhabditis elegans cosmid T13E5 


1.8 


966 


AF111813 


Plasmodium falciparum 1-deoxy-D-xylulose 5-phosphate 
reductoisomerase (dxr) gene, complete cds 


1.3 


967 


AE000425 


Escherichia coli K12 MG1655 section 315 of 400 of the complete 
genome 


0.002 


968 


AF108658 


Homo sapiens p5327 mRNA, complete cds 


0.57 


969 


X82329 


A.hypogaea chi2.1 gene for chitinase (class II) 


3.6 


970 


AK026575 


Homo sapiens cDNA: FLJ22922 fis, clone KAT06722 


1.5 


971 


ABO16605 


Bombyx mori gene for peptidoglycan recognition protein, complete 
cds 


0.0002 


972 


AB009629 


Avian rotavirus RN A for VP 1 , complete cds 


1.4 


973 


AK020413 


Mus musculus 12 days embryo embryonic body between diaphragm 
region and neck cDNA, RIKEN full-length enriched library, 
clone:9430013L14, full insert sequence 


0.095 


974 


AF013754 


Gallus gallus transcription factor RREB-1 (rreb-1) mRNA, complete 
cds 


3.4 


975 


V01390 


Trypanosoma brucei kinetoplast DNA maxicircle fragment encoding 
two very small ribosomal RNAs, a 12S rRNA homologous to E.coli 
23S rRNA and a 9S rRNA homologous to 16S rRNA 


0.18 


976 


AL1 12360 


Botiytis cinerea strain T4 cDNA library under conditions of nitrogen 
deprivation 


0.22 


977 


AF389904 


Danaus chrysippus orientis 12S ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


0.022 


979 


X96999 


P.vulgaris YprlO gene 


1.5 | 


980 


AL1 10480 


Caenorhabditis elegans cosmid Y24F12A, complete sequence 


0.16 


981 


AJ387746 


Weldenia Candida chloroplast tRNA-Leu intron and trnL-F intergenic 
spacer 


1.8 


982 


AE001377 


Plasmodium falciparum chromosome 2, section 14 of 73 of the 
complete sequence 


0.002 


983 


U26946 


Gallus gallus inhibin beta A subunit mRNA, complete cds 


2.1 


984 


AC091651 


Homo sapiens clone RP11-17D6, complete sequence 


2E-76 


985 


AF143382 


Arabidopsis lyrata pistillata gene, complete cds 


0.02 


\ 986 


Z29969 


E.histolytica plasmid genes for ribosomal RNA and hemolysins 
HLY1, HLY5mcl HLY5mc2 HLY4 


0.02 


987 


XM 001301 


Homo sapiens KIAA0469 gene product (KIAA0469), mRNA 


0.022 


988 


AB007638 


Bacillus subtilis genomic DNA containing gutA to cotA region, 48 
degree 




989 


XM 012394 


Homo sapiens myosin EXA (MY09A), mRNA 


7E-09 


990 


AF258322 


Zenaida galapagoensis beta fibrinogen gene, intron 7 and partial cds 


5 


991 


XM 048778 


Homo sapiens similar to coagulation factor V3H, procoagulant 
component (hemophilia A) (H sapiens) (LOC93021), mRNA 


0.73 


992 


AB056815 


Macaca fascicularis brain cDNA clone:QflA-15307, full insert 
sequence 


6.4 
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SEQID 
NO 


ACCESSN 


DESCRDP 


P VALUE 


993 


] 

AJ403948 ( 


Homo sapiens partial SLC22A3 gene for organic cation transporter 3, 
sxons 3-4 


0.008 


994 


] 

AF069188 


Ephedrus laevicollis NADH dehydrogenase 1 gene, mitochondrial 
gene encoding mitochondrial protein, partial cds 


0.082 


995 


AF282920 


Staphylococcus aureus quinone oxidoreductase and glycine betaine 
transporter genes, complete cds 


6.6 


996 


AB031325 


Homo sapiens gene for calcium-sensing receptor, exons, promoter 
region 


0.081 


997 


AE001421 


Plasmodium falciparum chromosome 2, section 58 of 73 of the 
complete sequence 


0.74 


998 


AF323615 


Rattus norvegicus phosphoinositide-specific phospholipase C epsilon 
mRNA, complete cds 


2.3 1 


999 


AF305103 


Picea abies clone 1PABC5D3 hypermethylated genomic sequence 


0.25 


1000 


AY022117 


Oryza sativa microsatellite MRG4442 containing (TC)X17, closest to 
marker RG450, genomic sequence 


0.7 


1001 


NM 023324 


Mus musculus pellino 1 (Pelil), mRNA 


0.26 


1002 


X06329 


Mouse repetitive sequence LIMd-F type 5 'end (1G-6) 


0.73 


1003 


AE001173 


Borrelia burgdorferi (section 59 of 70) of the complete genome 


0.24 


1004 


AK021535 


Homo sapiens cDNA FLJ11473 fis, clone HEMBA1001712 


0.25 


1005 


U90529 


Plasmodiinn vivax SV-5 Pv200 high-binding region I gene, partial cds 


0.24 


1006 


XM 003560 


Homo sapiens MAD2 (mitotic arrest deficient, yeast, homolog)-like 1 
(MAD2L1), mRNA 


4E-26 


1007 


AF315106 


Homo sapiens isolate HERV-H-PA9 long terminal repeat, partial 
sequence 


0.000001 


1008 


U25177 


Helobdella robusta cyclin A mRNA, partial cds 


0.025 | 


1009 


NM 021401 


Mus musculus secreted and transmembrane 1 (Sectml), mRNA 


| 0.084 


1010 


AF339813 


Homo sapiens clone IMAGE:297403, mRNA sequence 


2 


1011 


AF020599 


Equus caballus insulin-like growth factor 2 (IGF2) gene, partial cds 


0.38 


1012 


BC001815 


Homo sapiens, clone IMAGE.2989261, mRNA 


7E-43 


1013 


AJ297945 


Frankia sp. ArD cryptic plasmid pFQ3 1 ORF AF, korSAF gene, ORF 
CrF, ORF DrF, repF gene, ORF FrF, ORF GF, ORF HF, ORF JF, 
ORF KrF, ORF LrF, ORF MF, ORF NrF, ORF OF, ORF PrF, parAF 
gene and ORF SrF 


0.64 


1014 


Z71327 


S.cerevisiae chromosome XIV reading frame ORF YNL051w 


0.61 


1015 


XM 031156 


Homo sapiens hypothetical gene supported by AB007970 
(LOC90360), mRNA 


0.003 


1016 


XMJ)07274 


Homo sapiens hypothetical protein FLJ10242 (FLJ10242), mRNA 


0.029 


1017 


AE001495 


Helicobacter pylori, strain J99 section 56 of 132 of the complete 
genome 


0.076 


| 1018 


AK024606 


Homo sapiens cDNA: FLJ20953 fis, clone ADSE01979 


e-109 


1019 


AF049434 


Melaleuca lateritia 5S ribosomal RNA gene, partial sequence; and 5S 
ribosomal RNA intergenic spacer, complete sequence 


1.9 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1020 


AF078543 


Pestivirus type 4 strain MN fetus polymerase (NS5b) gene, partial cds 


0.68 


1021 


M63636 


Streptococcus thermophilus beta-D-galactosidase (lacZ) gene, 
complete cds 


0.6 


1022 


AJ270226 


Entodinium caudatum partial mRNA for putative glycosyltransferase, 
clone L48 


0.67 


1023 


M59429 


T. thermophilaF-antigen (tfa) gene, complete cds 


0.009 


1024 


X77801 


S.beecheyi genomic DNA with direct repeats and integrated Hepatitis 
B virus 


1.9 


1025 


AL590109 


Human DNA sequence from clone RP1 1-405 J10 on chromosome 10, 
complete sequence [Homo sapiens] 


6E-84 


1026 


AF269904 


Staphylococcus epidermidis strain SRI clone step.l029hl0 genomic 
sequence 


0.076 


1027 


AL591343 


Human DNA sequence from clone RP11-260H5 on chromosome 6, 
complete sequence [Homo sapiens] 


0.22 


1028 


Z12986 


O.aries rearranged T-cell receptor delta 


0.78 


1029 


AY042836 


Arabidopsis thaliana Unknown protein mRNA, complete cds 


0.25 


1030 


X00117 


Galago Alu repeat type I, GAL9 | 


0.0001 


1031 


XM 015858 


Homo sapiens Werner syndrome (WRN), mRNA 


6.6 


1032 


M27444 


Bos taurus (clone pTKD7) dopamine and cyclic AMP-regulated 
neuronal phosphoprotein (DARPP-32) mRNA, complete cds 


0.63 


1033 


AF272852 


Arabidopsis thaliana beta 1,2-xylosyltransferase mRNA, complete cds 


0.77 


1034 


L08174 


Romanomeraiis culicivorax mitochondrial NADH dehydrogenase 
subunits 6 and 3 (ND6, ND3) cytochrome P450-like protein genes, 
complete CDS 


0.084 


1035 


! AF387598 


Nicotiana tabacum EPSP synthase 1 (EPSPS1) gene, partial cds 


0.007 


1036 


AF157094 


Sarcophilus harrisii mitochondrial DNA control region, partial 
sequence 


0.23 


1037 


XM_046761 


Homo sapiens hypothetical gene supported by AK000735 
(LOC92705), mRNA 


0.66 


1038 


| AF348479 


Bos taurus myostatin gene, partial cds 


6.1 


1039 


XM_050269 


Homo sapiens solute carrier femily 1 (glial high affinity glutamate 
transporter), member 3 (SLC1A3), mRNA 


0.21 


1040 


AF164967 


Canine distemper virus strain A75/17, complete genome 


7.1 


1041 


AB028634 


Flammulina velutipes mitochondrial gene for RNA polymerase, DNA 
polymerase, complete and partial cds 


0.086 


1042 


AF3 12925 


Homo sapiens cryptic (CFC1) gene, complete cds 


f\ c\(\f\f\r\A 
0.UUUUU4 




U67596 


Methanococcus jannaschii section 138 of 150 of the complete genome 


0.26 


1044 


L04631 


Human cholesterol 7 alpha-hyroxylase (CYP7) gene, exon 3 


0.009 


1045 


U50336 


Mus musculus Ah receptor gene, promoter region and exon 1, partial 
sequence 


2.1 


1046 


AF105001 


Mus musculus Fab 1098 immunoglobulin light chain variable region 
mRNA, partial cds 


0.8 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1047 


U55276 


Rattus norvegicus microtubule-associated protein IB gene, promoter 
region 


0.79 


1048 


AF056302 


Drosophila melanogaster eIF-2alpha kinase (GCN2) mRNA, complete 
cds 


0.63 


1049 


NC 001322 


Drosophila yakuba mitochondrion, complete genome 


0.24 j 


1050 


AF097036 


Oreina coerulea cytochrome oxidase subunit I gene, partial cds; 
mitochondrial gene for mitochondrial product 


0.23 


1051 


L12148 


Pasteurella haemolytica-like sp. pUktC, pllktB, pllktD, and leukotoxin 
structural protein (pllktA) genes, complete cds 


0.21 


1052 


AB005153 


Mus musculus DNA for DNA dependent protein kinase catalytic 
subunit, exon 5, 6, 7 


0.006 


1053 


AF126733 


Brassica oleracea var. italica floral homeotic protein pseudogene, 
partial sequence 


2.1 


1054 


AF090900 


Homo sapiens clone HQ0189 PRO0189 mRNA, complete cds 


0.000001 


1055 


AJ223260 


Chromodoris purpurea mitochondrial COI gene, partial 


0.007 


1056 


AF089052 


Pseudoleistes virescens cytochrome b (cytb) gene, mitochondrial gene 
encoding mitochondrial protein, partial cds 


0.77 


1057 


AL138748 


Human DNA sequence from clone RP4-732E19 on chromosome 
Xq21.1-21.33 Contains GSSs, complete sequence [Homo sapiens] 


0.59 


1058 


AL132767 


Human DNA sequence from clone RP5-819L10 on chromosome 6ql2- 
13. Contains GSSs, complete sequence [Homo sapiens] 


5 


1059 


AF032670 


Lucilia cuprina Notch homolog (Scl) gene, exon B 


0.055 


1060 


AE001371 


Plasmodium falciparum chromosome 2, section 8 of 73 of the 
complete sequence 


0.062 


1061 


AY039845 


Arabidopsis thaliana Atlg35160/T32G9_30 mRNA, complete cds 


4.6 


1062 


AF365189 


Sindoropsis le-testui tRNA-Leu (trnL) gene, partial intron sequence; 
chloroplast gene for chloroplast product 


0.11 


1063 


U60591 


Drosophila melanogaster kuzbanian (kuz) mRNA, complete cds 


0.41 


1064 


AJ297052 


Homo sapiens SDS-stable vimentin-bound DNA fragment 
HEF42VIM12 


4E-09 


1065 


AB023191 


Homo sapiens mRNA for KIAA0974 protein, partial cds 


0.23 


1066 


AK021989 


Homo sapiens cDNA FLJ11927 fis, clone HEMBB1000402 


0.33 


1067 


XM 038524 


Homo sapiens hypothetical protein FLJ23059 (FLJ23059), mRNA 


e-169 


1068 


XM 045967 


Homo sapiens estrogen receptor 1 (ESR1), mRNA 


! 4.9 


1069 


AF354258 


Homo sapiens polynucleotide kinase-3 '-phosphatase (PNKP) gene, 
complete cds 


0.66 


! 1070 


AJ404320 


Posidonia oceanica medium repetitive sequence, clone pPopm6 


0.038 


| 1071 


NC 002756 


Mertensiella luschani mitochondrion, complete genome 


4.9 


1072 


Z68886 


Human DNA sequence from cosmid L21F12, Huntington's Disease 
Region, chromosome 4pl6.3 


2E-10 


1073 


AE001370 


Plasmodium falciparum chromosome 2, section 7 of 73 of the 
complete sequence 


0.025 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1074 


XMJ)33521 


Homo sapiens hypothetical protein FLJ10357 (FLJ10357), mRNA 


1.5 


1075 


U02239 


Mycoplasma genitalium random genomic clone xd5, partial cds 


0.006 


1076 


AE006395 


Lactococcus lactis subsp. lactis EL1403 section 157 of 218 of the 
complete genome 


0.32 


1077 


AY023826 


Oryza sativa microsatellite MRG6151 containing (ATAQX8, 
genomic sequence 


0.031 


1079 


X63213 


B.taurus CI-B12 mRNA for ubiquinone oxidoreductase complex 


7 


1080 


AK021648 


Homo sapiens cDNA FLJ11586 fis, clone HEMBA1003720 


3E-13 


1081 


AF246689 


Dictyostelium discoideum ABC transporter mdrAl (MDRA1) and 
ABC transporter mdr A2 (MDRA2) genes, complete cds 


0.67 


1082 


AB016100 


Oryza sativa LINE retrotransposon, endonuclease region of RILN5 


0.068 


1083 


U93157 


Orestias species mitochondrial control region, allele SRT1, partial 
sequence 


6.9 


1084 


AF173373 


Caenorhabditis elegans transmembrane protein NDG-4 (ndg-4) 
mRNA, complete cds 


1.8 


1086 


AJ3 10640 


Turkey coronavirus genomic RNA for partial spike glycoprotein (s 
gene), 3a protein (gene 3), 3b protein (gene 3), E protein (gene 3) and 
partial M protein (gene 4), strain turkey/UK/4 12/00 


2 


1087 


M21953 


Duck hepatitis B virus, complete genome 


0.086 ! 


1088 


AF332583 


Homo sapiens stratum corneum chymotryptic enzyme (KLK7) gene, 
complete cds, alternative transcripts 


2.1 


1089 


AY037894 


Glomus intraradices phosphate transporter mRNA, partial cds 


0.084 


1091 


NM 020078 


Rattus norvegicus a disintegrin and metalloproteinase domain 1 
(fertilin alpha) (Adaml), mRNA 


0.51 


1092 


AF282531 


Homo sapiens clone 18qtel c30t7 sequence 


0.003 


1093 


AF324424 


Ichthyophthirius multifiliis immobihzation antigen isoform (IAG52A) 
gene, complete cds 


0.56 


1 1094 


AJ404320 


Posidonia oceanica medium repetitive sequence, clone pPopm6 


0.075 


1096 


Z46803 


V.narbonensis mRNA for legumin A precursor 


2.2 


1097 


XM 050195 


Homo sapiens similar to hypothetical protein FLJ14751 (H. sapiens) 
(LOC93274), mRNA 


5.7 


1098 


AJ246004 


Homo sapiens RET gene, 5 f genomic fragment 


0.00004 


1099 


AY031051 


HIV-1 isolate NC4057-1999 from USA pol polyprotein (pol) gene, 
partial cds 


0.23 


1100 


AE002146 


Ureaplasma urealyticum section 47 of 59 of the complete genome 


0.26 


1101 


Z23650 


H. sapiens (D1S436) DNA segment containing (CA) repeat; clone 
AFM217zc3; single read 


0.001 


1102 


D73382 


Sugarbeet mitochondrial gene for ORFB, complete cds 


0.73 


1103 


U67503 


Methanococcus jannaschii section 45 of 150 of the complete genome 


0.69 


1104 


AL590376 


Human DNA sequence from clone RP1 1-22P2 on chromosome Xq23- 
24, complete sequence [Homo sapiens] 


0 


1106 


D14636 


Mouse mRNA for PEBP2al protein, complete cds 


0.61 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1107 


Y14622 


Mus musculus Tnfr2 gene, exon 4 to exon 8 


0.71 


1108 


AF130049 


Homo sapiens clone FLB3411 PRO0852 mRNA, complete cds 


0.065 


1109 


AJ011009 


Glycine max cytosolic glutamine synthetase (gsl5) gene promoter 


0.007 


1110 


XMJ)43523 


Homo sapiens Rho guanine exchange factor (GEF) 11 (ARHGEF11), 
mRNA 


0.7 


1111 


Z77974 


Ksapiens flow-sorted chromosome 6 HindTTT fragment, SC6pA5D12 


0.001 


1112 


AJ002513 


Schizosaccharomyces pombe cmbl gene 


8.2 


1113 


AF098878 


Homo sapiens APAF1 gene, exons 6 and 7 


1.9 


1114 


M14080 


Herpesvirus saimiri thymidylate synthase gene, complete cds 


0.023 


1115 


U44838 


Glycine max extensin (SbHRGP3) gene, complete cds 


0.76 


1116 


Y15491 


Pongo pygmaeus gene encoding fertilin alpha 


2.1 


1117 


U43145 


Plasmodium chabaudi repeat organellar protein gene, complete cds 


0.25 


1118 


Z81524 


Caenorhabditis elegans cosmid F32H5, complete sequence 


0.009 


1119 


AF052957 


Homo sapiens type XV collagen (COL15A1) gene, exons 3 and 4 


2.2 


1120 


U67523 


Methanococcus jannaschii section 65 of 150 of the complete genome 


0.001 


1121 


AL121807 


S.pombe chromosome in cosmid cl32 


0.19 


1122 


X59280 


S.douglasii gene for cytochrome b 


0.29 


1123 


XM_030915 


Homo sapiens KIAA0673 protein (KIAA0673), mRNA 


5E-19 


1124 


X99254 


P.falciparum gene encoding primase, small subunit 


0.084 


1125 


AC004214 


Homo sapiens clone UWGC:m32agap from 6p21, complete sequence 


0.009 


1126 


AF061244 


Agrocybe aegerita B type DNA polymerase (Mtpol) gene, complete 
cds; tRNA-Asn gene, complete sequence; and unknown genes, 
mitochondrial genes for mitochondrial products 


0.001 


1127 


AK001125 


Homo sapiens cDNA FLJ10263 fis, clone HEMBB 1000991 


0 


1128 


L41046 


Pisum sativum endo-l,4-beta-glucanase (EGL1) gene, complete cds 


0.0001 


1129 


XM 008099 


Homo sapiens integrin, alpha L (antigen CD11A (pl80), lymphocyte 
function-associated antigen 1; alpha polypeptide) (TTGAL), mRNA 


2E-18 


1130 


AE000049 


Mycoplasma pneumoniae M129 section 31 of 63 of the complete 
genome 


0.75 


1131 


AY019627 


Oryza sativa microsatellite MRG1952 containing (AT)X40, closest to 
marker L128, genomic sequence 


0.024 


1132 


Z46383 


Bovine herpesvirus type 4 DNA for nonconserved region A 


4.3 


1133 


Y07893 


S.cerevisiae DNA fragment, right arm of chromosome VII 


4.2 


1134 


XMJH0867 


Homo sapiens sperm acrosome associated 1 (SPACA1), mRNA 


0.003 


1135 


XM 038693 


Homo sapiens solute carrier family 9 (sodium/hydrogen exchanger), 
isoform 2 (SLC9A2), mRNA 


2 


1136 


NM 033074 


Mus musculus DNA segment, Chr 15, Wayne State University 59, 
expressed (D15Wsu59e), mRNA 


0.64 
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Table 3A Nearest Neighbor (BlastN vs. Geribank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1137 


M97168 


Homo sapiens X (inactive)-specific transcript (XIST) complete exon 


3.4 


1138 


D13973 


Dictyostelium discoideum DNA for Dp87 protein, complete cds 


0.075 


1139 


AF303254 


Caenorhabditis elegans gene XL153 mRNA complete cds 


2 


1140 


AB009926 


Cyphonocerus ruficollis mitochondrial DNA for 16S rRNA, partial 
sequence 


0.24 


1141 


NC_002660 


Cochliomyia hominivorax mitochondrion, complete genome 


0.0009 


1142 


AL096756 


Human DNA sequence from clone RP3-477J10 on chromosome 
22ql3.3 1-13.33 Contains a GSS, complete sequence [Homo sapiens] 


0.26 


1143 


XM_031937 


Homo sapiens hypothetical protein FLJ11183 (FLJ11183), mRNA 


2.2 


1144 


AE003025 


Drosophila melanogaster genomic scaffold 142000013385389, 
complete sequence 


0.22 


1145 


Z11702 


V.faba mRNA for polyphenol oxidase 


0.76 


1146 


XM_049924 


Homo sapiens calcium binding protein 5 (CABP5), mRNA 


5.6 


1147 


XM_047707 


Homo sapiens KIAA1265 protein (KIAA1265), mRNA 


1.6 


1148 


AY005256 


Xylocopa erythrina cytochrome b (cytb) gene, partial cds; 
mitochondrial gene for mitochondrial product 


0.086 


1151 


Z96679 


ILsapiens telomeric DNA sequence, clone 7PTEL026, read 
7PTELOO026.seq 


9E-34 


1152 


AJ001677 


Branchiostoma floridae mRNA for phenylalanine hydroxylase, partial 


0.067 


1153 


J03252 


Human alkaline phosphatase ( ALPP) gene, complete cds 


0.65 


1154 


AK007179 


Mus museums adult male testis cDNA, RIKEN full-length enriched 
library, clone: 17001 12 Jl 6, full insert sequence 


0.24 


1155 


AK004093 


Mus museums 18 days embiyo cDNA, RIKEN full-length enriched 
library, clone: 1110034K09, full insert sequence 


6.7 


1156 


AF370142 


Arabidopsis thaliana unknown protein (T5I7.1/At2g39710) mRNA, 
complete cds j 


1.7 


1157 


AF053649 


Homo sapiens cellular apoptosis susceptibility protein (CSE1) gene, 
exons 15 and 16 


0.006 


1158 


AL080199 


Homo sapiens mRNA; cDNA DKFZp434E082 (from clone 
DKFZp434E082) 


0.081 


1159 


AF267225 


Candidatus Carsonella ruddii natural-host Tainarys sordida ATP 
synthase alpha subunit (atpA) gene, partial cds; ATP synthase gamma 
subunit (atpG) gene, complete cds; and ATP synthase beta subunit 
(atpD) gene, partial cds 


0.15 


1160 


AE000654 


Hehcobacter pylori 26695 section 132 of 134 of the complete genome 


0.51 


1161 


AK007179 


Mus museums adult male testis cDNA, RIKEN full-length enriched 
library, clone: 17001 12 J16, full insert sequence 


0.25 


1162 


NMJ)08787 


Mus museums pericentrin (Pent), mRNA 


1.7 


1163 


AK024592 


Homo sapiens cDNA: FLJ20939 fis, clone ADSE01583 


0.077 


1164 


D13197 


Mouse gene for immiinoglobulin diversity region Dl 


0.076 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1165 


* 

AE001381 


Plasmodium falciparum chromosome 2, section 18 of 73 of the 
complete sequence 


0.073 


1166 


X68360 


M.fascicularis gene for apolipoprotein A-II 


0.0009 


1167 


X73032 


BJuncea ribosomal intergenic spacer 


0.023 


1168 


XM 037745 


Homo sapiens hypothetical gene supported by AL137266 
(LOC91332), mRNA 


0.21 


1169 


NM 017755 


Homo sapiens hypothetical protein FLJ20303 (FLJ20303), mRNA 


0.58 


1170 


XM 045819 


Homo sapiens cofilin 2 (muscle) (CFL2), mRNA 


2 


1171 


AE006349 


Lactococcus lactis subsp. lactis IL1403 section 111 of 218 of the 
complete genome 


0.12 


1172 


XM 030637 


Homo sapiens G protein-coupled receptor kinase 7 (GPRK7), mRNA 


1.6 


1173 


AK017298 


Mus musculus 6 days neonate head cDNA, RIKEN full-length 
enriched library, clone:5430411K23, Ml insert sequence 


0.18 


1174 


AF154915 


Homo sapiens transcription factor HOXD12 (HOXD12) and 
transcription fector HOXD11 (HOXD11) genes, complete cds 


0.52 


1175 


AF143066 


Lepidopilum scabrisetum ribosomal protein system 4 (rps4) gene, 
partial cds 


0.51 


1176 


AL391814 


Human DNA sequence from clone RP1 1-49 A4 on chromosome 6, 
complete sequence [Homo sapiens] 


0.06 


1177 


AF003511 


Toxoneuron sp. 16S ribosomal RNA gene, partial sequence 


0.0000001 


1178 


AK022350 


Homo sapiens cDNA FLJ12288 fis, clone MAMMA1001783 


0.00002 


1180 


AF307986 


Mus musculus small inducible cytokine A2 lb (Scya2 lb) gene, 
complete cds 


0.021 


1181 


NM_022242 


Rattus norvegicus niban protein (Niban), mRNA 


5 


1182 


Z57380 


H.sapiens CpG island DNA genomic Msel fragment, clone 173c2, 
forward read cpgl73c2.ftla 


0.13 


1183 


XM_043937 


Homo sapiens hypothetical protein MGC13033 (MGC13033), mRNA 


6.4 


1184 


L15439 


Geukensia demissa neuropeptide mRNA, complete cds 


0.24 


1185 


AJ251399 


Proechimys cayennensis mitochondrial cytb gene for cytochrome b 
(specimen voucher V-815) 


0.7 


j 1186 


| U00053 


Caenorhabditis elegans cosmid K03F8 


0.62 


1187 


J00277 


Human (genomic clones lambda-[SK2-T2, HS578T]; cDNA clones 
RS-[3,4, 6]) c-Ha-rasl proto-oncogene, complete coding sequence 


7E-55 


1189 


X70677 


Xlaevis XFG 5-1 mRNA for zinc finger protein 


0.23 


1190 


AJ223323 


Saccharomyces cerevisiae mitochondrial tRNA-Tyr, tRNA-Asn, & 
tRNA-Met genes 


0.083 


1191 


XM_030479 


Homo sapiens KIAA1462 protein (BOAA1462), mRNA 


0.76 


1192 


AK016425 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4931403E03, full insert sequence 


0.00003 


1193 


AF092854 


Saguinus mystax visual pigment gene, P562 allele, intron 4 


0.0001 1 


1194 


AC025732 


Homo sapiens BAC clone RP11-1 16J19 from Y, complete sequence 


1.7 
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Table 3A Nearest Neighbor (BlastN vs. Geribank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1195 


AE002177 


Chlamydophila pneumoniae AR39, section 13 of 94 of the complete 
genome 


0.011 


1196 


M94732 


Plasmodium falciparum 15 kDa vesicular-like antigen gene, exons 1 
through 4 


0.009 


1197 


X76311 


H.sapiens endothelial nitric oxidase synthase gene, exons 15 and 16 


0.6 


1198 


L81756 


Homo sapiens (subclone l_g5 from PI H16) DNA sequence, complete 
sequence 


0.061 


1199 


Z82625 


Kprowazekii genomic DNA fragment (clone A396F) 


0.76 


1200 


XM_012528 


Homo sapiens protein kinase C, beta 1 (PRKCB1), mRNA 


2.1 


1201 


M27082 


S.cerevisiae GCN2 gene (which couples GCN4 expression to amino 
acid availability), complete cds 


0.21 


1202 


AK007856 


Mus musculus 10 day old male pancreas cDNA, RIKEN full-length 
enriched library, clone:1810054D07, full insert sequence 


6E-73 


1203 


M69188 1 


Cotton legumin A D-genome alloallele gene, complete cds 


0.076 


1204 


AB043132 


Thunnus thynnus DNA, microsatellite Ttho-6*, partial sequence 


0.001 


1205 


AF019084 


Homo sapiens keratin 2e (KRT2E) gene, complete cds 


6 


1206 


U32784 


Haemophilus influenzae Rd section 99 of 163 of the complete genome 


0.026 


1207 


XM_004699 


Homo sapiens ubiquitin-conjugating enzyme E2H (homologous to 
yeast UBC8) (UBE2H), mRNA 


2 


1208 


AF375643 


Pongo pygmaeus HERV-K-GC1 preintegration site sequence 


2.2 


1209 


AK007179 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone: 17001 12J16, full insert sequence 


0.22 


1210 


AF182214 


Glycine max glutamine synthetase gamma 2 subunit gene, 5'-flanking 
region and partial cds 


0.23 


1211 


AF170224 


Alloplectus meridensis ribulose 1 , 5 -bisphosphate carboxylase large 
subunit gene, partial cds; chloroplast gene for chloroplast product 


6.5 


1212 


NMJ)22978 


Homo sapiens small EDRK-rich factor IB (centromeric) (SERF1B), 
mRNA 


0.23 


1213 


U17787 


Gromphadorhina portentosa mitochondrion 16S ribosomal RNA, 
partial sequence 


0.027 


1214 


AF144086 


Papio hamadryas anubis chromosome 13 repeat region 


0.25 


1215 


XMJ)10106 


Homo sapiens DKFZP586N0819 protein (DKFZP586N0819), mRNA 


0.73 


1216 


AF198964 


Arabidopsis thahana ATP sulfurylase gene, complete cds 


0.072 


1217 


AJ298866 


Xenopus laevis mRNA for winged helix transcription factor (foxd3b 
gene) 


0.2 


1218 


AF069111 


Uroleucon obscurum large subunit ribosomal RNA gene, partial 
sequence; tRNA-Val gene, complete sequence; and small subunit 
ribosomal RNA gene, partial sequence 


0.003 


1219 


X16715 


D. melanogaster gene for chorion protein sl6 


0.23 


1220 


AF245117 


Mus musculus leukocyte cell-surface molecule (Ly9) gene, exon 1 


0.024 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1221 


AP001310 


Arabidopsis thaliana genomic DNA, chromosome 3, PI clone:MU022 


0.23 


1222 


AF092375 


Rivulus gransabanae tRNA-Ile, tRNA-Gln, and tRNA-Met genes, 
complete sequence; NADH dehydrogenase subunit II gene, complete 
cds; tRNA-Trp, tRNA-Ala, tRNA-Asn, tRNA-Cys, and tRNA-Tyr 
genes, complete sequence; and cytochrome c oxidase subunit I gene, 
par> 


0.26 


1223 


U84864 


HIV-1 strain M14 from USA, envelope glycoprotein (env) gene, 
partial cds 


0.76 


1224 


XM_051684 


Homo sapiens similar to estrogen-related receptor alpha; estrogen 
receptor-like 1 (H. sapiens) (LOC93491), mRNA 


0.57 


1225 


AF282391 


Cucumis sativus clone B99 mitochondrial genomic sequence 


2.2 


1226 


AK026459 


Homo sapiens cDNA: FLJ22806 fis, clone KAIA2845 


0.024 


1227 


AJ0O3131 


Spodoptera littoralis nuclear polyhedrosis virus fgt egt genes and 3 
ORFs 


0.022 


1228 


XM 028308 


Homo sapiens helicase-moi (KIAA0928), mRNA 


1.8 


1229 


AF219168 


Fritillaria camtschatcensis clone cam36 retrotransposon Tfcl, partial 
sequence 


2 


1230 


AF325699 


Homo sapiens ataxia-telangiectasia and Rad3 related protein (ATR) 
gene, partial cds, alternatively spliced 


0.069 


1231 


AE007433 


Streptococcus pneumoniae section 1 16 of 194 of the complete genome 


7.9 


1232 


XM 007429 


Homo sapiens CGI-35 protein (LOC51077), mRNA 


1E-08 


1233 


AF021084 


Dianous nitidulus cytochrome b (cylb) gene, mitochondrial gene 
encoding mitochondrial protein, partial cds 


0.07 1 


1234 


Z97349 


Plasmodium falciparum DNA *** SEQUENCING IN PROGRESS 
*** from contig 3-06, complete sequence 


2.1 


1235 


XM 038588 


Homo sapiens like-glycosyltransferase (LARGE), mRNA 


5.5 | 


1236 


X95276 


Rfalciparum complete gene map of plastid-like DNA (TR-B) 


0.008 


1237 


U88570 


Drosophila melanogaster CREB-binding protein homolog mRNA, 
complete cds 


L7 


1238 


AK024332 


Homo sapiens cDNAFLJ14270 fis, clone PLACE1004491 


0.68 


1239 


AC006678 


Caenorhabditis elegans cosmid R05G9, complete sequence 


1.8 


1240 


AF250958 


Lycopersicon esculentum peptide deformylase-like protein mRNA, 
complete cds 


0.076 


1241 


M26650 


Human islet amyloid polypeptide (hIAPP) gene, complete cds 


1.9 


1242 


XMJ)37062 


Homo sapiens DKFZP564G202 protein (DKFZP564G202), mRNA 


0.61 


1243 


AE003025 


Drosophila melanogaster genomic scaffold 142000013385389, 
complete sequence 


0.027 


1244 


U39701 


Mycoplasma genitalium section 23 of 51 of the complete genome 


2.1 


1245 


AE000036 


Mycoplasma pneumoniae M129 section 44 of 63 of the complete 
genome 


8.2 


1246 


AE000830 


Methanobacterium thermoautotrophicum from bases 394245 to 
404874 (section 36 of 148) of the complete genome 


0.67 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1247 


AL358951 


Human DNA sequence from clone RP3-456L16 on chromosome 6, 
complete sequence [Homo sapiens] 


0.64 


1248 


XM 045757 


Homo sapiens chloride channel 4 (CLCN4), mRNA 


0.56 


1249 


U59683 


Nicotiana tabacum squalene synthase (TSS) gene, complete cds 


0.005 


1250 


AK010434 


Mus musculus ES cells cDNA, RIKEN full-length enriched library, 
clone:2410006O21, full insert sequence 


4.9 


1251 


XM_045757 


Homo sapiens chloride channel 4 (CLCN4), mRNA 


0.61 


1252 


AE000717 


Aquifex aeolicus section 49 of 109 of the complete genome 


0.024 


1253 


AC024780 


Caenorhabditis elegans cosmid Y43B1 1 AR, complete sequence 


0.027 


1254 


AF343030 


Human herpesvirus 7 isolate CR18 glycoprotein H gene, complete cds 


0.009 


1255 


AK022153 


Homo sapiens cDNA FLJ12091 fis, clone HEMBB1002582 


0.0000001 


1256 


AF309947 


Dictyostelium discoideum RaclA (racl A) gene, complete cds; and 
unknown gene 


0.009 


1257 


AB051519 


Homo sapiens mRNA for KIAA1732 protein, partial cds 


0.25 


1258 


AC006681 


Caenorhabditis elegans clone R13H9, complete sequence 


2.2 


1260 


Y11648 


Campylobacter jejuni waaC, galE, wlaIB,C,D3^>G,H,I,KJ.,M], 
cheY genes and orfl 


0.25 


1261 


AB009592 


Oryza sativa gene for cytosolic glutathione reductase, complete cds 


2.1 


1262 


X83997 


C.parasitica eapC gene 


0.22 


1263 


AJ404320 


Posidonia oceanica medium repetitive sequence, clone pPopm6 


0.034 


1264 


AF038620 


Caenorhabditis elegans cosmid H10D12 


0.063 


1265 


U05822 


Human proto-oncogene BCL3 gene, exon 2 


0.0000001 


1266 


Z79343 


H.sapiens flow-sorted chromosome 6 TaqI fragment, SC6pA4G2 


0.13 


1267 


S60289 


LeB4=legumin {5* region} [Vicia feba, Genomic, 1222 nt] 


0.0003 


1268 


AE006303 


Lactococcus lactis subsp. lactis IL1403 section 65 of 218 of the 
complete genome 


1.9 


1269 


AE001410 


Plasmodium falciparum chromosome 2, section 47 of 73 of the 
complete sequence 


0.008 


1270 


X96972 


D.buzzatii copia element DNA 


0.2 j 


1271 


AB042973 


Nicotiana tabacum wizz gene, 5* flanking region, partial cds 


0.073 j 


1272 


Z95309 


Caenorhabditis elegans cosmid H36L18, complete sequence 


3.5 ! 


1273 


AF091745 


Homo sapiens exonuclease I (EXOI) gene, exon 5 


0.2 i 


1274 


AF386945 


Arabidopsis thaliana Rubisco subunit binding-protein beta subunit 
(T5A14.11) mRNA, complete cds 


0.7 


1275 


AF269376 


Staphylococcus epidermidis strain SRI clone step.!002h01 genomic 
sequence 


0.076 


1276 


AK012864 


Mus musculus 10, 11 days embryo cDNA, RIKEN full-length 
enriched library, clone:2810036I15, full insert sequence 


0.028 


1277 


Z74204 


S.cerevisiae chromosome IV reading frame ORF YDL156w 


0.18 


1278 


AB049851 


Macaca fascicularis brain cDNA, clone:QnpA-18831 


2E-91 


1279 


AE001761 


Thermotoga maritima section 73 of 136 of the complete genome 


0.19 


1280 


AE002102 


Ureaplasma urealyticum section 3 of 59 of the complete genome 


5.5 


1281 


AK021982 


Homo sapiens cDNA FLJ1 1920 fis, clone HEMBB 10003 12 


0.006 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1282 


AE007504 


Streptococcus pneumoniae section 187 of 194 of the complete genome 


0.21 


1283 


X79095 


F.trinervia pdk gene for pyruvate,orthophosphate dikinase 


0.22 


1284 


AE007413 


Streptococcus pneumoniae section 96 of 194 of the complete genome 


1.7 


1285 


AB050513 


Macaca fascicularis brain cDNA, clone:QnpA-17049 


0.41 


1286 


AF339809 


Homo sapiens clone IMAGE:274150 > mRNA sequence 


0.52 


1287 


AF074991 


Homo sapiens full length insert cDNA YH88A03 


4E-09 


1288 


AF277581 


Campostoma anomalum microsatellite CA9 sequence 


1.6 


1289 


AF035245 


Bacillus lynceorum strain pVIL31 Bag320 satellite DNA 


0.23 


1291 


AF298208 


Dictyostelium discoideum transposon thug-T, complete sequence 


0.028 


1292 


AL133656 


Homo sapiens mRNA; cDNA DKFZp434O2017 (from clone 
DKFZp434O2017) 


0.021 


1293 


AK024666 


Homo sapiens cDNA: FLJ21013 fis, clone CAE05223 


9E-34 


1294 


U55367 


Caenorhabditis elegans cosmid C55H1, complete sequence 


2.1 ! 


1295 


X16715 


D. melanogaster gene for chorion protein sl6 


0.25 


1296 


AC001080 


Homo sapiens (subclone 3_g7 from PAC H74) DNA sequence, 
complete sequence 


0.69 


1297 


XM 051200 


Homo sapiens hypothetical protein MGC5149 (MGC5149), mRNA 


0.58 


1298 


AE000016 


Mycoplasma pneumoniae M129 section 1 of 63 of the complete 
genome 


0.026 


1299 


AF264059 


Homo sapiens RAPD fragment lost in a grade 2 astrocytoma 


0.73 


1300 


AE002308 


Chlamydia muridarum, section 39 of 85 of the complete genome 


0.078 


1301 


U58745 


Caenorhabditis elegans cosmid C10G6, complete sequence 


0.26 


1302 


XM 007768 


Homo sapiens KIAA1018 protein (KIAA1018), mRNA 


2.2 


1303 


AC000102 


Homo sapiens Chromosome 22qll.2 BAC Clone 60b5 In GNAZ-BCR 
Region, complete sequence 


0.17 


1304 


AF338734 


Homo sapiens hypothetical PHD zinc finger protein XAP135 
pseudogene, complete sequence 


2.3 


1305 


AF211141 


Carsonella ruddii natural-host Pachypsylla celtidis tryptophanyl-tRNA 
synthetase (trpS) gene, partial cds; 16S ribosomal RNA, 23S 
ribosomal RNA, and 5S ribosomal RNA genes, complete sequence; 
transaldolase (tal) and alkyl hydroperoxide reductase small > 


0.087 


1306 


L81391 


Homo sapiens (subclone 2_a6 from PI H39) DNA sequence, complete 
sequence 


0.18 


1307 


AB013613 


Oryza sativa DNA, centromere sequence RCB 1 1 


0.085 


1308 


U03906 


Macroscelides proboscideus aldehyde dehydrogenase I, eta-crystallin 
mRNA, complete cds 


0.26 


1309 


L13470 


Human thyroxine-binding globulin gene, complete cds 


0.21 


1 1310 


AK026225 


Homo sapiens cDNA: FLJ22572 fis, clone HSI02313 


3E-27 


1311 


XM_030391 


Homo sapiens dJ467Nll.l protein (DJ467N11.1), mRNA 


0.001 


1312 


XM 017923 


Homo sapiens hypothetical protein FLJ12660 (FLJ12660), mRNA 


2E-32 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1313 


M57256 


B.hermsii vmp21 gene encoding Vmp21 outer membrane lipoprotein 


0.58 


1314 


Y14051 


Staphylococcus aureus mecA, mecRl, mecl genes and ORF168, 
ORF142, ORF44, ORF145 and ORF224 


0.081 


1315 


XM 032018 


Homo sapiens similar to RIKEN cDNA 2310040G17 gene (M 
musculus) (LOC90480), mRNA 


0.027 


1316 


AF227514 


Octodon lunatus cytochrome b gene, partial cds; mitochondrial gene 
for mitochondrial product 


1.9 


1317 


AE001414 


Plasmodium falciparum chromosome 2, section 51 of 73 of the 
complete sequence 


0.73 


1318 


Z21622 


S.cerevisiae RB37 gene encoding HTP reductase 


0.77 


1319 


XM 040462 


Homo sapiens ATP-binding cassette, sub-family D (ALD), member 3 
(ABCD3), mRNA 


1.8 


1320 


Z65229 


H sapiens CpG island DNA genomic Msel fragment, clone 28a8, 
forward read cpg28a8.ftla 


0.23 


1321 


XM_040095 


Homo sapiens hypothetical protein (DKFZP434G0310), mRNA 


2.3 


1322 


XM_027314 


Homo sapiens hypothetical protein FLJ2235 1 (FLJ2235 1), mRNA 


1E-14 


1323 


AF282064 


Homo sapiens clone 2qtel edcl7iF-cl7jF sequence 


3E-88 


1324 


AF101098 


Homo sapiens collagen type XI alpha-1 (COL11A1) gene, exon 38 


1.3 


1325 


AB050093 


Neoceratodus forsteri mRNA for pituitary glycoprotein hormone alpha 
subunit, complete cds 


0.009 


1326 


AL591507 


Human DNA sequence from clone RP11-289C14 on chromosome 6, 
complete sequence [Homo sapiens] 


0.026 


1327 


AJ400873 


Homo sapiens partial GPLD1 gene for glycosylphosphatidylinositol 
phospholipase D, exons 15-20 


1.9 


1328 


AF378831 


Mus musculus orphan G protein-coupled receptor (Raig2) mRNA, 
complete cds 


1.9 


1329 


XM_030116 


Homo sapiens hypothetical protein MGC3077 (MGC3077), mRNA 


0 


1330 


X54806 


Rnorvegicus mRNA for cytokeratin type I (3 f end) 


0.77 


1331 


XMJH3141 


Homo sapiens ATPase, Cu++ transporting, alpha polypeptide 
(Menkes syndrome) (ATP7A), mRNA 


0.72 


1332 


NM_026428 


Mus musculus RIKEN cDNA 1810027P18 gene (1810027P18Rik), 
mRNA 


0.019 


1333 


U85448 


Actinidia deliciosa sucrose-phosphate synthase (KSPS-1) mRNA, 
partial cds 


0.001 




AK013974 


Mus musculus 13 days embryo head cDNA, RIKEN full-length 
enriched library, clone:3110002D09, full insert sequence 


0.74 


1335 


U25641 


Tetrahymena thermophila telomerase component p80 mRNA, 
complete cds 


0.008 


1336 


NC_002102 


Lactobacillus helveticus plasmid pLHl, complete sequence 


0.78 


1337 


M14292 


Human LIHeg repetitive element from the intergenic region of the 
epsilon and G-gamma globin genes 


5E-08 
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SEQID 
NO 


ACCESSN 
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P VALUE 


1338 


AF239910 


Petunia axillaris haplotype S15 self-incompatibility ribonuclease 
mRNA, complete cds 


2.1 


| 1339 


AF1 16872 


Candida albicans alternative oxidase (AOX2) gene, complete cds 


0.25 


1340 


XM_035344 


Homo sapiens KIAA1688 protein (KIAA1688), mRNA 


1.6 


1341 


XM 046636 


Homo sapiens hypothetical protein FLJ20080 (FLJ20080), mRNA 


6E-74 


1342 


AF292561 


Strongyloides stercorals G protein alpha subunit 2 (gpa-2) gene, 
complete cds 


0.76 


1343 


AF008597 


Catharanthus roseus desacetoxyvindoline^^iydroxylase (CRSD4H) 
gene, complete cds 


0.26 


1344 


AF347671 


Equine rhinitis A virus isolate PERV/62 PI (PI) gene, partial cds 


6.9 | 


1345 


U89019 


Hepatitis C virus polyprotein gene, complete cds 


0.27 


1346 


XM_043815 


Homo sapiens hypothetical protein FLJ21736 (FU21736), mRNA 


2.2 


1347 


L39930 


Podocoryne carnea fibrillin mRNA, partial cds 


0.7 


1348 


Z21512 


Klactis GAL80 gene encoding K1GAL80 protein 


0.075 


1349 


AC001490 


Homo sapiens (subclone 2__cl0 from PI H32) DNA sequence, 
complete sequence 


2E-11 


1350 


L36891 


Saccharomyces cerevisiae mitochondrion transfer RNA-fMet (tRNA- 
fMet) gene; transfer RNA-Pro (tRNA-Pro) gene; 9S ribosomal RNA 
(9S rRNA) gene 


0.025 


1351 


AK017200 


Mus musculus 1 1 days pregnant adult female ovaiy and uterus cDNA, 
RIKEN full-length enriched library, clone:5033428C03, full insert 
sequence 


6.2 


1352 


AB003286 


Homo sapiens DNA for choline kinase like protein and muscle type 
carnitine palmitoyltransferase I, partial and complete cds 


0.069 


1353 


AF3 12917 


Plasmodium falciparum reticulocyte binding protein 2 homolog B 
gene, complete cds 


0.081 


1354 


M65060 


M.thermoautotrophicum trpE, trpG, trpC, trpF, trpB, tip A, trpD 
genes, complete cds 


0.26 


1355 


AB025642 


Arabidopsis thaliana genomic DNA, chromosome 5, BAC 
clone:T32G24 


0.34 


1356 


L24957 


Zygosaccharomyces cidri alpha-galactosidase gene, complete cds 


0.08 


1 1357 


L38437 


Rattus norvegicus NADH ubiquinone oxidoreductase subunit (IPO) 
gene, complete cds 


0.27 


1358 


AJ301686 


Polystoma sp. DNA-25 ITS 1, isolate DNA-25 


2.2 


1359 


AB002805 


Homo sapiens OS-9 gene, 5* upstream region and partial cds 


0.0004 


1361 


XM 030723 


Homo sapiens CGI-107 protein (LOC51012), mRNA 


3.1 


1362 


NM_031881 


Mus musculus neural precursor cell expressed, developmentally down- 
regulated gene 4b (Nedd4b), mRNA 


3.8 


1363 


AC008855 


Homo sapiens chromosome 5 clone CTD-2178M23, complete 
sequence 


0.72 


1364 


NC 002660 


Cochliomyia hominivorax mitochondrion, complete genome 


0.074 


1365 


AF072589 


Python regius alpha enolase mRNA, complete cds 


1.4 


1366 


| AJ243461 


Vicia faba nod-CCPl gene, exons 1 and 2 


0.017 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1367 


XM 011052 


Homo sapiens 28kD interferon responsive protein (IFRG28), mRNA 


1.6 


1368 


AL390157 


Homo sapiens mRNA; cDNA DKFZp434D179 (from clone 
DKFZp434D179) 


0.0005 


1369 


AL023822 


Caenorhabditis elegans cosmid Y102A5A, complete sequence 


0.046 


1370 


U36918 


Mesocricetus auratus mucin (MUC1) mRNA, complete cds 


4.4 


1371 


AF081786 


Homo sapiens lysyl hydroxylase 1 gene, promoter region and partial 
cds 


0.18 


1372 


XMJ)35682 


Homo sapiens similar to hypothetical protein FLJ22639 (H. sapiens) 
(LOC91047), mRNA 


8E-08 


1373 


Z54645 


Rsapiens CpG island DNA genomic Msel fragment, clone 14h7, 
reverse read cpgl4h7.rtlb 


0.027 


1374 


AB012135 


Schizosaccharomyces pombe gene for catalytic subunit (C5) of 
proteasome, complete cds 


1.9 


1375 


L47352 


Brassica napus (clone Sta 39-4) arabinogalactan protein mRNA, 
complete cds 


0.73 


1376 


AB036993 


Homo sapiens RHAG gene for Rh50 glycoprotein, partial cds 


0.0000001 


1377 


U67535 


Methanococcus jannaschii section 77 of 150 of the complete genome 


0.03 


1378 


AF203637 


Myotis myotis microsatellite A2 sequence 


0.017 


1379 


XM 004748 


Homo sapiens hypothetical protein FLJ1 1 1 10 (FLJ1 1 1 10), mRNA 


1.1 


1380 


AJ249744 


Campylobacter jejuni cjaE gene for putative membrane protein CjaE 


0.69 


1381 


AL591033 


Human DNA sequence from clone RP11-94L3 on chromosome 6, 
complete sequence [Homo sapiens] 


1.7 


1382 


AJ242625 


Mus musculus Dmp-1 gene, exons 1-6 


0.71 


1383 


AB036851 


Ciona intestinalis mRNA for pellino, complete cds 


0.23 


1384 


XM 012972 


Homo sapiens transmembrane phosphatase with tensin homology 
(TPTE), mRNA 


2E-09 


1385 


AF175313 


Oncorhynchus mykiss cardiac sodium-calcium exchanger (NCX) 
mRNA, complete cds 


0.47 


I 1386 


AF283669 


Homo sapiens kallikrein 14 (KLK14) gene, complete cds 


| 0.44 


1387 


AY018560 


Oryza sativa microsatellite MRG0885 containing (AT)X18, genomic 
sequence 


0.63 


1388 


AK027272 


Homo sapiens cDNA FLJ14366 fis, clone HEMBA1001020 


0.019 


1389 


XM_003052 


Homo sapiens roundabout (axon guidance receptor, Drosophila) 
homolog 1 (ROBOl), mRNA 


1E-49 


1390 


AF208532 


Homo sapiens fatly acid omega-hydroxylase CYP4A11 (CYP4A11) 
gene, complete cds 


0.69 


1391 


AF233326 


Mus musculus tumor necrosis factor receptor-associated factor (Tra£2) 
gene, exons 2, 3, and 4 


0.72 


1392 


AE001413 


Plasmodium falciparum chromosome 2, section 50 of 73 of the 
complete sequence 


0.15 


| 1394 


X54889 


Sea urchin developmentally regulated ectodermal gene 


0.023 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1395 


AF378868 , 


krabidopsis thaliana AT5gll200/F2Ill_90 mRNA, complete cds 


2.5 


1396 


AF328539 ] 


Homo sapiens clone 12qtel c64t3 sequence j 


0.0003 


1397 


] 

AE006450 < 


Lactococcus lactis subsp. lactis IL1403 section 212 of 218 of the 
complete genome 


0.47 


1398 


Z12840 < 


O.cuniculus mRNA for protein of unknown function 


2E-97 


1399 


U31165 


Rattus norvegicus SH3 domain binding protein (CR16) gene, exon 6 


1.9 


1400 


Y15005 


Bison bonasus mitochondrial cytB gene 


1.8 


1401 


AC091267 


Caenorhabditis elegans cosmid Y37B1 1 A, complete sequence 


3.7 


1402 


AB055364 


Macaca fascicularis brain cDNA, clone:QflA-12522 


e-119 


1403 


NC 001610 


Didelphis virginiana mitochondrion, complete genome 


0.006 


1404 


AF362372 


Dictyostelium discoideum histidine kinase DhkJ (dhkJ) gene, 
complete cds 


0.055 


1405 


AF027164 


Homo sapiens type II integral membrane protein (NKG2-E) gene, 
partial cds; and type n integral membrane protein (NKG2-F) gene, 
complete cds 


3.6 


1406 


M76376 


Human cysteine-rich protein (CRP) gene, exon 2 


0.027 


1407 


D31863 


Mouse Pig-a gene for GPI-anchor biosynthesis (PIG-A protein), exon 
6 and complete cds 


0.026 


1408 


XM 031315 


Homo sapiens hypothetical protein FLJ14033 similar to hypoxia 
inducible factor 3, alpha subunit (FLJ14033), mRNA 


0.000007 


1409 


AY019369 


Oryza sativa microsatellite MRG1694 containing (AT)X32, genomic 
sequence 


0.0001 


1410 


AF067610 


Caenorhabditis elegans cosmid F41 A4 


0.084 


1411 


AJ401391 


Drosophila melanogaster mRNA for PDGF/VEGF-like protein 
(CG7103 gene) 


0.18 


1412 


XM 008417 


Homo sapiens hypothetical protein FLJ20694 (FLJ20694), mRNA 


0.0000001 


1413 


AF242195 


Homo sapiens KLK15 (KLK15) gene, complete cds, alternatively 
spliced 


e-160 


1414 


Y14077 


Bacillus subitlis 10.6 Kb chromosomal DNA: glyB-prsA region 


0.008 


1415 


AE006802 


Sulfolobus solfataricus section 161 of 272 of the complete genome 


0.057 


1416 


AY007149 


Homo sapiens clone CDABP0086 mRNA sequence 


0.092 


1417 


K02628 


Oxytricha nova (hypotrichous ciliate) ( clones LMiC2-(5,6,8)) 
micronuclear DNA, C2 gene (version 3), complete cds 


0.063 


1419 


i X14260 


Xenopus tropicalis alpha-globin gene 


1.7 


1420 


AJ401391 


Drosophila melanogaster mRNA for PDGF/VEGF-like protem 
(CG7103 gene) 


0.15 


1421 


XM 032580 


Homo sapiens nebulette (NEBL), mRNA 


1.3 


1422 


XM 008060 


Homo sapiens solute carrier family 7 (cationic amino acid transporter, 
y+ system), member 5 (SLC7A5), mRNA 


2E-12 


1424 


AF035820 


Hordeum vulgare gibberellin action negative regulator SPY mRNA 
complete cds 


2.1 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1425 


AL353672 


Human DNA sequence from clone RP1-241K1 on chromosome 6, 
complete sequence [Homo sapiens] 


3.8 


1426 


AP070565 


Homo sapiens clone 24425 mRNA sequence 


0.2 


1427 


AK026914 


Homo sapiens cDNA: FLJ23261 fis, clone COL05862 


0.009 


1428 


X14735 


Podospora anserina mtDNA for large subunit rRNA 


0.21 


1429 


XM 004081 


Homo sapiens aldehyde dehydrogenase 5 family, member Al 
(succinate-semialdehyde dehydrogenase) (ALDH5A1), mRNA 


1.4 


1430 


Y10687 


Sulfolobus acidocaldarius gene encoding purine NTPase 


0.019 


1431 


AF168678 


Drosophila melanogaster adenylyl cyclase 78C-s mRNA, complete cds 


0.13 


1432 


AC006666 


Caenorhabditis elegans cosmid H31G24, complete sequence 


0.014 


1433 


X02473 


Calf mRNA for delta subunit of muscle acetylcholine receptor 


1.1 


1434 


AF270148 


Staphylococcus epidermidis strain SRI clone step. 105 lc08 genomic 
sequence 


0.006 


1435 


XM__008084 


Homo sapiens hypothetical protein LOC55565 (LOC55565), mRNA 


3E-36 


1436 


AF007876 


Homo sapiens Na,K-ATPase beta 2 subunit gene, complete cds 


0.008 


1437 


AK016592 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4933400F21, full insert sequence 


0.61 


1438 


AB023053 


Homo sapiens genomic DNA, chromosome 6p21.3, HLA class I 
region, clone:53L9, complete sequence 


2E-10 


1439 


XM_035837 


Homo sapiens cAMP responsive element modulator (CREM), mRNA 


1.7 


1440 


Y12710 


Athaliana gene encoding shaggy-like kinase gamma 


0.068 


1441 


AF351620 


Homo sapiens lipocalin-1 interacting membrane receptor (LIMR) 
gene, complete cds 


0.028 


1442 


U67560 


Methanococcus jannaschii section 102 of 150 of the complete genome 


2.1 


1443 


NM 002251 


Homo sapiens potassium voltage-gated channel, delayed-rectifier, 
subfamily S, member 1 (KCNS1), mRNA 


1.9 


1444 


XM 027341 


Homo sapiens similar to retinoblastoma inhibiting gene 1 (M 
musculus) (LOC89932), mRNA 


0.0008 


1446 


AF039373 


Arabidopsis thaliana ecotype RLD chromomethylase (CMT1) gene, 
complete cds 


1.9 


1447 


U29397 


Rattus norvegicus plasma membrane Ca2+ ATPase isoform 3 
(PMCA3) gene, 5* flanking region 


0.008 


1448 


XMJ)36536 


Homo sapiens hypothetical protein FLJ10402 (FU10402), mRNA 


4.1 


1449 


AK000677 


Homo sapiens cDNA FLJ20670 fis, clone KAIA4743 


1E-09 


1450 


AJ243538 


Glomus mosseae fox2 gene for Fox2 protein 


0.018 


1451 


AJ296103 


Staphylococcus aureus repNVH99 gene for replication protein and 
smrgene 


0.23 


1452 


AF084911 


Bombus ternarius cytochrome b gene, partial cds; mitochondrial gene 
for mitochondrial product 


1.4 


1453 


AC001080 


Homo sapiens (subclone 3 _g7 from PAC H74) DNA sequence, 
complete sequence 


0.72 
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SEQID 
NO 


ACCESSN 


DESCRTP 


P VALUE 


1454 


AF213160 ( 


Crallus gallus microsatellite C-MNT-7 sequence 


0.019 


1455 


AB056366 ] 


Macaca fascicularis brain cDNA, clone:Q£LA-14203 


0.024 


1456 


] 

AF129515 ( 


Paramecium tetraurelia SEC7-related protein (SEC7) mRNA, 
complete cds 


0.055 


1457 


] 

XM 037469 


Homo sapiens similar to spastic ataxia of Charlevoix-Saguenay 
(sacsin) (H. sapiens) (LOC91290), mRNA 


1.8 


1458 


AK005404 


Mus musculus adult female placenta cDNA, RIKEN foil-length 
enriched library, clone: 1600009D24, full insert sequence 


4.8 


1459 


AE001397 


Plasmodium falciparum chromosome 2, section 34 of 73 of the 
complete sequence 


4.2 


1460 


AF161253 


Dictyostelium discoideum protein phosphatase 4 catalytic subunit 
(pppC) mRNA, complete cds 


0.77 


1461 


X74912 


M.hominis lysS gene 


0.19 


1462 


AF027148 


Homo sapiens myogenic determining factor 3 (MYOD1) gene, 
complete cds 


0.45 


1463 


BC003261 


Mus musculus, serine/threonine kinase 5, clone MGG5803 
IMAGE:3501444, mRNA, complete cds 


0.21 


1464 


AC079143 


Homo sapiens Xp BAC RP1 1-346D13 (Roswell Park Cancer Institute 
Human BAC Library) complete sequence 


2 


1465 


Y07829 


Homo sapiens RFB30 gene for RING finger protein 


4.5 


1466 


BC006448 


Homo sapiens, clone IMAGE:3637085, mRNA 


1.9 1 


1467 


X05287 


Locusta migratoria mitochondrial gene for 16S rRNA 


0.19 


1468 


XM 052063 


Homo sapiens hypothetical protein DKFZp547 A023 
(DKFZp547A023), mRNA 


0 


1469 


AF1 13952 


Campylobacter jejuni multidrug-efflux transporter gene, complete cds; 
and unknown genes 


0.066 


1470 


AE006680 


Sulfolobus solfataricus section 39 of 272 of the complete genome 


! 5.2 


1471 


AY045578 


Arabidopsis thaliana AT4g37000/C7A10 360 mRNA, complete cds 


2.2 


1472 


AF269209 


Takifugu rubripes clone FTCRAV4S1 T cell receptor V alpha chain 
gene, partial cds 


1.9 


1473 


XM 027952 


Homo sapiens dihydrofolate reductase (DHFR), mRNA 


1.3 


1474 


U80449 


Caenorhabditis elegans cosmid R10A10 


1.8 


1475 


X15634 


C. elegans core histone genes H4(his-10) and H3(his-9) 


0.025 


1476 


Z16938 


H. sapiens (D19S218) DNA segment containing (CA) repeat, clone 
AFM207wc3; single read 


0.02 


1477 


X96783 


H. sapiens Syt V gene (genomic and cDNA sequence) 


0.0007 


1478 


Y08924 


P.falciparum mRNA for AARP2 protein 


0.003 


1479 


XM 036181 


Homo sapiens RAN binding protein 2-like 1 (RANBP2L1), mRNA 


i 0.21 \ 


1480 


AL390874 


Human DNA sequence from clone RP1 1-238K16 on chromosome 9, 
complete sequence [Homo sapiens] 


0.002 


1481 


AL133157 


S.pombe chromosome I cosmid clB2 


0.38 


1482 


U67598 


Methanococcus jannaschii section 140 of 150 of the complete genome 


0.37 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1483 


AJ242525 


Chlamydomonas reinhardtii partial dhclO gene for 1 beta dynein 
heavy chain, exons 16-53 


5,8 


1484 


L19842 


Daltonia sp. mitochondrial NADH dehydrogenase, and cytochrome b 
genes, 3* end, and transfer RNA-Ser gene 


0.001 


1 1485 


AJ239534 


Canis familiaris tRNA-derived SINE element, clone D254UC12 


0.009 


1486 


AF044206 


Homo sapiens cyclooxygenase (COX-2) gene, promoter and exon 1 


0.2 | 


1487 


AB035880 


Chaitoregma tattakana mitochondrial genes for small subunit rRNA, 
tRNA-Val, large subunit rRNA, partial and complete sequences 


0.23 


1488 


AE006205 


Pasteurella multocida PM70 section 172 of 204 of the complete 
genome 


0.26 


1489 


XMJ) 11446 


Homo sapiens hypothetical protein ELJ10775 (FLJ10775), mRNA 


6E-16 


1490 


NM 030714 


Mus musculus deltex 3 homolog (Drosophila) (Dtx3), mRNA 


0.39 


1491 


AE001407 


Plasmodium falciparum chromosome 2, section 44 of 73 of the 
complete sequence 


0.001 


1492 


BC005530 


Mus musculus, clone IMAGE:3495762, mRNA 


6.4 


1493 


XM017183 


Homo sapiens hypothetical protein FLJ22474 (FLJ22474), mRNA 


0.0007 


1494 


AF198964 


Arabidopsis thalfona ATP sulfurylase gene, complete cds 


0.057 


1495 


AK026781 


Homo sapiens cDNA: FLJ23 128 fis, clone LNG08384 


0.0002 


1496 


AF307860 


Homo sapiens CYR61 protein (CYR61) gene, complete cds 


3.9 


1497 


AL591667 


Human DNA sequence from clone RP11-389N9 on chromosome 6, 
complete sequence [Homo sapiens] 


| 0.004 


1498 


XM 002800 


Homo sapiens kinesin-like protein 2 (hklp2), mRNA 


2E-53 


1499 


M90970 


Atriplex nummularia DNA sequence 


4.8 


1500 


D34630 


Arabidopsis thaliana mRNA for acetyl-CoA carboxylase, complete cds 


0.22 


1501 


AC004498 


Homo sapiens chromosome 5, PI clone 1209C1 (LBNL H104), 
complete sequence 


3E-13 


1502 


AF104350 


Dictyostelium discoideum prespore-specific protein (pspC) gene, 
partial cds; and unknown gene 


0.017 


1503 


XM 028059 


Homo sapiens KIAA1681 protein (KIAA168 1), mRNA 


0.35 


1504 


U83462 


Mus musculus serotonin N-acetyltransferase ( AANAT) gene, 
complete cds 


0.017 


1505 


AD001534 


Borrelia afzelli (isolate UMOl), sequence fragment 


0.7 


1506 


AF092379 


Rivulus haraldsiolii tRNA-Ile, tRNA-Gln, and tRNA-Met genes, 
complete sequence; NADH dehydrogenase subunit n gene, complete 
cds; tRNA-Trp, tRNA-Ala, tRNA-Asn, tRNA-Cys, andtRNA-Tyr 
genes, complete sequence; and cytochrome c oxidase subunit I gene, 
pa> 


0.002 


1507 


AF3 97158 


Homo sapiens clone 11 pur alpha-associated ribosomal RNA gene, 
partial sequence 


1E-39 


1509 


AF370352 


Arabidopsis thaliana putative ribosomal protein (AT4gl6720) mRNA, 
complete cds 


1.2 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1510 


AF286240 


HIV-1 isolate 97CDKFE4 from Republic of the Congo gag protein 
(gag) and pol protein (pol) genes, partial cds 


1.2 


1511 


L29256 


Human dinucleotide repeat polymorphism at the DXS1690 locus 


5E-50 


1512 


AF214115 


Peromyscus maniculatus H19 mRNA, complete cds 


2.1 


1513 


XM 003799 


Homo sapiens protocadherin beta 10 (PCDHB10), mRNA 


0.57 


1514 


AF387030 


Homo sapiens DNA polymerase epsilon subunit B (POLE2) gene, 
exons 14 and 15 


0.72 


1515 


M27054 


Homo sapiens asparagine synthetase (TS11) gene, exons 1-3 and 
promoter region 


0.2 


1516 


XMJ)51852 


Homo sapiens mesoderm development candidate 2 (MESDC2), 
mRNA 


5.9 


1517 


XM 052004 


Homo sapiens heterogeneous nuclear protein similar to rat helix 
destabilizing protein (FBRNP), mRNA 


0.00009 


1518 


XM_046730 


Homo sapiens hypothetical gene supported by L27560 (LOC92701), 
mRNA 


0.53 


1519 


AF136179 


Mus musculus spermine synthase gene, partial cds 


0.063 


1520 


U19360 


Onchocerca volvulus nuclear hormone receptor-like peptide, complete 
cds 


1.6 


1521 


AY039907 


Arabidopsis thaliana AT5g41810yK16L22 9 mRNA, complete cds 


5.9 


1522 


D86573 


Plasmodium falciparum DNA for flavoprotein subunit of succinate 
dehydrogenase, complete cds 


0.009 


1523 


NC 002548 


Acute bee paralysis virus, complete genome 


0.07 


1524 


U33959 


Macaca fascicularis fertilin beta mRNA, complete cds 


0.3 


1525 


AF270048 


Staphylococcus epidermidis strain SRI clone step.l043h07 genomic 
sequence 


0.76 


1526 


AB047239 


Staphylococcus aureus DNA, complete structure of cassette 
chromosome(SCC)-like element, strain:ATCC25923 


0.72 


1527 


U39684 


Mycoplasma genitalium section 6 of 51 of the complete genome 


0.21 


1528 


XM_030430 


Homo sapiens hypothetical protein (FLJ11127), mRNA 


2 


1529 


D38148 


Rotavirus strain A5-13 NSP1 gene, complete cds 


0.68 


1530 


AK020846 


Mus musculus adult retina cDNA, RIKEN full-length enriched 
library, clone: A930011E06, full insert sequence 


0.2 


1531 


AF130049 


Homo sapiens clone FLB3411 PRO0852 mRNA, complete cds 


0.067 


1532 


AF332140 


Mus musculus c-Fos (Fos) gene, 5* flanking and promoter regions 


1.8 


1533 


AF231842 


Jasminum odoratissimum tRNA-Leu (trnL) gene, partial intron and 3' 
exon, and trnL-trnF spacer region; chloroplast gene for chloroplast 
product 


1.3 


1534 


NC 001560 


Vesicular stomatitis virus, complete genome 


0.35 


1535 


AF213897 


Homo sapiens cAMP responsive element modulator (CREM) gene, 
exon C 


0.24 


1536 


XM_049251 


Homo sapiens KIAA0903 protein (KIAA0903), mRNA 


0.22 


1537 


AK027713 


Homo sapiens cDNA FLJ14807 fis, clone NT2RP4001760, weakly 
similar to PUTATIVE RHO/RAC GUANINE NUCLEOTIDE 
EXCHANGE FACTOR 


1E-36 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1538 


AF268091 


Chloris gayana phosphoenolpyruvate carboxylase (ppc) mRNA, 
complete cds 


6.1 


1539 


AC023351 


Homo sapiens clone RP1 1-637 J20, complete sequence 


1E-49 


1540 


NC 002161 


Bovine parainfluenza virus 3, complete genome 


6.4 


1541 


AE001092 


Archaeoglobus fiilgidus section 15 of 172 of the complete genome 


0.15 


1542 


AF044202 


Drosophila melanogaster glutamate receptor DGluRUB mRNA, 
complete cds 


0.008 


1543 


Z72767 


S.cerevisiae chromosome VII reading frame ORF YGL245w 


1.8 


1544 


AJ277899 


Nicotiana tabacum drepp4 gene, exons 1-4 


0.24 


1545 


XM 005131 


Homo sapiens chromatin accessibility complex 1 (CHRAC1), mRNA 


0.71 


1546 


AF270048 


Staphylococcus epidermidis strain SRI clone step.l043h07 genomic 
sequence 


0.67 


1547 


AL1 12743 


Botrytis cinerea strain T4 cDNA Ubrary under conditions of nitrogen 
deprivation 


0.076 


1548 


S68847 


YAP2= AP-1 -like stress-induced transcriptional activator 
[Saccharomyces cerevisiae, W303, Genomic, 2181 nt] 


0.21 


1549 


AF176634 


Reldia sp. Amaya and Smith 578 NADH dehydrogenase subunit F 
(ndhF) gene, partial cds; chloroplast gene for chloroplast product 


0.54 


1551 


X79988 


H. sapiens HEP gene, exon 1 


0.21 


1552 


AK015054 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4930402M22, full insert sequence 


2.2 


1553 


U07731 


Human quinone oxidoreductase2 (NQ02) gene, introns 1 and 2 and 
exon 2 


0.42 


1554 


AE001370 


Plasmodium falciparum chromosome 2, section 7 of 73 of the 
complete sequence 


0.02 


1555 


AF222996 


Danio rerio nocA-like Zn-finger protein (nlz) mRNA complete cds 


0.59 


1556 


AJ000080 


Trypanosoma brucei hsplOO and gpi-plc genes 


0.00004 


1557 


D30745 


Xenopus laevis MRP RNA gene 


5 


1558 


X17256 


Yeast THR4 gene for threonine synthase (EC 4.2.99.2) 


0.64 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1568 


Z29373 


Rsapiens gene for neural cell adhesion molecule LI 


0.85 


1569 


AF138876 


Bacillus thuringiensis isolate Ml 5 insertion sequence IS231N 


0.48 


1570 


AF043087 


Hordeum vulgare dehydrin 1 (dhnl) gene, complete cds 


2.1 


1571 


AE002293 


Chlamydia muridarum, section 25 of 85 of the complete genome 


0.57 


1572 


AB017043 


Rattus norvegicus gene for hepatocyte nuclear factor 3 gamma, partial 
cds 


3.7 


1573 


XM_038951 


Homo sapiens hypothetical protein MGC15875 (MGC15875), mRNA 


0.43 


1574 


AF244979 i 


Mus musculus galectin-1 1 mRNA, variant b, complete cds, 
alternatively spliced 


0.24 


1575 


XM_006069 


Homo sapiens weel+ (S. pombe) homolog (WEE1), mRNA 


6E-10 


1576 


AK025037 


Homo sapiens cDNA: FLJ21384 fis, clone COL03354 


1.7 


1577 


Z71443 


S.cerevisiae chromosome XIV reading frame ORF YNL167c 


0.014 


1578 


AF263518 


Arabidopsis thaliana protein synthesis initiation factor 4G (EIF4G) 
gene, complete cds 


0.59 


1579 


AE000680 


Aquifex aeolicus section 12 of 109 of the complete genome 


1.2 


1580 


AE001404 


Plasmodium falciparum chromosome 2, section 41 of 73 of the 
complete sequence 


1.5 


1581 


AJ390510 


Candida albicans partial mRNA for serine/threonine kinase (sha3 
gene) 


0.18 


1582 


AC004173 


Homo sapiens clone UWGC:y23x011 from 6p21, complete sequence 


0.002 


1583 


Z71687 


S.cerevisiae chromosome XTV reading frame ORF YNR072w 


2.1 


1585 


X95065 


Rsapiens dinucleotide repeat (GT) from chromosome 13q (229bp) 


0.18 


1586 


Z24310 


H. sapiens (D10S593) DNA segment containing (CA) repeat; clone 
AFM3 1 lybl; single read 


0.002 


1587 


Y18476 


Trichophyton rubrum mitochondrial cytb gene and NADH1 to 
NADH5 genes 


6.9 


1588 


Z47046 


Human cosmid QLL2C9 from Xq28 


0.00007 


1589 


Z71687 


S.cerevisiae chromosome XIV reading frame ORF YNR072w 


1.6 


1590 


XM 005485 


Homo sapiens KIAA0414 protein (KIAA0414), mRNA 


0.002 | 


1591 


NM 024364 


Rattus norvegicus hairless (hi), mRNA 


0.22 


1593 


AF136150 


Caenorhabditis elegans homogentisate 1,2-dioxygenase (hgo) gene, 
complete cds 


L6 


1594 


XM 033896 


Homo sapiens insulin-like growth factor 2 receptor (IGF2R), mRNA 


1.4 


1595 


AJ404228 


Saccharomyces douglasii mitochondrial tRNA gene cluster 


0.008 


1596 


BC007460 


Homo sapiens, acid phosphatase, prostate, clone MGC: 12236 
IMAGE:3951204, mRNA, complete cds 


0.029 


1597 


NM 024201 


Mus musculus RKEN cDNA 06 100 1 1N22 gene (061001 lN22Rik), 
mRNA 


5.3 


1598 


U88166 


Caenorhabditis elegans cosmid M01A12 


0.61 


1599 


AE001394 


Plasmodium falciparum chromosome 2, section 3 1 of 73 of the 
complete sequence 


2 



273 



3NSDOCID: <WO 021 450OA2_l_> 



WO 02/14500 



PCT7US01/25840 





Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1600 


AJ306417 


Coiynebacterium glutamicum partial nrdF gene for ribonucleotide 
reductase beta-subunit and ctaD gene for cytochrome aa3 oxidase 
subunit I 


1.9 


1601 


AF287270 


Homo sapiens mucolipin (MCOLN1) gene, complete cds 


6 


1602 


AE002207 


Chlamydophila pneumoniae AR39, section 37 of 94 of the complete 
genome 


0.083 


1603 


AK003449 


Mus musculus 18 days embryo cDNA, RIKEN full-length enriched 
library, clone: 1110004P21, full insert sequence 


0.2 


1604 


AB008103 


Arabidopsis thaliana AtERF-1 mRNA for ethylene responsive element 
binding factor 1, complete cds 


0.79 


1605 


AE006406 


Lactococcus lactis subsp. lactis IL1403 section 168 of 218 of the 
complete genome 


0.18 


1606 


BC008122 


Homo sapiens, clone MGC: 18053 IMAGE:4 148889, mRNA, 
complete cds 


1.5 


1607 


D28598 


Chicken mRNA for c-maf proto-oncogene product c-Maf long form, 
complete cds 


0.44 


1608 


X97314 


Msativa mRNA for cdc2 kinase homologue, cdc2MsC 


1.5 


1609 


AP001407 


Homo sapiens genomic DNA, chromosome 21q22.2, clone:10C6, 
LB7T-ERG region, complete sequence 


1.1 


1611 


XM 017891 


Homo sapiens similar to heterogeneous nuclear ribonucleoprotein L 
(H. sapiens) (LOC91538), mRNA 


1.4 


1612 


U48562 


Corrigiola littoralis ORF2280 gene homolog, chloroplast gene 
encoding chloroplast protein, partial cds 


0.074 


1613 


XM 036274 


Homo sapiens similar to tumor protein p53-binding protein (H. 
sapiens) (LOC65591), mRNA 


2 


1614 


AJ271039 


Plasmodium yoelii yoelii partial mrk gene for M015-related kinase 


0.001 


1615 


AB016467 


Datisca cannabina chloroplast gene for maturase K, partial cds 


0.25 


I 1616 


AF027807 


Homo sapiens beta-casein (CSN2) gene, complete cds 


0.074 


1617 


AK002822 


Mus musculus adult male kidney cDNA, RKEN full-length enriched 
library, clone:0610039A20, full insert sequence 


0.55 


1618 


U67580 


Methanococcus jannaschii section 122 of 150 of the complete genome 


0.032 


1619 


U89140 


Mus musculus aldose reductase gene, promoter region and exon 1 


0.48 


1620 


AE002799 


Drosophila melanogaster genomic scaffold 142000013385566, 
complete sequence 


0.17 


1621 


L14855 


Bos taurus neurexin I-alpha mRNA, complete cds 


0.48 


1622 


AF379854 


Cloning vector pVLH/hsp, complete sequence 


2 


1623 


Z11883 


S.tuberosum UlsnRNA variant genes Ul-1, Ul-2, Ul-3, Ul-4, Ul-5 
andUl-6 


0.54 


1624 


AJ294727 


Mus musculus ILTTFa gene for IL-TTF alpha protein (IL-22), exons la 
5 


0.011 


1625 


XM 043073 


Homo sapiens hypothetical protein DKFZp434A1319 
(DKFZP434A1319), mRNA 


1.9 
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SEQED 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1626 


M84732 


Plasmodium yoelii sporozoite surface protein 2 gene , complete cds 


0.007 


1627 


M83304 


G.hirsutum late embryogenesis-abundant protein 2-D (LeaA2-D) 
mRNA, complete cds 


9.8 


1628 


U32725 


Haemophilus influenzae Rd section 40 of 163 of the complete genome 


0.14 


1629 


AF185589 


Homo sapiens cytochrome P450 3 A4 (CYP3 A4) gene, promoter 
region 


0.027 


1630 


AL591667 


Human DNA sequence from clone RP11-389N9 on chromosome 6, 
complete sequence [Homo sapiens] 


0.15 


1631 


AB058397 


Oryza sativa CHS gene for chalcone synthase, complete cds 


0.16 


1632 


Ml 1043 


Mouse DNA fragment that hybridizes to HSV-1 Smal A fragment 


1.5 


1633 


AF145699 


Staphylococcus aureus putative membrane protein MprF (mprF) gene, 
complete cds; and unknown gene 


1.5 


1634 


AF088902 


Mus musculus SLIT1 protein (Slitl) mRNA, partial cds 


0.5 


1635 


AB048882 


Macaca fascicularis brain cDNA, clone:QnpA-16525 


0.21 | 


1636 


X98369 


M.musculus mRNA for SOX15 protein 


0.24 


1637 


AF131767 


Homo sapiens clone 24900 mRNA sequence 


0.25 


1638 


AL 117657 


Homo sapiens mRNA; cDNADKFZp586F1924 (from clone 
DKFZp586F1924) 


1.9 


1639 


AB042973 


Nicotiana tabacum wizz gene, 5* flanking region, partial cds 


0.072 


1640 


AJ272083 


Staphylococcus aureus empbp gene for extracellular matrix and 
plasma binding protein, strain SA 1 13 


0.008 


1641 


BC009423 


Homo sapiens, clone MGC: 15151 IMAGE:3538202, mRNA, 
complete cds 


1.9 


1642 


AF1 14905 


Saccharomyces sp. CID1 ATP synthase subunit 8 (ATP8) gene, 
mitochondrial gene encoding mitochondrial protein, complete cds 


0.18 


1643 


AF270417 


Staphylococcus epidermidis strain SRI clone step.4052d03a genomic 
sequence 


0.063 


1644 


X91340 


G.hybrida chs-like gene 


0.51 


1645 


AF097026 


Homo sapiens autosomal highly conserved protein (AHCP) gene, 
complete cds 


0.008 


1646 


XM 028810 


Homo sapiens 67292 (KIAA1755), mRNA 


5.5 


1647 


AL049339 


Homo sapiens mRNA; cDNA DKFZp564P046 (from clone 
DKFZp564P046) 


2.1 


1648 


AE001398 


Plasmodium falciparum chromosome 2, section 35 of 73 of the 
complete sequence 


0.13 


1649 


M55171 


Mouse opsin (MOPS) gene, complete cds 


0.52 


1650 


AB008103 


Arabidopsis thaliana AtERF-1 mRNA for ethylene responsive element 
binding factor 1, complete cds 


0.66 


1651 


AL589705 


Human DNA sequence from clone RP11-64M7 on chromosome 6, 
complete sequence [Homo sapiens] 


0.5 


1652 


X79559 


T.trypanoides (Rl) 18S rRNA gene 


0.069 


1653 


XM 048416 


Homo sapiens PRO0478 protein (PRO0478), mRNA 


0.0001 
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SEQID 
NO 


ACCESSN 


DESCRJP 


P VALUE 


1654 


AF082761 


Dasyurus maculatus isolate QM.E1 mitochondrial D-loop, partial 
sequence 


0.063 


1655 


AF282568 


Homo sapiens clone 20ptel_cl5bt7 sequence 


4E-39 


1656 


AB027617 


Pyrus pyrifolia gene for UDP-glucose pyrophosphorylase, complete 
cds 


0.19 


1657 


AK023864 


Homo sapiens cDNA FLJ13802 fis 5 clone THYRO1000186 


0.2 


1658 


NMJ)09905 


Mus musculus CDC-like kinase (Clk), mRNA 


1.7 1 


1659 


AL023846 


Caenorhabditis elegans cosmid Y52B11C, complete sequence 


2.3 


1660 


AF334780 


Homo sapiens pregnancy-induced growth inhibitor OKL38 gene, 
partial cds 


0.73 


1661 


XM 042066 


Homo sapiens mitogen-activated protein kinase kinase kinase 1 
(MAP3K1), mRNA 


1.9 


1662 


XM_034872 


Homo sapiens KIAA0202 protein (KIAA0202), mRNA 


0.23 


1663 


AC002252 


Homo sapiens (subclone l_g7 from BAC H76) DNA sequence, 
complete sequence 


3E-14 


1664 


AK023113 


Homo sapiens cDNA FLJ13051 As, clone NT2RP3001447 


2E-20 | 


1665 


BC003552 


Homo sapiens, calnexin, clone MGC:1196 IMAGE:3546389, mRNA, 
complete cds 


0.076 


1666 


AF097142 


Tapinaspis sp. 12S small, subunit ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


1.5 


1667 


AL1 17103 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen 
deprivation 


6.1 


1668 


NM 013825 


Mus musculus lymphocyte antigen 75 (Ly75), mRNA 


0.74 


1669 


NMJ)07043 


Homo sapiens HIV-1 rev binding protein 2 (HRB2), mRNA 


7E-22 


1670 


Z27359 


G.hirsutum (AD52 ) copia-like reverse transcriptase fragments 


0.24 


1671 


AK021535 


Homo sapiens cDNA FLJ11473 fis, clone HEMBA1001712 


0.25 


1672 


Y00145 


Dictyostelium discoideum UDP glucose pyrophosphorylase gene 
(UDPGP; EC 2.7.7.9) 


0.22 


1673 


M63291 


Drosophila silvestris alcohol dehydrogenase gene, complete cds 


0.028 


1674 


AF100401 


Influenza B virus B/Guangdong/5/94 segment 8 nonstructural protein 
(NS1) and nonstructural protein (NS2) genes, complete cds 


0.2 


1675 


U80449 


Caenorhabditis elegans cosmid R10A10 


1.6 


1676 


X15634 


C. elegans core histone genes H4(his-10) and H3(his-9) 


0.022 


1677 


XM 050366 


Homo sapiens zinc finger protein, X-linked (ZFX), mRNA 


0.23 


1678 


AE007046 


Mycobacterium tuberculosis CDC1551, section 132 of 280 of the 
complete genome 


0.062 


1679 


XM 008207 


Homo sapiens glucose-6-phosphatase, catalytic (glycogen storage 
disease type I, von Gierke disease) (G6PC), mRNA 


4E-18 


1680 


AB049118 


Hemicentrotus pulcherrimus mRNA for transcription factor HpLiml, 
complete cds 


0.21 


1681 


XM 002421 


Homo sapiens hypothetical protein FLJ20254 (FLJ20254), mRNA 


4.1 


1682 


X96783 


Rsapiens Syt V gene (genomic and cDNA sequence) 


0.000004 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1683 


U79732 


Plasmodium berghei extrachromosomal plastid PB-2, tRNA-Pro, 
tRNA-Glu tRNA-Lvs tRNA-Aso tRNA-Ser tRNA-Tyr, tRNA-Met 
tRNA-Leu. tRNA-Cvs and tRNA-His eenes comDlete sequence. ros4 
gene, complete cds, tRNA-Thr gene, complete sequence, and large 
subu> 


0.001 


1684 


NCJ)02619 


Pteropus scapulatus mitochondrion, complete genome 


6.3 


1685 


L16679 


Caenorhabditis elegans cosmid K07D8, complete sequence 


1.8 


1686 


Y08924 


P.falciparum mRNA for AARP2 protein 


0.003 


1687 


U32760 


Haemophilus influenzae Rd section 75 of 163 of the complete genome 


2.2 | 


1688 


AF298624 


Dictyostelium discoideum chromosome 2 repeat region 


0.009 


1689 


XM 031155 


Homo sapiens RAS protein activator like 2 (RASAL2), mRNA | 


0.33 


1690 


AE006899 


Sulfolobus solfataricus section 258 of 272 of the complete genome 


5 


1691 


U67556 


Methanococcus jannaschii section 98 of 150 of the complete genome 


0.42 


1692 


AKO 12631 


Mus musculus 10, 11 days embryo cDNA, RIKEN full-length 
enriched library, clone:2810001G20, full insert sequence 


0.064 


1693 


BC007399 


Homo sapiens, clone MGC16308 IMAGE:3836116, mRNA, 
complete cds 


0.022 


1694 


AL389897 


Human DNA sequence from clone RP11-110C10 on chromosome 20. 
Contains GSSs, complete sequence [Homo sapiens] 


1E-26 


1695 


Z62421 


Rsapiens CpG island DNA genomic Msel fragment, clone 68h2, 
forward read cpg68h2.ftla 


4E-70 


1696 


i BC010535 


Homo sapiens, clone IMAGE:3456494, mRNA 


0.002 


! 1697 


Z68319 


Caenorhabditis elegans cosmid T23G7, complete sequence 


0.18 


1698 


U48363 


Mus musculus transcriptional activator alpha-NAC (Naca) gene, 
complete cds 


0.058 


1699 


AF135430 


Ustilago nuda 18S ribosomal RNA gene, partial sequence; internal 
transcribed soacer 1 5 8S ribosomal RNA gene and internal 
transcribed spacer 2, complete sequence; and 28S ribosomal RNA 
gene, partial sequence 


1.4 


1701 


XM 007461 


Homo sapiens similar to enhancer of rudimentary (Drosophila) 
homolog (H. sapiens) (LOC65719), mRNA 


0.18 


1702 


NM 025083 


Homo sapiens hypothetical protein FLT21128 (FU21128), mRNA 


e-124 


1703 


XM_050592 


Homo sapiens uncharacterized hematopoietic stem/progenitor cells 
protein MDS032 (MDS032), mRNA 


0.068 


1704 


AF277167 


Homo sapiens Makorin RING zinc-finger protein 2 (MKRN2) gene, 
exon4 


2 


1705 


AE001433 


Plasmodium falciparum chromosome 2, section 70 of 73^ of the 
complete sequence 


0.67 


1706 


AF075003 


Homo sapiens full length insert cDNA YH97B03 


0.001 


1707 


NM 011378 


Mus musculus transcriptional regulator, SIN3A (yeast) (Sin3a), 
mRNA 


0.62 
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1708 


AE001321 


Chlamydia trachomatis section 48 of 87 of the complete genome 


0.026 


1709 


XM 046388 


Homo sapiens similar to elongation of very long chain fatty acids 
(FENl/Elo2, SUR4/Elo3, yeast)-like 1 (HL sapiens) (LOC92652), 
mRNA 


6.1 


1710 


S55685 


MOR6.5=ouabain resistance gene {repeat sequence} [mice, Genomic, 
6395 nt] 


5.7 


1711 


AB055378 


Macaca fascicularis brain cDNA, clone:QHA-13294 


0.22 


1712 


XM 043597 


Homo sapiens centaurin-alpha 2 protein (HSA272195), mRNA 


1E-20 


1713 


AE007426 


Streptococcus pneumoniae section 109 of 194 of the complete genome 


0.32 


1714 


AK016519 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4932411G06, full insert sequence 


e-129 


1715 


AL122033 


S.pombe chromosome II cosmid cl921 


0.53 


1716 


AK012960 


Mus musculus 10, 11 days embryo cDNA, RIKEN full-length 
enriched library, clone:2810401G03, full insert sequence 


0.075 


1717 


AL590123 


Human DNA sequence from clone RP1 1-473 J14 on chromosome X, 
complete sequence [Homo sapiens] 


0.19 


1718 


D17343 


Rattus norvegicus cytochrome P450 2B15 gene, exon 1 


0.66 


1719 


XM 028703 


Homo sapiens KIAA0889 protein (KIAA0889), mRNA 


0.00004 


1720 


AB060865 


Macaca fascicularis brain cDNA clone:QtrA-11953, foil insert 
sequence 


2.3 


1721 


AL133397 


Human DNA sequence from clone RPl-213Jlp on chromosome 
22ql 1.21-12.2, complete sequence (Homo sapiens] 


0.083 


1722 


X58913 


Mitochondrion Drosophila eugracilis ND2 and COI genes (partial) 
and genes for tRNA-Trp, tRNA-Tyr, and tRNA-Cys 


0.074 


1723 


AC002359 


Homo sapiens Xp22 Cosmid U239B3 (from Lawrence Livermore X 
library) complete sequence 


0.0008 


1724 


AE006322 


Lactococcus lactis subsp. lactis EL1403 section 84 of 218 of the 
complete genome 


1.8 


1725 


Z71290 


S.cerevisiae chromosome XIV reading frame ORF YNL014w 


0.67 


1726 


XM_005723 


Homo sapiens hypothetical protein PR02859 (PR02859), mRNA 


0.014 


1727 


AF123488 


South River virus polyprotein mRNA, complete cds 


0.57 


1728 


D78176 


Mouse DNA for primary transcript of atrial natriuretic clearance 
receptor 


0.061 


1 1730 


XM 006871 


Homo sapiens ethanolamine kinase (EKI1), mRNA 


4.1 


1731 


AL078640 


Human DNA sequence from clone 536P6 on chromosome 22. 
Contains an EST, complete sequence [Homo sapiens] 


0.07 


1732 


AC092030 


Homo sapiens clone RP1 1-636112, complete sequence 


0.029 


1733 


AK003349 


Mus musculus 18 days embryo cDNA, RIKEN full-length enriched 
library, clone: 1110003F05, full insert sequence 


0.16 


| 1734 


AF131767 


Homo sapiens clone 24900 mRNA sequence 


0.2 j 


1735 


Z74246 


S.cerevisiae chromosome IV reading frame ORF YDL198c 


0.023 


1736 


NM__021401 


Mus musculus secreted and transmembrane 1 (Sectml), mRNA 


0.069 


1737 


AF213931 


Buchnera aphidicola DNA polymerase IQ beta subunit (dnaN) gene, 
complete cds 


0.68 
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1738 


AF288892 


Homo sapiens clone 20r DH4-17/BCL-1 gene fusion reciprocal 
breakpoint sequence 


2 


1739 


X99719 


S.enterica hsdM, hsdS & hsdR genes 


0.23 


1740 


XMJH1866 


Homo sapiens KIAA1468 protein (KIAA1468), mRNA 


1 5.9 


1741 


AY010113 


Homo sapiens unknown mRNA sequence 


0.64 


1742 


AK009187 


Mus musculus adult male tongue cDNA, RKEN full-length enriched 
library, clone:23 10006124, full insert sequence 


4E-23 


1743 


AF130074 


Homo sapiens clone FLB9348 PR02523 mRNA, complete cds 


0.14 


1744 


AJ249165 


Plasmodium falciparum gc-beta gene, exons 1-13 


0.002 


1745 


AK014565 


Mus musculus 0 day neonate skin cDNA, RIKEN full-length enriched 
library, clone:4632411B12, full insert sequence 


0.54 


1746 


M15058 


Vaccinia virus (strain WR) Hindm D fragment DNA, complete 


0.15 


1747 


Z16416 


M.pulmonis DNA for insertion sequence IS1138 encoding putative 
transposase 


0.026 


1748 


XM_047638 


Homo sapiens Fzrl protein (FZR1), mRNA 


1.2 j 


1750 


U21853 


Anabaena sp. phycobilisome core component (apcF) gene, complete 
cds, and glutamine synthetase (glnA) gene, partial cds 


0.076 


1751 


AF169454 


Neisseria meningitidis strain Z2491 clone Cm045 unknown sequence 


0.19 


1752 


AF102750 


Apanteles canarsiae 16S large subunit ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


0.066 


1753 


AF034389 


Plasmodium falciparum sexual stage antigen (sl6) gene, promoter and 
partial cds 


0.18 


1754 


AB047239 


Staphylococcus aureus DNA, complete structure of cassette 
chromosome(SCC)-like element, strain: ATCC25 923 


1.5 


1755 


X98669 


A.thaliana zatl gene 


0.023 


1756 


X16619 


Chlamydomonas reinhardtii ARG7 gene for argininosuccinate lyase 


1.5 


1757 


XM 005488 


Homo sapiens KIAA0354 gene product (KIAA0354), mRNA 


0.063 


1758 


AF1 10966 


HTV-1 isolate C-96BW04.10 country Botswana, complete genome 


1.1 


1759 


AF247193 


Mus musculus endobrevin (Vamp8) gene, exon 1 


5.4 


1760 


XM_045302 


Homo sapiens proteasome (prosome, macropain) 26S subunit, non- 
ATPase, 5 (PSMD5), mRNA 


0.0008 


1761 


AE006350 


Lactococcus lactis subsp. lactis IL1403 section 1 12 of 218 of the 
complete genome 


0.69 


1762 


AJ247001 


Anguilla anguilla mitochondrial DNA for partial tRNA-Thr and 
tRNA-Pro genes and D-loop, isolate RM1 1 


0.052 


1763 


AB024996 


Cicer arietinum mRNA for rac-type small GTP-binding protein, 
complete cds 


1.6 


1764 


M13945 


Mus musculus pim-1 protein kinase gene, complete cds 


5.7 


1765 


AP106842 


Platanus racemosa PlaraLFY protein (PlaraLFY) gene, complete cds 


0.22 


1766 


AF269822 


Staphylococcus epidermidis strain SRI clone step. 1024f03 genomic , 
sequence 


7.5 


1767 


AF256830 


Salmo salar clone BHMS549 microsatellite sequence 


1.8 
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1768 


AE007395 


Streptococcus pneumoniae section 78 of 194 of the complete genome 


1.6 


1769 


AB028896 


Streptococcus agalactiae type-specific capsular polysaccharide 
biosynthesis gene region (cpsIaA to ung gene), complete cds 


0.025 


1770 


AL583831 


Human DNA sequence from clone RP11-469L10 on chromosome 6, 
complete sequence [Homo sapiens] 


0.15 


1771 


AL354683 


Human DNA sequence from clone RP1 1-259K1 1 on chromosome 20, 
complete sequence [Homo sapiens] 


0.21 


1772 


XM 047524 


Homo sapiens procollagen (type HI) N-endopeptidase (PCOLN3), 
mRNA 


4.6 


1773 


AC084500 


Caenorhabditis briggsae cosmid G13G15, complete sequence 


0.076 


1774 


M13945 


Mus musculus pim-1 protein kinase gene, complete cds 


6.1 


1775 


L27474 


Human arginase (ARG1) gene dinucleotide repeat polymorphism, 
intron 5 


0.21 


1776 


Z79696 


Caenorhabditis elegans cosmid F54F3, complete sequence 


1.9 


1777 


AYO 18282 


Oryza sativa microsatellite MRG0607 containing (AT)X14, closest to 
marker CI 32, genomic sequence 


0.071 


1778 


AF114938 


Saccharomyces sp. IFO 1815 small subunit ribosomal RNA gene, 
mitochondrial gene for mitochondrial RNA, complete sequence 


0.078 


1779 


AF170085 


Homo sapiens Charcot-Leyden crystal protein gene, partial sequence 


6E-18 


1780 


Y08919 


P.flesus mRNA for p53 protein 


0.082 


1781 


L46868 


Trypanosoma brucei dominant expression site (DES) gene, promoter 
region 


2.2 


1782 


AF157008 


Prunus dulcis self-incompatibility associated ribonuclease gene, 
complete cds 


0.44 


1783 


AB017459 


Carabus sylvestris mitochondrial ND5 gene, isolate: Baumgartner 
Austria, partial cds 


0.029 


1784 


AJ294714 


Pichia anomala leu2 gene for beta-isopropylmalate dehydrogenase 


0.26 


1785 


AF125972 


Caenorhabditis elegans cosmid H25P19 


0.64 


1786 


AB015413 


Helicobacter pylori cagA gene, complete cds, strain ATCC43526 


1.5 


1787 


AK021477 


Homo sapiens cDNA FLJ11415 fls, clone HEMBA1000942 


7E-12 j 


1788 


AC006812 


Caenorhabditis elegans cosmid Y64H9A, complete sequence 


0.63 


1789 


XM_004181 


Homo sapiens megakaryocyte-enhanced gene transcript 1 protein 
(MEGT1), mRNA 


7E-66 


1790 


AJ236640 


Homo sapiens chromosome 22 CpG island DNA, genomic Msel 
fragment, clone 22CGIB49A20 , complete read 


9E-19 


1791 


XMJ)41751 


Homo sapiens discoidin domain receptor family, member 1 (DDR1), 
mRNA 


3.5 


1792 


AL034546 


Human DNA sequence from clone RP5-898I4 on chromosome 
22ql3.33 Contains a GSS and a putative CpG island, complete 
sequence [Homo sapiens] 


0.000005 


1793 


AJ006873 


Buchnera aphidicola plasmid pBUsl, repA2 gene, leuA gene and 
ORF1 


0.23 



280 



1NSDOCID: <WO 021 4500A2_I_> 



WO 02/14500 



PCT7US01/25840 





Table 3A Nearest Neighbor (BlastN vs. Geribank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1794 


X79779 


S.tuberosum KST1 mRNA 


0.67 


1795 


AF371373 


Cricetulus griseus hypothetical protein Al-3 mRNA, complete cds 


0.61 


1796 


AF165082 


Pollachius virens cytochrome b gene, partial cds; mitochondrial gene 
for mitochondrial product 


0.53 


1797 


AE007418 


Streptococcus pneumoniae section 101 of 194 of the complete genome 


0.027 


1798 


X67082 


Rsapiens AMBP gene transcription regulatory region 


0.006 


1799 


AL1 17655 


Homo sapiens mRNA; cDNA DKFZp586E2024 (from clone 
DKFZp586E2024) 


5.2 


1800 


AB048972 


Macaca fascicularis brain cDNA, clone:QnpA-12040 


0.53 


1801 


XMJ)30865 


Homo sapiens M-phase phosphoprotein 10 (U3 small nucleolar 
ribonucleoprotein) (MPHOSPH10), mRNA 


0.18 


1802 


AF123488 


South River virus polyprotein mRNA, complete cds 


0.49 


1803 


AB02759O 


Pachliopta aristolochiae mitochondrial mRNA for NADH 
dehydrogenase subunit 5, partial cds 


0.024 


1804 


J00889 


Chicken cellular myc proto-oncogene, complete cds 


0.008 


1805 


XM 030024 


Homo sapiens similar to protein kinase C binding protein 1 (H 
sapiens) (LOC90218), mRNA 


0.009 


1806 


AB023486 


Homo sapiens gene for histamine H2 receptor, promoter region and 
complete cds 


0.064 


1807 


D21196 


Dicryostelium discoideum mitochondrial DNA 


0.21 


1808 


AY035029 


Arabidopsis thaliana putative dihydroorotate dehydrogenase 
(MEB5.3/AT3gl7810) mRNA, complete cds 


5.3 


1809 


AB047389 


Clostridium difficile purL gene for formylglycinamidine 
ribonucleotide synthetase (FGAM synthetase), complete cds 


0.019 


1810 


AK012033 


Mus musculus 10 days embryo cDNA, RIKEN full-length enriched 
library, clone:2610315E15, full insert sequence 


2.2 


1811 


AF044202 


Drosophila melanogaster glutamate receptor DGluRUB mRNA, 
complete cds 


0.008 


1812 


AK026967 


Homo sapiens cDNA: FLJ233 14 fis, clone HEP1 1989 


0.66 


1813 


AB050432 


Macaca fascicularis brain cDNA, clone:QnpA-21861 


0.4 


1814 


X16871 


Human DNA for elongation factor 1 -alpha (clone lambda-4) 


4.7 


1815 


AE000001 


Mycoplasma pneumoniae Ml 29 section 16 of 63 of the complete 
genome 


1.9 


1816 


AK002671 


Mus musculus adult male kidney cDNA, RIKEN full-length enriched 
library, clone:0610025N14, full insert sequence 


0.052 


1817 


AF202903 


Srwropachydermia cereana var. D77-321B internal transcribed spacer 
1, 5.8S ribosomal RNA gene and internal transcribed spacer 2, 
complete sequence; and large subunit ribosomal RNA gene, partial 
sequence 


0.5 


1819 


L42167 


Mus musculus (clone R24) rds gene, partial cds 


0.16 


1820 


XM_027078 


Homo sapiens ATPase, Class V, type 10D (ATP10D), mRNA 


0.16 


1821 


L27619 


Macaca nemestrina T-cell receptor beta chain V region (Vbl5. 1) 
gene, partial cds 


0.016 
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1822 


BC003900 


Mus musculus, Similar to hypothetical protein 384D8_6, clone 
MGC:6766 IMAGE:3601298, mRNA, complete cds 


0.51 


1823 


XM 050561 


Homo sapiens KIAA0700 protein (KIAA0700), mRNA 


0.51 


1824 


S66407 


FLT4=receptor tyrosine kinase isofonn FLT4 long {3* region, 
alternatively spliced} [human, mRNA Partial, 216 nt] 


8E-36 


1825 


AY008805 


Equus caballus retinoblastoma (RBI) gene, partial cds 


0.058 | 


1826 


AJ276629 


Rattus norvegicus Sacm21/RT1-A intergenic region, haplotype RTln 
and partial RT1-A gene for MHC Class I antigen 


3.7 


1827 


AF029975 


Eucalyptus grandis MADS box protein (EGM1) mRNA, complete cds 


1.2 


1828 


AL109708 


Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 
254679 


0.019 


1829 


AF270353 


Staphylococcus epidermidis strain SRI clone step.4025a02 genomic 
sequence 


0.051 


| 1831 


U72236 


Dictyostelium discoideum ModA (modA) gene, complete cds 


0.5 


1832 


L15234 


Mycoplasma pneumoniae Cys-tRNA, Pro-tRNA, Met-tRNA, lie- 
tRNA, Ser-tRNA, fMet-tRNA, Asp-tRNA and Phe-tRNA genes 


1.1 


1833 


XM_012722 


Homo sapiens RNA (guanine-7-) methyltransferase (RNMT), mRNA 


0.5 


1834 


AF108228 


Ginkgo biloba GinLFY protein (GinLFY) gene, complete cds 


4.2 


1836 


XM_049943 


Homo sapiens topoisomerase (DNA) II alpha (170kD) (TOP2A), 
mRNA 


0.47 


1837 


NC 002674 


Aulopus japonicus mitochondrion, complete genome 


3.1 


1838 


AE001399 


Plasmodium falciparum chromosome 2, section 36 of 73 of the 
complete sequence 


0.008 


1839 


U53695 


Leishmania enriettii multidrug resistance (lemdrl) gene, 5' UTR 


0.5 


1840 


AF257022 


Salmo salar clone BHMS7-009 microsatellite sequence 


0.7 


1841 


AF239571 


Acineta supeiba tRNA-Leu (trnL) gene, partial sequence; and trnL-F 
intergenic spacer region, complete sequence; chloroplast gene for 
chloroplast product 


5.4 


1842 


AL032651 


Caenorhabditis elegans cosmid Y6D1A, complete sequence 


0.019 


1844 


XM 034229 


Homo sapiens similar to hypothetical protein FLJ14260 (H. sapiens) 
(LOC90803), mRNA 


0.068 


1845 


AL137121 


Human DNA sequence from clone RP1 1-122K21 on chromosome 13, 
complete sequence [Homo sapiens] 


e-147 


1846 


AF137535 


Danio rerio clone Dare-D AXX DAXX protein mRNA, partial cds 


1.7 


1847 


AK023539 


Homo sapiens cDNA FLJ13477 fis, clone PLACE1003638 


0.21 


1848 


AF218799 


Gallus gallus clone LDH-BT1 lactate dehydrogenase B mRNA, 
complete cds 


6.4 


1849 


AF250345 


Xenopus laevis early growth response protein mRNA, complete cds 


0.52 


1850 


BC006858 


Mus musculus, clone IMAGE:3594635, mRNA 


1.5 


1852 


XM 006782 


Homo sapiens cyclin Tl (CCNT1), mRNA 


0.092 


1853 


J03883 


Rabbit zeta-1 globin processed pseudogene, complete cds 


1.5 
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1854 


AF349951 


Homo sapiens HP95 mRNA, complete cds 


3E-09 


1855 


AF268062 


Candidatus Carsonella ruddii natural-host Bactericera cockerelli RNA 
polymerase beta subunit (rpoB) and RNA polymerase beta-prime 
subunit (rpoC) genes, partial cds 


0.17 


1856 


AF1 14508 


Uncultured gamma proteobacterium SUR-ATT-8 16S ribosomal RNA 
gene, complete sequence 


1.5 


1857 


AE006785 


Sulfolobus solfataricus section 144 of 272 of the complete genome 


0.091 


1858 


AF078684 


Psoroptes ovis glutathione S-transferase mRNA, complete cds 


0.0006 


1859 


BC004671 


Mus musculus, FK506 binding protein la (12 kDa), clone MGC:6253 
IMAGE:3498682, mRNA, complete cds 


0.12 


1860 


NM 025083 


Homo sapiens hypothetical protein FLJ21128 (FLJ21128), mRNA 


e-121 


1861 


BC004161 


Homo sapiens, transmembrane 4 superfamily member (tetraspan NET- 
7), clone MGC:2447 IMAGE:2958221, mRNA, complete cds 


0.051 


1862 


BC009855 


Homo sapiens, Similar to nucleolin, clone MGC:16354 
IMAGE:3926227, mRNA, complete cds 


e-171 


1863 


AKO 16507 


Mus musculus adult male testis cDNA, RDCEN full-length enriched 
library, clone:4931439C15, full insert sequence 


0.019 


1864 


AF334756 


Homo sapiens interleukin-1 HY2 (IL1HY2) gene, complete cds 


1.5 


1865 


Z68881 


Human DNA sequence from cosmid L118G10, Huntington's Disease 
Region, chromosome 4pl6.3 


0.005 


1866 


AJ277737 


Homo sapiens partial POLR2J2 gene for RPB1 lbl protein, intron 4 
(1824 BP) 


0.0000001 


1867 


XM 051572 


Homo sapiens heat shock transcription factor 2 (HSF2), mRNA 


1E-11 


1868 


AF256918 


Salmo salar clone BHMS267 microsatellite sequence 


0.11 


1869 


AE000557 


Helicobacter pylori 26695 section 35 of 134 of the complete genome 


0.17 


1870 


XM 050469 


Homo sapiens ferritin, light polypeptide (FTL), mRNA 


0.081 


1871 


XM 042818 


Homo sapiens homeo box D4 (HOXD4), mRNA 


0.24 


1872 


AB035093 


Petunia x hybrida mRNA for lateral shoot inducing factor, complete 
cds 


1.9 


1873 


AE006184 


Pasteurella multocida PM70 section 151 of 204 of the complete 
genome 


5.6 


1874 


AE006880 


Sulfolobus solfataricus section 239 of 272 of the complete genome 


3.3 


1875 


NC 002669 


Bacteriophage bIL310, complete genome 


4.7 


1876 


AK012924 


Mus musculus 10, 11 days embryo cDNA, RDCEN full-length 
enriched library, clone:2810048H02, full insert sequence 


1.8 


1877 


XM 032188 


Homo sapiens cell division cycle 27 (CDC27), mRNA 


5.5 


1878 


M33662 j 


Hamster G-o protein alpha subunit 2 mRNA, complete cds 


5.8 


1879 


Y00478 


L.esculentum LEAC02 gene 


0.28 


1880 


XM 050519 


Homo sapiens annexin A2 (ANXA2), mRNA 


8E-34 


1881 


AF054822 


Drosophila melanogaster activin beta precursor, gene, partial cds 


0.43 
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1882 


AB062967 


Macaca fascicularis brain cDNA clone:QmoA-10357, full insert 
sequence 


1.7 


1883 


D87001 


Homo sapiens immunoglobulin lambda gene locus DNA, clone:47H9 
downstream contig 


0.64 


1884 


AF378137 


Plasmodium falciparum clone g2-4-4#5 MB2 gene, partial cds 


0.081 


1885 


XM 011791 


Homo sapiens laminin, gamma 3 (LAMC3), mRNA 


3E-58 


1886 


XM_007214 


Homo sapiens hypothetical protein FLJ10956 (FLJ10956), mRNA 


e-119 


1887 


AF227741 


Rattus norvegicus protein kinase WNK1 (WNK1) mRNA, complete 
cds 


0.96 


1888 


AL359578 


Homo sapiens mRNA; cDNA DKFZp547N163 (from clone 
DKFZp547Nl63) 


0.063 


1889 


AE002841 


Drosophila melanogaster genomic scaffold 142000013385478, 
complete sequence 


0.19 


1890 


X01121 


Mycoplasma capricolum genes for ribosomal proteins S8 and L6 


0.5 


1892 


AF082924 


Trissolcus basalis cytochrome oxidase II gene, partial cds; and tRNA- 
Asp and tRNA-Lys genes, complete sequence; mitochondrial genes for 
mitochondrial products 


0.02 


1893 


BC005693 


Mus musculus, Similar to RIKEN cDNA 1 110002 A15 gene, clone 
MGC:11574 IMAGE:3597594, mRNA, complete cds 


1.4 


1894 


U29721 


Entamoeba histolytica p21racB (Eh racB) gene, partial cds 


0.15 


1895 


AK023959 


Homo sapiens cDNA FU13897 fis, clone THYRO1001706 


0.17 


1896 


XM 038475 


Homo sapiens DKFZP564J102 protein (DKFZP564J102), mRNA 


6E-33 


1897 


AL010164 


Plasmodium falciparum DNA *** SEQUENCING IN PROGRESS 
*** from contig 3-95, complete sequence 


1.1 


1898 


AF176059 


Helcon sp. 16S ribosomal RNA gene, partial sequence; mitochondrial 
gene for mitochondrial product 


0.009 


1899 


L34640 


Homo sapiens platelet/endothelial cell adhesion molecule-1 (PECAM- 
1) gene, exons 7, 8 and 9 


0.007 


1901 


AC026061 


Homo sapiens BAC clone RP1 1-223K9 from Y, complete sequence 


1.5 


1902 


AF188516 


Staphylococcus aureus Bmr-like protein SblA (sblA) gene, complete 
cds 


0.08 


1903 


AL360177 


Human DNA sequence from clone RP1 1-86K9 on chromosome 10, 
complete sequence [Homo sapiens] 


0.077 


1904 


L76261 


Meloidogyne javanica mitochondrial transfer RNA His, 16S ribosomal 
RNA (16S rRNA) genes, ND3 gene, complete cds, and cytochrome b 
gene, 5* end of CDS 


0.17 


1905 


XM 042904 


Homo sapiens mitochondrial ribosomal protein S25 (MRPS25), 
mRNA 


2 


! 1906 


XM 051831 


Homo sapiens similar to CYTOCHROME P450 51 (CYPL1) 
(P450L1) (STEROL 14-ALPHA DEMETHYLASE) (LANOSTEROL 
14-ALPHA DEMETHYLASE) (LDM) (P450-14DM) (H. sapiens) 
(LOC93525), mRNA 


0.66 
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P VALUE 


1907 


XM 012289 


Homo sapiens cytoskeleton associated protein 2 (CKAP2), mRNA 


2 


1908 


U44391 


Human Down Syndrome region of chromosome 21, clone A12H1-1B2 


0.008 


1909 


L04961 


Mouse nuclear-localized inactive X-specific transcript (Xist) mRNA 


1.8 


1910 


X61202 


Methanococcus voltae fruA, fruB, firuG genes for subunits of F420- 
reducing hydrogenase and fi*uD orf 


0.078 


1911 


AP000242 


Homo sapiens genomic DNA, chromosome 21q22.1, clone:f43Dll, 
SOD-AML region, complete sequence 


0.0001 


1912 


AL023777 


S.pombe chromosome in cosmid cl827 


0.66 


1913 


XM 038450 


Homo sapiens hypothetical protein FLJ20694 (FLJ20694), mRNA 


0 


1914 


AF129510 


Mus musculus DACH protein (Dach) mRNA, complete cds 


2.1 


1915 


AE001405 


Plasmodium falciparum chromosome 2, section 42 of 73 of the 
complete sequence 


0.18 


1916 


XM_039566 


Homo sapiens KIAA0948 protein (KIAA0948), mRNA 


6.8 \ 


1917 


AP001431 


Homo sapiens genomic DNA, chromosome 21q22.2, clone:T1492, 
LB7T-ERG region, complete sequence 


0.074 


1918 


AF329137 


Camelus bactrianus microsatellite Cmsl sequence 


0.008 


1919 


NM 003816 


Homo sapiens a disintegrin and metalloproteinase domain 9 (meltrin 
gamma) (ADAM9), mRNA 


1E-17 


1920 


AE001414 


Plasmodium falciparum chromosome 2, section 51 of 73 of the 
complete sequence 


0.001 


1921 


U95074 I 


Caenorhabditis elegans protein disulphide isomerase isoform I (pdi-1) 
and cyclophilin isoform 9 (cyp-9) genes, complete cds 


0.029 


1922 


ABO 19534 


Homo sapiens gene for cathepsin L2, complete cds 


0.07 


1923 


AJ133777 


Arabidopsis thaliana mRNA for gamma-adaptin 2 


6.6 


1924 


XM 001304 


Homo sapiens cyclin-dependent kinase inhibitor 2C (pl8, inhibits 
CDK4) (CDKN2C), mRNA 


0.25 


1925 


AB037166 


Clostridium botulinum genes for ORF-22, HA-70, HA-17, HA-33, 
NTNHA, neurotoxin, complete cds 


0.2 


1926 


Z48243 


A. thaliana PARP mRNA for PARP protein 


0.081 


1927 


XM 041006 


Homo sapiens choreoacanthocytosis gene; KIAA0986 protein 
(KIAA0986), mRNA 


0.25 


1928 


XM_050592 


Homo sapiens uncharacterized hematopoietic stem/progenitor cells 
protein MDS032 (MDS032), mRNA 


0.083 


1929 


U34610 


Mus musculus alpha- l(XVm) collagen (COL18A1) gene, exon 6 and 
7 


0.008 


1930 


BC007658 


Homo sapiens, clone MGC:747 IMAGE:3343994, mRNA, complete 
cds 


6E-50 


1931 


AL451135 


Human DNA sequence from clone RP1 1-45G20 on chromosome 6, 
complete sequence [Homo sapiens] 


2 


1932 


AB004883 


Marchantia paleacea mRNA for ribulose 1,5-bisphosphate 
carboxylase/oxygenase small subunit, complete cds 


0.26 
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1933 


AF019973 


Rattus norvegicus neuron-specific enolase (NSE) mRNA, complete 
cds 


0.22 


1934 


AF231684 


Drosophila melanogaster outstretched (os) gene, complete cds 


0.75 


1935 


AC079136 


Homo sapiens clone RP1 1-153L12, complete sequence ] 


0.75 


1936 


AE006611 


Streptococcus pyogenes Ml GAS strain SF370, section 140 of 167 of 
the complete genome 


1.5 


1937 


AE007418 


Streptococcus pneumoniae section 101 of 194 of the complete genome 


0.029 


1938 


L26949 


Simian immunodeficiency virus (T5) surface envelope glycoprotein 
proviral gene, 5* end 


0.027 


1939 


AF343914 


Campylobacter jejuni NCTC 11828 LOS biosynthesis cluster, partial 
sequence 


0.58 


1940 


Z69925 


Human DNA sequence from clone LL22NC03-116A5 on chromosome 
22 Contains GSSs, complete sequence [Homo sapiens] 


6.2 


1941 


U96876 


Homo sapiens insulin induced protein 1 (INSIG1) gene, complete cds 


0.78 


1942 


XM_030024 


Homo sapiens similar to protein kinase C binding protein 1 (H. 
sapiens) (LOC90218), mRNA 


0.009 


1943 


AF226993 


Rattus norvegicus selective LIM binding factor mRNA, complete cds 


0.7 


1944 


AF033037 


Plasmodium falciparum merozoite surface protein 4, merozoite 
surface protein 5, merozoite surface protein 2, and adenylosuccinate 
lyase genes, complete cds 


0.69 


1945 


AB053087 


Human immunodeficiency virus type 1 pol gene, reverse transcriptase 
amino terminal part, partial cds, 99XP-NH3-II plasma virus 
clone:NH3RT-V3-2-9 


0.028 


1946 


XM 033335 


Homo sapiens hypthetical protein PR02389 (PR02389), mRNA 


0.0001 


1947 


XM_0 10334 


Homo sapiens transducin (beta)-like 1 (TBL1), mRNA 


2.2 


1948 


AK003943 


Mus musculus 18 days embryo cDNA, RIKEN full-length enriched 
library, clone: 1 1 10028F1 1, full insert sequence 


2E-10 


1949 


AF085600 


Drosophila melanogaster inorganic pyrophosphatase NURF-38 (Nurf- 
38) gene, complete cds 


0.78 


1950 


U79731 


Plasmodium berghei extrachromosomal plastid PB-1, ORF470 gene, 
partial cds, tRNA-Thr, large subunit ribosomal RNA, tRNA-Met, 
tRNA-Arg, tRNA-Val, tRNA-Arg, tRNA-Leu, tRNA-Asn, tRNA-Ala, 
and small subunit ribosomal RNA genes, complete sequences 


0.72 


1951 


AK022877 


Homo sapiens cDNA FU12815 fis, clone NT2RP2002546 


0.003 


1952 


M62755 


Potato 4-coumarate-CoA ligase (St4Cl-l) gene, complete cds 


0.085 


1953 


AK017902 


Mus musculus adult male thymus cDNA, RIKEN full-length enriched 
library, clone:5830406J20, full insert sequence 


2.3 


1954 


AF164418 


Sorghum bicolor maturase (matK) gene, complete cds; chloroplast 
gene for chloroplast product 


0.078 


1955 


X54567 


Human MFD22 dinucleotide repeat DNA 


3E-13 
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NO 
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DESCRIP 


P VALUE 


1956 


AB056832 


Macaca fascicularis brain cDNA clone:QflA-14079, fall insert 
sequence 


0.00009 


1957 


XM 034132 


Homo sapiens ubiquitously transcribed tetratricopeptide repeat gene, 
Y chromosome (UTY), mRNA 


7 


1958 


L09706 


Homo sapiens complement component 2 (C2) gene allele b, exons 1 
through 8 


0.24 


1959 


XM 017231 


Homo sapiens hypothetical protein FLJ23342 (FLJ23342), mRNA 


8E-76 


1960 


X83989 


P.palustris chloroplast rbcL gene, promoter region 


0.003 


1961 


AK025090 


Homo sapiens cDNA: FLJ21437 fis, clone COL04285 


0 


1962 


AK013805 


Mus musculus adult male hippocampus cDNA, RIKEN full-length 
enriched library, clone:2900079F10, full insert sequence 


0.14 


1963 


AE006210 


Pasteurella multocida PM70 section 177 of 204 of the complete 
genome 


0.042 


1964 


AF075080 


Homo sapiens foil length insert cDNA YQ80D07 


0.000001 


1965 


D42118 


Tobacco gene for parAs protein, promoter region and partial cds 


0.009 


1966 


AF269350 


Staphylococcus epidermidis strain SRI clone step.l002dl2 genomic 
sequence 


2.3 


1967 


AF068904 


Staphylococcus aureus cell division protein FtsZ (ftsZ) gene, partial 
cds; YlmD (ylmD), YlmE (ylmE), YlmF (ylmF), YlmG (ylmG), and 
YlmH (ylmH) genes, complete cds; and cell division protein DivIVA 
(divIVA) gene, partial cds 


0.19 


1968 


AL589920 


Human DNA sequence from clone RP1 1-1 17A20 on chromosome 6, 
complete sequence [Homo sapiens] 


0.003 


1969 


AF027688 


Mus musculus survival motor neuron (Smn) gene, promoter region 
and partial cds 


0.56 


1970 


AL049340 


Homo sapiens mRNA; cDNA DKFZp564P056 (from clone 
DKFZp564P056) 


0.25 


1971 


AJ223385 


Fowlpox virus strain HP-440 DNA, isolate FP9, 14.6 kb fragment 


0.25 


1972 


U25354 


Human mitochondrial DNA control region, Indonesian, sequence 1 of 
60 


3E-65 


1973 


AF269868 


Staphylococcus epidermidis strain SRI clone step.l027g02 genomic 
sequence 


0.083 


1974 


AE003326 


Drosophila melanogaster genomic scaffold 142000013385598, 
complete sequence 


0.087 


1975 


XM_050217 


Homo sapiens RNA binding protein (LOC84549), mRNA 


2.1 


1976 


AE001402 


Plasmodium falciparum chromosome 2, section 39 of 73 of the 
complete sequence 


0.026 


1977 


XM 001935 


Homo sapiens forkhead box D2 (FOXD2), mRNA 


2.1 


1978 


AL590384 


Human DNA sequence from clone RP11-349A16 on chromosome 
Xq22.3-24, complete sequence [Homo sapiens] 


0.008 


1979 


AL1 17050 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen 
deprivation 


2.2 


1980 


XM 037529 


Homo sapiens KIAA0410 gene product (KIAA0410), mRNA 


2.3 


! 1981 


X56212 


Celegans gene for vitellogenin 


1.8 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1982 


AF231684 


Drosophila melanogaster outstretched (os) gene, complete cds 


0.7 


1983 


AB032264 


Danio rerio mRNA for glycogen synthase kinase 3 alpha, complete 
cds 


2.2 


1984 


AK022877 


Homo sapiens cDNA FLJ12815 fis, clone NT2RP2002546 


0.009 


1985 


U89140 


Mus musculus aldose reductase gene, promoter region and exon 1 


0.69 


1986 


M37036 


Rat nucleolar proteins B23.1 and B23.2 


0.086 


1987 


XMJ335498 


Homo sapiens amiloride-sensitive cation channel 2, neuronal 
(ACCN2), mRNA 


0.25 


1988 


U40944 


Caenorhabditis elegans cosmid PDB1, complete sequence 


0.025 


1989 


AF102762 


Dasylagon n. sp. 'Mardulyn & Whitfield 1 16S large subunit ribosomal 
RNA gene, partial sequence; mitochondrial gene for mitochondrial 
product 


0.24 


1990 


AF247631 


Maoricicada iolanthe cytochrome oxidase subunit I (COI) gene, 
partial cds; mitochondrial gene for mitochondrial product 


0.026 


1991 


Z81538 


Caenorhabditis elegans cosmid F45H10, complete sequence 


6.2 


1992 


Z72516 


Caenorhabditis elegans cosmid T25G3, complete sequence 


0.75 


1993 


AE002201 


Chlamydophila pneumoniae AR39, section 32 of 94 of the complete 
genome ■ 


6.6 ! 


1994 


NM 003816 


Homo sapiens a disintegrin and metalloproteinase domain 9 (meltrin 
gamma) (ADAM9), mRNA 


2E-17 


1995 


AL158132 


Human DNA sequence from clone RP3-323A24 on chromosome 4. 
Contains GSSs and a putative CpG island, complete sequence [Homo 
sapiens] 


0.074 


1996 


NM 023122 


Mus musculus glycoprotein m6b (Gpm6b), mRNA 


5.7 | 


1997 


AF014502 


Glycine max seed coat peroxidase precursor (Ep) gene, complete cds 


1.8 


1998 


AJ270226 


Entodinium caudatum partial mRNA for putative glycosyltransferase, 
clone L48 


2.1 


1999 


AF159913 


Euplotes crassus transposon Tecl clone Tecl-2 orf 2 and orf 3 
pseudogenes, complete sequence 


0.011 


2000 


! AF101309 


Caenorhabditis elegans cosmid H24G06, complete sequence 


2.2 


2001 


AB021240 


Hypseleotris compressus mitochondrial gene for Cytochrome b, 
complete cds 


0.022 


2002 


AC078798 


Homo sapiens 3 BAC PAC-56F1 1 (Roswell Park Cancer Institute 
Human BAC Library) complete sequence 


0.001 


2003 


AK021504 


Homo sapiens cDNA FLJ11442 fis, clone HEMBA1001327 


0.76 


2004 


AF3 14531 


Chilli leaf curl virus A component, complete sequence 


0.23 


2005 


XM_047794 


Homo sapiens KIAA1641 protein (KIAA1641), mRNA 


j e-123 


2006 


AC079136 


Homo sapiens clone RP11-153L12, complete sequence 


0.65 


2007 


AE001422 


Plasmodium falciparum chromosome 2, section 59 of 73 of the 
complete sequence 


0.001 


2009 


AJ306692 


Drosophila virilis ORF1, rpL14 gene and ORF2 


| 0.26 


2010 


AC084242 


Arabidopsis thaliana chromosome 1 BAC T24P22 genomic sequence, 
complete sequence 


0.72 | 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


2011 


Z16768 


H. sapiens (D12S90) DNA segment containing (CA) repeat; clone 
AFM172xd8; single read 


0.058 


2012 


AF3 11633 


Eimeria nieschulzi ORF470 gene, partial cds; apicoplast gene for 
apicoplast product 


1.9 


2013 


XM_012933 


Homo sapiens hypothetical protein FLJ11656 (FLJ11656), mRNA 


2.5 


2014 


AB035190 


Homo sapiens RHD gene, intron 2, complete sequence 


1.6 


2015 


NC_002135 


Pleurotus ostreatus mitochondrial plasmid mlpl, complete sequence 


0.22 


2016 


AF085619 


Elgaria paucicarinata NADH dehydrogenase subunit I (ND1) gene, 
partial cds; tRNA-Ile, tRNA-Gln, and tRNA-Met genes, complete 
sequence; NADH dehydrogenase subunit II (ND2) gene, complete cds; 
tRNA-Trp, tRNA-Ala, tRNA-Asn, tRNA-Cys, and tRNA-Tyr genes, 
O 


2.3 


2017 


D85424 


Homo sapiens gene for alphaSl-casein, 5flanking region 


0.077 


2018 


AE001138 


Borrelia burgdorferi (section 24 of 70) of the complete genome 


0.026 


2019 


BC007114 


Homo sapiens, clone IMAGE:4295422, mRNA 


0.23 | 


2020 


AF080510 


Homo sapiens mannose-binding protein gene, exon 4 and complete 
cds 


0.00001 


2022 


XM 027941 


Homo sapiens hypothetical protein DKFZp761J139 (DKFZp761J139), 
mRNA 


0.003 


2023 


Z68996 


H. sapiens mRNA for immunoglobulin kappa light chain VJ region 
(IDPOM021) 


0.15 


2024 


AB049191 


Mus musculus DNA, clone:lambda4/6 3 endogenous mnns? mammary 
tumor virus LTR region 


0.15 


2025 


U62057 


Mycoplasma capricolum NADH oxidase (naox) gene, partial cds, and 
lipoate-protein ligase (lpla), pyruvate dehydrogenase EI alpha subunit 
(odpa), pyruvate dehydrogenase EI beta subunit (odpb), pyruvate 
dehydrogenase KQ (odp2), dihydrolipoamide dehydroge> 


3 


2026 


AY018527 


Oryza sativa microsatellite MRG0852 containing (AT)X17, genomic 
sequence 


0.0003 


2027 


AF266084 


Phyllopetalia apicalis large subunit ribosomal RNA gene, partial 
sequence; tRNA-Valine gene, complete sequence; and small subunit 
ribosomal RNA gene, partial sequence; mitochondrial genes for 
mitochondrial products 


0.009 


2028 


AF249912 


Cucumis melo galactinol synthase (GAS1) gene, upstream sequence 
and partial cds 


0.021 


2029 


XM 048172 


Homo sapiens hypothetical protein DKFZp434F1017 
(DKFZP434F1017), mRNA 


6.5 


2030 


AF367436 


Erysimum cheiri chloroplast outer envelope membrane protein 
mRNA, complete cds 


1.2 


2031 


AF135026 


Homo sapiens kallikrein-like protein 3 (KLK9) gene, complete cds, 
alternatively spliced 


0.76 


2032 


Z69782 


A.thennophilum xylR gene, xynA gene and alfA gene 


0.0004 
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NO 
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P VALUE 


2033 


NM 013730 


Mus musculus signaling lymphocyte activation molecule (Slam), 
mRNA 


6.4 


2034 


NC 002811 


Tarsius bancanus mitochondrion, complete genome 


0.69 


2035 


AF184822 


Arabidopsis thaliana inhibitor tagged site ITS55 genomic sequence 


0.24 


2036 


AE006626 


Streptococcus pyogenes Ml GAS strain SF370, section 155 of 167 of 
the complete genome 


5.4 


2037 


AF324424 


Ichthyophthirins multifiliis immobilization antigen isoform (IAG52A) 
gene, complete cds 


0.003 


2038 


AL391223 


Human chromosome 14 DNA sequence Partial sequence from BAC R- 
325N7_PCR1 of library RPCI-1 1 from chromosome 14 of Homo 
sapiens (Human), complete sequence 


6.3 


2040 


AF369348 


Rabies virus V231.RB phosphoprotein (P) gene, complete cds 


1.4 


2041 


AF023666 


Apis mellifera sn-glycerol-3 -phosphate dehydrogenase (Gpdh) gene, 
complete cds 


0.25 


2042 


XM 011553 


Homo sapiens aldo-keto reductase family 1, member Dl (delta 4-3- 
ketosteroid-5-beta-reductase) (AKR1D1), mRNA 


0.001 


2043 


XM 004530 


Homo sapiens nuclear receptor subfamily 2, group E, member 1 
(NR2E1), mRNA 


5E-41 


2044 


AJ277602 


Gallus gallus partial mRNA for initiation of translation factor 2 (cIF2 
gene) 


0.72 


2045 


XM 041537 


Homo sapiens echinoderm microtubule-associated protein-like 
(EMAPL), mRNA 


1.7 


2046 


AE001429 


Plasmodium falciparum chromosome 2, section 66 of 73 of the 
complete sequence 


0.57 


2047 


L03188 


Saccharomyces cerevisiae integrin analogue gene, complete cds 


0.23 


2048 


XM 034810 


Homo sapiens similar to ALU SUBFAMILY SQ SEQUENCE 
CONTAMINATION WARNING ENTRY (H sapiens) (LOC90908), 
mRNA 


2E-25 


2049 


AE006570 


Streptococcus pyogenes Ml GAS strain SF370, section 99 of 167 of 
the complete genome 


0.64 


2050 


AY023977 


Oryza sativa microsatellite MRG6302 containing (ATTA)X6, closest 
to marker R1674, genomic sequence 


0.6 


2051 


M91384 


Dictyostelium discoideum thioredoxin (TRX1) mRNA, complete cds 


0.68 


2052 


XM 032748 


Homo sapiens KIAA1634 protein (KLAA1634), mRNA 


0.066 


2053 


AB004814 


Oryza sativa DNA for NLS receptor, complete cds 


0.009 


2054 


AF130049 


Homo sapiens clone FLB3411 PRO0852 mRNA, complete cds 


0.078 


2055 


Y19213 


Homo sapiens putative psihHbA pseudogene for hair keratin, exons 2 
to 7 


0.038 


2056 


AF305146 


Picea abies clone 1PABCA3F hypomethylated genomic sequence 


1.8 


2057 


Z67997 


Human DNA sequence from cosmid L206D7, Huntington's Disease 
Region, chromosome 4pl6.3 


0.0001 


2058 


U51024 


Human Down Syndrome region of chromosome 21 DNA 


0.0008 


2059 


AC092063 


Homo sapiens clone RP1 1-745H7, complete sequence 


0 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


2060 


AE002123 


Ureaplasma urealyticum section 24 of 59 of the complete genome 


L8 


2061 


Z68019 


HIV-1 DNA V3 region (homosexual H1031, seroconversion sample 
1987) 


0.25 


2062 


XM 006487 


Homo sapiens spectrin, beta, non-erythrocytic 2 (SPTBN2), mRNA 


0.000003 


2063 


AF329716 


Oncorhynchus mykiss clone OSU9 T cell receptor beta chain variable 
region gene, partial cds 


0.0008 


2064 


AC024773 


Caenorhabditis elegans cosmid Y40C7 A, complete sequence 


0.035 


2065 


XM_032748 


Homo sapiens KIAA1634 protein (KIAA1634), mRNA 


0.079 


2066 


M58526 


Human alpha-5 collagen type IV (COL4A5) mRNA, 3' end 


2.1 


2068 


X56697 


B.thuringiensis gene for sigma factor 35 


0.59 


2069 


D00768 


Pig pituitary glycoprotein hormone alpha subunit gene, exons 3 and 4 
and 3 'flank 


1.6 


2070 


AF110621 


Boophilus microplus 12S ribosomal RNA gene, partial sequence; 
tRNA-Ile, tRNA-Gln, and tRNA-Phe genes, complete sequence; and 
NADH dehydrogenase subunit 5 (ND5) gene, partial cds, 
mitochondrial genes for mitochondrial products 


0.21 


2071 


AC024779 


Caenorhabditis elegans cosmid Y43B1 1AL, complete sequence 


0.24 


2072 


AK023589 


Homo sapiens cDNA FLJ13527 fis, clone PLACE1006076 


0 


2073 


X77691 


W.suaveolens mitochondrial DNA intergenic region between ORF1 
and 15S rRNA genes 


0.24 


2074 


Z68749 


Caenorhabditis elegans cosmid F56H11, complete sequence 


1.4 


2075 


AF302113 


Solanum tuberosum glutamine synthetase GS2 (gin) mRNA, partial 
cds; nuclear gene for plastid product 


1.4 


2076 


U27078 


Trichaptum abietinum mitochondrial small subunit ribosomal RNA, 
mitochondrial gene, partial sequence 


0.64 


2077 


XM 012456 


Homo sapiens golgin-67 (KIAA0855), mRNA 


2.2 


2078 


AB032960 


Homo sapiens mRNA for KIAA1134 protein, partial cds 


1E-36 


2079 


XM 052376 


Homo sapiens hypothetical gene supported by AF106046; AK000332 
(LOC93587), mRNA 


0.66 


2080 


AB021866 


Homo sapiens KIP gene, complete cds 


1.3 


2081 


XM 029213 


Homo sapiens hypothetical protein DKFZp564O0523 
(DKFZP564O0523), mRNA 


0.38 


2082 


AF384143 


Triticum aestivum pathogenesis-related protein 1 mRNA, complete 
cds 


0.0003 I 


2083 


AE002368 


Neisseria meningitidis serogroup B strain MC58 section 10 of 206 of 
the complete genome 


0.71 


2084 


AE006662 


Sulfolobus solfataricus section 21 of 272 of the complete genome 


0.63 


2085 


AY020760 


Oryza sativa microsatellite MRG3085 containing (TA)X13, closest to 
marker R2976, genomic sequence 


6.9 


2086 


AF161419 


Homo sapiens HSPC301 mRNA, partial cds 


1.8 


2087 


AJ249838 


Gallus gallus mRNA for radixin (rdx gene) 


2.1 


2088 


U43536 


Coiynebacterium glutamicum heat shock, ATP-binding protein (clpB) 
gene, complete cds 


0.22 


2089 


XM 037588 


Homo sapiens KIAA0874 protein (KIAA0874), mRNA 


2E-34 
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NO 
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2090 


NC 002697 


Chiysomya chloropyga mitochondrion, complete genome 


0.077 


2091 


AF332207 


Caenorhabditis elegans clone yk509b7 nuclear receptor NHR-57 gene, 
partial cds 


0.45 j 


2092 


U39886 


Bos taurus Y-chromosome specific genomic sequence 


0.2 


2093 


Z19055 


B.aphidicola tryptophan operon 


0.26 


2094 


AJ290309 


Simethis mattiazzii chloroplast trnL-trnF intergenic spacer 


0.57 


2095 


AK013631 


Mus musculus adult male hippocampus cDNA, RIKEN full-length 
enriched library, clone:2900041A09, full insert sequence 


0.18 


2096 


AF015468 


Plasmodium falciparum microsatellite LT103 sequence 


0.11 | 


2097 


AJ277734 


Sus scrofa microsatellite sequence S0601 


0.4 


2098 


AF203341 


Glycine max chloroplast carboxyl transferase alpha subunit (accA-4) 
nuclear pseudogene, partial sequence; and putative steroid reductase 
gene, complete cds 


0.001 


2099 


XM_052597 


Homo sapiens KIAA1350 protein (KIAA1350), mRNA 


o [ 


2100 


Z96649 


Rsapiens telomeric DNA sequence, clone 6QTEL010, read 
6QTELOO010.seq 


0.007 


2101 


BC009105 


Mus musculus, Harvey rat sarcoma oncogene, subgroup R, clone 
MGC:6162 IMAGE:34833 13, mRNA, complete cds 


4.1 


2102 


AF245117 


Mus musculus leukocyte cell-surface molecule (Ly9) gene, exon 1 


0.025 


2103 


AL023810 


Caenorhabditis elegans cosmid C16D2, complete sequence 


0.008 


2104 


X17051 


E.gracilis DNA for ribosomal protein operon 


0.21 


2105 


AF005392 


Homo sapiens alpha tubulin (TUB A2) gene, partial cds 


0.63 | 


2106 


AE002798 


Drosophila melanogaster genomic scaffold 142000013385466, 
complete sequence 


0.025 


2107 


AE002114 


Ureaplasma urealyticum section 15 of 59 of the complete genome 


0.074 


2108 


L48039 


Fibrobacter succinogenes endo-l,4-beta-D-glucanase gene, complete 
cds, endo-l,4-beta-D-glucanase gene, 5* end of cds 


2 


2109 


D87922 


Rat DNA for 3UTR of skeletal muscle sodium channel, partial 
sequence 


0.71 


2110 


AF157137 


Gongronellabutleri 18S ribosomal RNA gene, partial sequence 


0.24 


2111 


j Z92970 


Caenorhabditis elegans cosmid H06O01, complete sequence 


0.24 j 


2112 


AB035494 


Alternaria alternate gene for Akt3-2, complete cds 


0.61 


2113 


M38351 


Rat embryonic glutamic acid decarboxylase gene, embryonic stop (ES) 
exon 


0.67 


2114 


X16715 


D. melanogaster gene for chorion protein sl6 


0.21 


2115 


U16271 


Homo sapiens AMP deaminase isoform L (AMPD2) gene, exons 1 A 
and IB 


0.07 


2116 


AF3 17672 


Helicobacter pylori isolate 4424 vacuolating cytotoxin VacA (vacA) 
gene, partial cds 


0.24 


2117 


NM 010250 


Mus musculus gamma-aminobutyric acid (GAB A-A) receptor, subunit 
alpha 1 (Gabral), mRNA 


0.76 


2118 


BC010786 


Mus musculus, Similar to CREB/ATF family transcription factor, 
clone MGC: 18836 IMAGE:4211480, mRNA, complete cds 


0.69 
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2119 


U67482 ] 


Methanococcus jannaschii section 24 of 150 of the complete genome 


0.24 


2120 


U73373 


EflV-1 clone 1063-2-VPRvpr gene, complete cds 


2 


2121 


AE007370 


Streptococcus pneumoniae section 53 of 194 of the complete genome 


0.68 


2122 


AL449163 


Human DNA sequence from clone RP1 1-533E16 on chromosome 6, 
complete sequence [Homo sapiens] 


2E-11 


2123 


X63382 


Antithamnion sp. rhodoplast genes atpl, atpH, atpG, atpF, atpD, atpA, 
orfl, orf2 and orG 


0.081 


2124 


AF186797 


Homo sapiens RalGDS-like (RGL) gene, exon 17 


0.009 


2125 


AF110796 


Rattus norvegicus GABAB1 receptor (GABABR1) gene, exons 1 
through 9 


4.2 


2126 


XM_038798 


Homo sapiens KIAA1457 protein (KIAA1457), mRNA 


2E-96 


2127 


U96998 


Homo sapiens MET proto-oncogene, intron 19, 3' end 


0.67 


2128 


AE006410 


Lactococcus lactis subsp. lactis IL1403 section 172 of 218 of the 
complete genome 


0.084 


2129 


AB050422 


Macaca fascicularis brain cDNA, clone:QnpA-21421 


6.9 


2130 


AF375597 


Mus musculus medium and short chain L-3 -hydroxyacyl-Coenzyme A 
dehydrogenase (Mschad) gene, exons 2 through 8, and complete cds; 
nuclear gene for mitochondrial product 


2.2 


2131 


AE002758 


Drosophila melanogaster genomic scaffold 142000013385931, 
complete sequence 


0.26 


2132 


AJ008105 


Chrysolina timarchoides COI gene, partial CDS 


0.031 


2133 


U67526 


Methanococcus jannaschii section 68 of 150 of the complete genome 


2 


2134 


AJ223966 


Mus musculus NAGA gene 


0.63 


2135 


AF016986 


Callicebus moloch gamma 1-globin gene, complete cds 


0.68 


2136 


U66872 


Danio rerio enhancer of rudimentary homolog mRNA, complete cds 


0.029 ! 


2137 


1 AJ271220 


Homo sapiens partial gene for dystrophin, intron 49 


0.009 


2138 


AF075602 


Canis familiaris prostaglandin E2 receptor subtype EP2 mRNA, 
complete cds 


0.23 


2139 


XM 045015 


Homo sapiens hypothetical protein FU00052 (FLJ00052), mRNA 


0.75 


2140 


AY034614 


Danio rerio Sec61 alpha form B mRNA, complete cds 


0.1 


2141 


XM_028150 


Homo sapiens similar to MACROPHAGE MANNOSE RECEPTOR 
PRECURSOR (H. sapiens) (LOC90010), mRNA 


1.9 


2142 


M24026 


Rat MHC class I RT1 (RT44) mRNA (u haplotype), 3' end 


0.24 


2143 


AK010194 


Mus musculus adult male tongue cDNA, RIKEN full-length enriched 
hbrary, clone:2310076E16, full insert sequence 


2 


2144 


AF165062 


Hepatitis C virus strain MD9-2 complete genome 


1.2 


2145 


AE007469 


Streptococcus pneumoniae section 152 of 194 of the complete genome 


2.1 


2146 


Z97349 


Plasmodium falciparum DNA *** SEQUENCING IN PROGRESS 
*** from contig 3-06, complete sequence 


0.046 


2147 


NM 010161 


Mus musculus ecotropic viral integration site 2 (Evil), mRNA 


0.02 
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2148 


AE005586 


Escherichia coli 0157:H7 EDL933 genome, contig 3 of 3, section 205 
of 290 


L2 


2149 


AF332928 


Psacothea hilaris isolate HA15 16S ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


0.069 ! 


2150 


D49504 


Borrelia garinii gene for outer surface protein C, complete cds 


0.37 


2151 


L25647 


Homo sapiens fibroblast growth factor receptor gene (located in the 
central MHC) signal peptide and consecutive exon 


0.05 


2152 


AB040746 


Carassius auratus gene for c-MYC, complete cds 


0.063 


2153 


X55978 


T. brucei ESAG 8 gene for a leucine-rich repeat family protein 


0.12 


2154 


AB010426 


Phytoplasma sp. gene for AL1 like protein, complete cds 


0.009 


2155 


XM_005841 


Homo sapiens cdk inhibitor p21 binding protein (TOK-1), mRNA 


5.8 


2156 


AK025222 


Homo sapiens cDNA: FLJ21569 fis, clone COL06508 


1E-33 


2157 


AF234652 


Mesembryanthemum ciystallinum protein kinase MK5 mRNA, 
complete cds j 


0.2 


2158 


AE006513 


Streptococcus pyogenes Ml GAS strain SF370, section 42 of 167 of 
the complete genome 1 


9.2 


2159 


XM 003660 


Homo sapiens calcium modulating ligand (CAMLG), mRNA 


0.023 I 


2160 


AF003490 


Mesostoa kerri 16S ribosomal RNA gene, partial sequence 


0.000004 


2161 


XMJ)48768 


Homo sapiens tetratricopeptide repeat domain 3 (TTC3), mRNA 


6.7 


2162 


AF323442 ! 


Plasmodium yoelii yoelii clone 10 235 kDa rhoptiy protein (IIIa.l) 
gene, partial cds 


0.019 


2163 


AK016560 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4932441H21, full insert sequence 


4.7 


2164 


Y12332 


E.tenellaplastid genes ipsl2, rps7, tuf, ORF45, and tRNA-Phe 


0.003 


2165 


AL1 14081 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen 
deprivation 


0.002 


2166 


AF332207 


Caenorhabditis elegans clone yk509b7 nuclear receptor NHR-57 gene, 
partial cds 


0.49 


2167 


AF102263 


Helicoverpa zea clone S211 cytochrome P450 (CYP6B8) mRNA, 
complete cds 


0.074 


2168 


AL133397 


Human DNA sequence from clone RPl-213Jlp on chromosome 
22ql 1.21-12.2, complete sequence [Homo sapiens] 


0.039 


2169 


AL157423 


Homo sapiens mRNA; cDNA DKFZp761O0511 (from clone 
DKFZp761O0511) 


7E-15 


2170 


AF282578 


Homo sapiens clone 20ptel_cl622_12_3.3f sequence 


9E-17 


2171 


AB022280 


Oryzias latipes gene for soluble guanylyl cyclase alpha subunit, 
complete cds 


3.7 


2172 


AF052744 


Homo sapiens transcriptional repressor p54 gene, promoter region and 
partial cds 


2 


2173 


XM 007685 


Homo sapiens MAGE-like 2 (MAGEL2), mRNA 


1 0.008 


2174 


AK019539 


Mus musculus adult male testis cDNA, RKEN full-length enriched 
library, clone:4921514G21, full insert sequence 


1.6 


2175 


AE007472 


Streptococcus pneumoniae section 155 of 194 of the complete genome 


6.2 


2176 


AL162381 


Human DNA sequence from clone RP1 1-174B19 on chromosome 6, 
complete sequence [Homo sapiens] 


0.059 I 
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2177 


M21836 


Mouse extra-embryonic endodermal cytokeratin type n (EndoA) 
mRNA, complete cds 


e-138 


2178 


X06292 


Human c-fes/fps proto-oncogene 


1.7 


2179 


AF360278 


Arabidopsis thaliana putative MLH1 protein (AT4g09140) mRNA, 
complete cds 


0.0001 


2180 


X78548 


G.max gene encoding epoxide hydrolase 


0.68 


2181 


Z56222 


H.sapiens CpG island DNA genomic Msel fragment, clone 97b6, 
reverse read cpg97b6.rtla 


5E-60 


2182 


AF200412 


Wiebesia brusi cytochrome oxidase subunit I (COT) gene, partial cds; 
tRNA-Leu gene, complete sequence; and cytochrome oxidase subunit 
II (COII) gene gene, partial cds; mitochondrial genes for 
mitochondrial products 


0.084 


2183 


AF360327 


Arabidopsis thaliana putative metal-binding protein 
(MLE2.16/AT5g63530) mRNA, complete cds 


0.014 


2184 


XM 005837 


Homo sapiens CUG triplet repeat, RNA-binding protein 2 (CUGBP2), 
mRNA 


0.22 


2185 


BC008152 


Mus musculus, caspase 1, clone MGC:6106 IMAGE:3583883, 1 
mRNA, complete cds 


1.9 


2186 


U41289 ! 


Dictyostelium discoideum K7 kinesin-like protein mRNA, complete 
cds 


0.66 


2187 


AF163863 


Mustela vison tyrosine aminotransferase gene, complete cds 


0.23 


2188 


XM_017198 


Homo sapiens hypothetical protein FLJ12085 0FLJ12O85), mRNA 


4E-36 


2189 


D16180 


Human PMP2 gene for peripheral myelin protein 2, exon 2 and 3 


0.68 ( 


2190 


AB037920 


Clostridium botulinum genes for ORF-22, HA-70, HA- 17, HA-33, 
NTNHA, neurotoxin, complete cds 


0.083 


2191 


! AE006703 


Sulfolobus solfataricus section 62 of 272 of the complete genome 


1.6 


2192 


AB047401 


Periplaneta americana Vg-2 mRNA for vitellogenin-2, complete cds 


0.25 


2193 


U11491 


Hitman rotavirus M37 non-structural protein NSP1 mRNA, complete 
cds 


0.17 


2194 


AF338818 


Mus musculus flavohemoprotein b5/b5R mRNA, complete cds 


2.3 


1 2195 


AP001290 


Mus musculus genomic DNA, chromosome 7, clone:B118_5, 
complete sequence 


0.56 


2196 


XM 010940 


Homo sapiens putative GR6 protein (GR6), mRNA 


2E-61 


2197 


AF1 19554 


Plasmodium falciparum para-aminobenzoic acid synthetase gene, 
complete cds 


0.009 


2198 


AY035995 


Streptococcus pneumoniae strain SPN1506 topoisomerase IV subunit 
A (parC) gene, complete cds 


1.2 


2199 


XM 006892 


Homo sapiens c AMP responsive element binding protein-like 2 
(CREBL2), mRNA 


0.049 


2200 


X87127 


Borrelia burgdorferi repeated DNA element, 30.5 kb circular plasmid 
copy 


0.46 


2201 


AB010958 


Clostridium thermocellum xynA and xynB genes for xylanase, 
complete cds 


0.23 
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2202 


AE003004 


Drosophila melanogaster genomic scaffold 142000013385492, 
complete sequence 


0.23 


2203 


U63418 


Mus musculus uncoupling protein (Ucp) gene, nuclear gene encoding 
mitochondrial protein, complete cds 


0.23 


2204 


AJ243265 


Homo sapiens partial PGM1 gene for phosphoglucomutase 1, exons 5- 
7 


e-135 


2205 


X13883 


Yersinia pseudotuberculosis virulence plasmid pIBI yopA gene for 
Yopl protein 


2.1 


2206 


XM__011639 


Homo sapiens KIAA0535 gene product (KIAA0535), mRNA 


8E-22 


2207 


AF098990 


Caenorhabditis elegans cosmid H35N03 i 


2.2 


2208 


AB037920 


Clostridium botulinum genes for ORF-22, HA-70, HA-17, HA-33, 
NTNHA, neurotoxin, complete cds 


0.2 | 


2209 


AB053087 


Human immunodeficiency virus type 1 pol gene, reverse transcriptase 
amino terminal part, partial cds, 99JP-NH3-II plasma virus 
clone:NH3RT-V3-2«9 


0.028 


2210 


NM 008989 


Mus musculus purine rich element binding protein A (Pura), mRNA 


0.028 


2211 


U97291 


Equus caballus random genomic clone JH226 


0.008 


2212 


AK001699 


Homo sapiens cDNA FLJ10837 fis, clone NT2RP4001260, highly 
similar to Homo sapiens mRNA for KLAA0875 protein 


0.016 


2213 


U84532 


Human dystrobrevin (DTN) gene, exon 4 


0.23 


2214 


Z54522 


H.sapiens CpG island DNA genomic Msel fragment, clone 12b6, 
forward read cpgl2b6.ftlc 


1.5 


2215 


Z69666 


Human DNA sequence from cosmid 24F8 from a contig from the tip 
of the short arm of chromosome 16, spanning 2Mb of 16pl3.3. 
Contains ESTs, repeat polymorphism and CpG island 


6.5 


2216 


M69205 


Ockelbo virus complete genome 


2.2 


2217 


AF202552 


Homo sapiens DNA methyltransferase (DNMT1) gene, exons 2, 3, 
and 4 


0.24 


2218 


D16541 


Chicken DNA for connectin(titin), partial sequence 


0.064 


2220 


AC023351 


Homo sapiens clone RP1 1-637 J20, complete sequence 


1E-59 


2221 


AL5 12305 


Human DNA sequence from clone RP11-85E24 on chromosome 6, 
complete sequence [Homo sapiens] 


0.002 


2222 


U67581 


Methanococcus jannaschii section 123 of 150 of the complete genome 


5 


2223 


AE007428 


Streptococcus pneumoniae section 111 of 194 of the complete genome 


1.9 


2224 


AK023601 


Homo sapiens cDNA FLJ13539 fis, clone PLACE1006640 


1.7 


2225 


AF375468 


Homo sapiens endothelial protein C receptor (PROCR) gene, complete 
cds 


0.016 


2226 


AF350726 


Symphyglossum sanguineum chloroplast tRNA-Leu (trriL) gene and 
trnL-trnF intergenic spacer, partial sequence 


1.4 


2227 


NM_008308 


Mus musculus 5-hydroxytryptamine (serotonin) receptor 1A (Htrla), 
mRNA 


0.23 


2228 


Z62421 


H.sapiens CpG island DNA genomic Msel fragment, clone 68h2, 
forward read cpg68h2.ftla 


4E-68 
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2229 


AF2 10834 ] 


Homo sapiens HARP (HARP) gene, exons 3 and 4 


0.008 


2230 


1 

L81672 < 


Homo sapiens (subclone 2_bl2 from PI H49) DNA sequence, 
complete sequence 


0.13 


2231 


] 

XM 008431 


Homo sapiens integrin, alpha 3 (antigen CD49C, alpha 3 subunit of 
VLA-3 receptor) (ITGA3), mRNA 


1.8 


2232 


AE006295 


Lactococcus lactis subsp. lactis IL1403 section 57 of 218 of the 
complete genome 


0.19 


2233 


U39693 


Mycoplasma genitalium section 15 of 51 of the complete genome 


0.21 J 


2234 


AL032629 


Caenorhabditis elegans cosmid Y102F5A, complete sequence 


0.46 


2235 


AF029114 


Ascogaster sp. 16S ribosomal RNA gene, mitochondrial gene for 
mitochondrial RNA, partial sequence 


0.076 


2236 


AF150036 


Hyalomma dromedarii 12S small ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


0.63 


2237 


AF013711 


Homo sapiens 22 kDa actin-binding protein (SM22) gene, complete 
cds 


0.6 i 


2238 


XM 035961 


Homo sapiens hypothetical protein FLJ10849 (FLJ10849), mRNA 


0.77 


2239 


AJ290957 


Drosophila erecta partial eIF2g gene for eukaiyotic translation 
initiation factor 2 gamma and Su(var)3-9 gene for heterochromatin 
protein 


0.24 


2240 


AF282004 


Homo sapiens clone 15qtel c366at7 sequence 


8.2 


2241 


AF104915 


Plasmodium falciparum protein serine/threonine kinase- 1 (pskl) gene, 
complete cds 


0.009 


2242 


M22064 


Mouse MHC class I Lyt-2-a gene encoding lyt-2.1 T-cell surface 
alloantigen, complete cds 


0.049 


2243 


AF265368 


Hypseleotris compressa 12S ribosomal RNA gene, complete sequence; 
mitochondrial gene for mitochondrial product 


0.21 


2245 


AE001397 


Plasmodium falciparum chromosome 2, section 34 of 73 of the 
complete sequence 


0.024 


2246 


AK004676 


Mus musculus adult male lung cDNA, RIKEN full-length enriched 
library, clone: 1200009124, full insert sequence 


0.56 


2247 


XM 002436 


Homo sapiens sterile-alpha motif and leucine zipper containing kinase 
AZK (ZAK), mRNA 


0.67 


2248 


AE006324 


Lactococcus lactis subsp. lactis IL1403 section 86 of 218 of the 
complete genome 


0.47 


2249 


XM_042833 


Homo sapiens KIAA0295 protein (KIAA0295), mRNA 


5.1 


2250 


X63412 


B.burgdorferi plasmid ospA gene for outer surface protein A 


0.36 


2251 


AK024235 


Homo sapiens cDNA FLJ14173 fis, clone NT2RP2002755 


1.8 


2252 


U69826 


Eryx miliaris nogajorum cytochrome b (cytb) gene, mitochondrial 
gene encoding mitochondrial protein, partial cds 


5.9 


2254 


AF038086 


HTV-1 isolate TZB0036 from Tanzania, gpl20 C2-C4 region (env) 
gene, partial cds 


0.077 


2255 


AF3 17082 


Adoncholaimus thalassophygas isolate Sou39 large subunit ribosomal 
RNA gene, partial sequence; mitochondrial gene for mitochondrial 
product 


0.003 


2256 


NM_005386 


Homo sapiens neuronatin (NNAT), mRNA 


0.6 
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2257 


XM_037301 


Homo sapiens hypothetical protein FLJ12697 (FLJ12697), mRNA 


2E-61 


2258 


XM 028046 


Homo sapiens KIAA0220 protein (KIAA0220), mRNA 


5E-61 


2259 


U94520 


Lactococcus lactis plasmid pND861 abortive phage resistance proteins 
(abiLi) and (abiLii) genes, complete cds 


0.81 


2260 


U00037 


Caenorhabditis elegans cosmid T20H4 


0.081 


2261 


Z74866 


S.cerevisiae chromosome XV reading frame ORF YOL124c 


0.55 


2262 


Z92779 


Caenorhabditis elegans cosmid C44E1, complete sequence 


0.7 


2263 


V00694 


Saccharomyces cerevisiae mitochondrion COX/OXI3 gene encoding 
for cytochrome oxidase subunit 1 


0.021 


2264 


AC010728 


Homo sapiens BAC clone RP11-258E22 fromY, complete sequence 


3E-30 


2265 


AJ406392 


Beta vulgaris partial En/Spm-like transposon, clone 44.9 


0.054 


2266 


AE001430 


Plasmodium falciparum chromosome 2, section 67 of 73 of the 
complete sequence 


0.064 


2267 


AF378270 


Pangasius sp. ZZSH-2001 microsatellite PSP-G 456 sequence 


1.1 


2268 


XM_047227 


Homo sapiens hypothetical gene supported by AL122112 
(LOC92776), mRNA 


0.38 


2269 


AE002758 ' 


Drosophila melanogaster genomic scaffold 142000013385931, 
complete sequence 


0.2 


2270 


AL1 15861 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen 
deprivation 


0.14 


2271 


AE006407 


Lactococcus lactis subsp. lactis IL1403 section 169 of 218 of the 
complete genome 


1.8 


2272 


XM 049351 


Homo sapiens KIAA1600 protein (KIAA1600), mRNA 


0.0007 


2273 


AE005977 


Caulobacter crescentus section 303 of 359 of the complete genome 


5.2 


2274 


XM 007057 


Homo sapiens chromosome 12 open reading frame 5 (C12orf5), 
mRNA 


0.63 


2275 


XM 031923 


Homo sapiens hypothetical gene supported by AK023761 
(LOC90466), mRNA 


0.0000007 


2276 


j AJ005330 


pGAII(-) SK positive selection cloning vector gltS gene 


3E-21 1 


2277 


XM_029246 


Homo sapiens collagen, type I, alpha 2 (COL1A2), mRNA 


0.051 


2278 


AF185279 


Homo sapiens polymerase delta small subunit (POLD2) gene, partial 
sequence 


0.002 


2279 


I AF080592 


Mus musculus centrin (Cetn2) gene, complete cds 


1.8 


2280 


XM 033626 


Homo sapiens nucleoporin-like protein 1 (NLP_1), mRNA 


3.6 


2281 


ABO 11667 


Ipomoea purpurea genes for dihydroflavonol 4-reductase, complete 
cds 


0.25 


2282 


AB011667 


Ipomoea purpurea genes for dihydroflavonol 4-reductase, complete 
cds 


0.25 


2283 


XM 002698 


Homo sapiens LOC89635 (LOC89635), mRNA 


6.3 


2284 


1 BC009385 


Homo sapiens, clone IMAGE:4 127835, mRNA 


0.61 


2285 


U39718 


Mycoplasma genitalium section 40 of 51 of the complete genome 


3.4 


2286 


AB036755 


Schizosaccharomyces pombe spol4/stll gene for Secl2-like protein, 
complete cds 


0.079 



298 



WSDOCID: <WO 0214500A2_I_> 



WO 02/14500 



PCT/US01/25840 





Table 3A NearestNeighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


2287 


AF369960 


Sepiella maindroni isolate c 16S ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


0.22 


2288 


AE005360 


Escherichia coli 0157:H7 EDL933 genome, contig 2 of 3, section 29 
of 55 


0.6 


2289 


U01034 


Eastern equine encephalomyelitis virus North American antigenic 
variety nonstructural polyprotein and structural polyprotein genes, 
complete cds 


1.7 


2290 


AF355103 


Pleurotus ostreatus linear mitochondrial plasmid mlp2, partial 
sequence 


5.7 


2291 


AC000043 


Homo sapiens Chromosome 22ql3 Cosmid Clone p74a8, complete 
sequence 


0.085 


2292 


X89398 


H. sapiens ung gene for uracil DNA-glycosylase 


0.71 


2293 


AY008264 


Yersinia enterocolitica phage shock protein locus, complete sequence 


2.2 


2294 


AJO 11320 


DrosoDhila melano&aster SufPi and anon-73Bl eenes and nartial o25 
gene and Pros26 gene 


6.5 


2295 


XM_040290 


Homo sapiens glutamate receptor, ionotropic, AMPA 1 (GRIA1), 
mRNA 


4 


2296 


AJ293578 


Homo sapiens partial MOCS1 gene, exons 2-8 


6.3 


2297 


BC007051 ! 


Homo sapiens, clone MGC: 12465 1MAGE:3683051, mRNA, 
complete cds 


7E-32. 


2298 


M23103 


B.subtilis spoIVCB gene, complete cds 


0.2 


2299 


AF280606 


Triticum aestivum omeea diadin Dseudoeene conm-lete seauence 


0.24 


2300 


AF266095 


Platycnemis pennipes large subunit ribosomal RNA gene, partial 

^efliience* tRNA-Vflline ffene efvmnlete ^emif*Tiee* anH omall QiiHiinit 

OkA^UvllUvj L1V1 "lii. V CM 1 111V gwiiv, vVJllipiL'lL/ Ow^llMlwVj ClXlvl o 1X1X1X1 OUUUIUI 

ribosomal RNA gene, partial sequence; mitochondrial genes for 
mitochondrial products 


0.077 


2301 


AF1 14927 


Saccharomyces pastorianus CBS 1538 small subunit ribosomal RNA 
gene, mitochondrial gene for mitochondrial RNA, complete sequence 


0.23 


2302 


M68901 


P.multocida adenylate cyclase (cya) gene, complete cds 


0.23 


2303 


AJ235878 


Amorphophallus kiusiuensis chloroplast atpB-rbcL spacer, isolate PL1 


0.23 


2304 


NM 016158 


Homo sapiens erythrocyte transmembrane protein (LOC51145), 
mRNA 


5E-34 


2305 


Y17254 


Euplotes octocarinatus gamma-tubulin 2 micronuclear gene 


2.3 


2306 


X99872 


Sulfolobus acidocaldarius pyrB gene 


0.033 


2307 


X81001 


Ksapiens HCG II mRNA 


0.001 


2308 


Z80213 


Caenorhabditis elegans cosmid C09E9, complete sequence 


0.025 


2309 


AF257022 


Salmo salar clone BHMS7-009 microsatellite sequence 


0.74 


2310 


AL138748 


Human DNA sequence from clone RP4-732E19 on chromosome 
Xq21.1-21.33 Contains GSSs, complete sequence [Homo sapiens] 


0.21 


2311 


AJ409108 


Entamoeba histolytica rpl27a-2 gene for ribosomal protein large 
subunit 27a-2, exons 1-2 


0.009 


2312 


U26556 


Human ferritin H (3FTHL13) pseudogene 


0.008 
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P VALUE 


2313 


AF126320 


Protoclythia modesta 12S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


0.26 


2314 


AL137590 


Homo sapiens mRNA; cDNA DKFZp434K0610 (from clone 
DKFZp434K0610) 


2 


2315 


Z81471 


Caenorhabditis elegans cosmid C14B4, complete sequence 


0.029 


2316 


AE001567 


Helicobacter pylori, strain J99 section 128 of 132 of the complete 
genome 


0.028 


2317 


U47913 


Tribolium freemani Woot retrotransposon, partial sequence and 3' 
insertion junction F3 (1) 


0.081 


2318 


XM 037521 


Homo sapiens centrosomal P4.1 -associated protein; uncharacterized 
bone marrow protein BM032 (BM032), mRNA 


2.2 


2319 


AF269701 


Staphylococcus epidermidis strain SRI clone step.l017g02 genomic 
sequence 


0.7 


2320 


AF378183 


Oryza sativa cultivar Milyang23 cytosolic fructose- 1 mRNA, partial 
cds I 


6.9 


2321 


AYO 19090 


Oiyza sativa microsatellite MRG1415 containing (AT)X26, genomic 
sequence 


0.085 


2322 


AL449163 


Human DNA sequence from clone RP1 1-533E16 on chromosome 6, 
complete sequence [Homo sapiens] 


0.00004 


2323 


M19540 


Adenovirus type 41 DNA-binding protein (DBP) gene, exons 2 and 3, 
23K protease, complete cds, hexon and 100K protein genes, last exons 


0.026 


2324 


AJ272029 


Homo sapiens partial CD30 gene for cytokine receptor CD30 and 
promoter region 


0.003 


2325 


AF082966 


Homo sapiens genomic sequence 


0.75 


2326 


AE002940 


Drosophila melanogaster genomic scaffold 142000013385820, 
complete sequence 


0.028 


2327 


Z54147 


Human DNA sequence from cosmid L129H7, Huntington's Disease 
Region, chromosome 4pl6.3 contains CpG island 


0.001 


2328 


D12590 


Rattus norvegicus DNA, RFLP marker at locus A403 


0.077 


2329 


XMJ)35524 


Homo sapiens KIAA1201 protein (KIAA1201), mRNA 


0.025 I 


2330 


XM 003575 


Homo sapiens soluble liver antigen/liver pancreas antigen 
(LOC51091), mRNA 


4E-27 


2331 


BC009503 


Homo sapiens, Gl to S phase transition- 1, clone MGC:1735 
IMAGE:2822947, mRNA, complete cds 


0.69 


.2332 


BC005572 


Mus musculus, golgi vesicular membrane trafficking protein pl8, 
clone MGC:11601 IMAGE:3967968, mRNA, complete cds 


0.78 


2333 


U39726 


Mycoplasma genitalium section 48 of 51 of the complete genome 


0.086 


2334 


XM 039942 


Homo sapiens Meis (mouse) homolog 3 (MEIS3), mRNA 


0.72 


2335 


X67506 


C.thermocellum ancA gene 


0.084 


2336 


AL1 17553 


Homo sapiens mRNA; cDNA DKFZp564F1171 (from clone 
DKFZp564F1171) 


0.22 


2337 


AF115574 


Pisum sativum pathogenesis-related protein (DRR206-c) gene, 
complete cds 


0.009 


2338 


Y11874 


M. musculus uPA gene, promoter sequence 


2.3 
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2339 


BC008277 


Mus musculus, Similar to Endothelin receptor type A, clone 
MGC:6567 IMAGE:2812426, mRNA, complete cds 


2.2 


2340 


XM_009543 


Homo sapiens transcription factor AP-2 gamma (activating enhancer- 
binding protein 2 gamma) (TFAP2C), mRNA 


0.21 • 


2341 


NM015S09 


Mus musculus keratin-associated protein 5-4 (Krtap5-4), mRNA 


0.42 


2342 


AE000813 ; 


Methanobacterium thermoautotrophicum from bases 206991 to 
217929 (section 19 of 148) of the complete genome 


5.2 


2343 


XM 034632 


Homo sapiens hypothetical protein PR02832 (PR02832), mRNA 


0.0000005 


2344 


XM_034229 


Homo sapiens similar to hypothetical protein FLJ14260 (EL sapiens) 
(LOC90803), mRNA 


0.069 


2345 


XM 033361 


Homo sapiens 62537 (FLJ00024), mRNA 


0.008 


2346 


BC005332 


Homo sapiens, Similar to immunoglobulin kappa constant, clone 
MGC: 12418 IMAGE:3934658, mRNA, complete cds 


3E-33 


2347 


AF321120 


Mus musculus gap junction membrane channel protein alpha 8 (Gja8) 
gene, complete cds 


2.1 


2348 


AB054512 


Berardius bairdii DNA, SINE flanking sequence Tuti35 locus 


0.77 


2349 


Z98549 


Plasmodium falciparum DNA *** SEQUENCING IN PROGRESS 
*** from contig 3-14, complete sequence 


0.029 


2351 


XM_033483 


Homo sapiens hypothetical gene supported by L39957; L43086; 
M85256; M99603; S56182; S73911; U07989; U79763; U96290; 
U96294; U96397; X58082; Z18327; AB006847; AB027436; 
AB027438; AF007572; AF035035; AF052535; AF099196; 
AF103384; AF103390; AF103466; AF10> 


8E-33 


2352 


XM 033483 


Homo sapiens hypothetical gene supported by L3 9957; L43086; 
M85256; M99603; S56182; S73911; U07989; U79763; U96290; 
U96294; U96397; X58082; Z18327; AB006847; AB027436; 
AB027438; AF007572; AF035035; AF052535; AF099196; 
AF103384; AF103390; AF103466; AF10> 


1E-27 


2353 


XM 033483 


Homo sapiens hypothetical eene sunnorted bv L39957* L43086* 
M85256; M99603; S56182; S73911; U07989; U79763; U96290; 
U96294; U96397; X58082; Z18327; AB006847; AB027436; 
AB027438; AF007572; AF035035; AF052535; AF099196; 
AF103384; AF103390; AF103466; AF10> 


8E-33 


2354 


XM 043728 


Homo sapiens hypothetical gene supported by AF074988 
(LOC92227), mRNA 


0.001 


2355 


AE001425 


Plasmodium Mciparum chromosome 2, section 62 of 73 of the 
complete sequence 


6.4 


2356 


XM 033483 


Homo sapiens hypothetical gene supported by L39957; L43086; 
M85256; M99603; S56182; S73911; U07989; U79763; U96290; 
U96294; U96397; X58082; Z18327; AB006847; AB027436; 
AB027438; AF007572; AF035035; AF052535; AF099196; 
AF103384; AF103390; AF103466; AF10> 


9E-14 


2357 


AF184072 


Homo sapiens target of methylation-induced silencing 1 (TMS1) gene, 
complete cds 


6.6 
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2358 


XMJ)32347 


Homo sapiens region containing TLS-associated serine-arginine 
protein 1; TLS-associated serine-arginine protein 2; TLS-associated 
serine-arginine protein 2; TLS-associated serine-arginine protein 2; 
TLS-associated serine-arginine protein 1; TLS-associa> 


0 


2360 


X54807 


Human CYP2C8 gene for cytochrome P-450, 5* flank and exon 1 


0.00003 


2361 


X67506 


C.thermocellum ancA gene 


0.054 


2362 


AL513350 


Human DNA sequence from clone RP1 1-509D8 on chromosome 9, 
complete sequence [Homo sapiens] 


0.003 


2363 


AF020663 


Homo sapiens myotubularin (MTM1) gene, promoter and exon 1 


0.068 


2364 


U43626 


Human chromosome 15ql l-ql3 putative DNA replication origin in 
the g-aminobutyric acid receptor b3 and a5 gene cluster 


0.059 


2365 


AY019090 


Oiyza sativa microsatellite MRG1415 containing (AT)X26> genomic 
sequence 


0.081 


2366 


U27586 


Cytophaga heparina heparinase IE (HepC) gene, complete cds 


0.38 | 


2367 


XM 047192 


Homo sapiens similar to MYOSIN HEAVY CHAIN, CARDIAC 
MUSCLE ALPHA ISOFORM (MYHC-ALPHA) (M. musculus) 
(LOC92771), mRNA 


e-133 


2368 


M20319 


Sxerevisiae ATR1 gene conferring aminotriazole resistance, complete 
cds 


1.3 


2369 


XM 051036 


Homo sapiens carbohydrate (keratan sulfate Gal-6) sulfotransferase 1 
(CHST1), mRNA 


4.6 


2370 


AB062951 


Macaca fescicularis brain cDNA clone:QmoA-12201, full insert 
sequence 


1.5 


2371 


U29335 


Pisum sativum clone MFRTPCR62 wound-inducible cytochrome P450 
mRNA, partial cds 


0.41 


2372 


AF272630 


Ureaplasma urealyticum serovar 14 23 S ribosomal RNA and 5S 
ribosomal RNA genes, complete sequence; and dihydrofolate 
reductase (folA) gene, partial cds 


0.061 


2373 


AF179375 


Mycoplasma fermentans orfDl gene, Insertion sequence IS1630D, 
orfD2 gene 


0.61 


2374 


X16876 


Soybean ENOD2B gene for Ngm-75 


0.96 


2375 


AF263831 


Homo sapiens chromosome 11 DNasel hypersensitive region 


2.7 


2376 


AF157162 


Syzygites megalocarpus 18S ribosomal RNA gene, partial sequence 


5.4 


2377 


AJ308588 


Medicago truncatula partial mRNA for nodulin 25 (nod25 gene), 
clone 3 


4.6 


2378 


X02870 


Bovine gene for cytokeratin VIb 


0.053 


2379 


AL590157 


Human DNA sequence from clone RP13-420K18 on chromosome X, 
complete sequence [Homo sapiens] 


4E-08 


2380 


AF282242 


Plasmodium falciparum strain Dd2 chloroquine resistance marker 
protein gene, complete cds 


4.2 


2381 


Z24745 


Athaliana 2S albumin gene isoforms 1 and 2, complete CDS's 


0.6 
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2382 


AL035418 


Human DNA sequence from clone RP1-141I3 on chromosome 
22ql3. 1-13.33 Contains a GSS, complete sequence [Homo sapiens] 


1.6 


2383 


Y17968 


Gallus gallus mRNA for high mobility group 1 protein 


1.5 


2384 


XM 034735 


Homo sapiens similar to hypothetical protein FLJ21463 (H. sapiens) 
(LOC90883), mRNA 


5.6 


2385 


AF001415 


Arabidopsis thaliana 14-3-3-like protein GF14 upsilon (GRF5) gene, 
complete cds 


1.3 


2386 


XM.009026 


Homo sapiens retinol dehydrogenase 8 (all-trans) (RDH8), mRNA 


1.9 


2387 


XM 003095 


Homo sapiens zinc finger protein 288 (ZNF288), mRNA 


1.8 


2388 


AE003083 


Drosophila melanogaster genomic scaffold 142000013385449, 
complete sequence 


0.51 


2389 


AE002798 


Drosophila melanogaster genomic scaffold 142000013385466, 
complete sequence 


0.018 


2390 


AL359506 


Human DNA sequence from clone CTD-2007 A7 on chromosome 20. 
Contains GSSs, complete sequence [Homo sapiens] 


0.28 


2391 


U32726 


Haemophilus influenzae Rd section 41 of 163 of the complete genome 


0.3 


2392 


ABO 10468 


Cyprinus carpio mRNA for CXC chemokine receptor-1, complete cds 


0.9 


2393 


AE006886 


Sulfolobus solfataricus section 245 of 272 of the complete genome 


0.19 


2394 


M19810 


Human apolipoprotein B-100 (apoB) gene, exon 4 


0.43 


2395 


AB005548 


Homo sapiens gene for squamous cell carcinoma antigen 2, partial cds 


0.0004 


2396 


AJ3 18337 


Plasmodium falciparum krueppell gene for Krueppel-like protein 


0.62 


2397 


U01102 


Human lung Clara cells 10 kda secretory protein (CC10) gene, 
satellite and Alu repeat sequences, complete cds 


3.8 


2398 


AK017766 


Mus musculus 8 days embryo cDNA, RIKEN full-length enriched 
library, clone:5730512J02, full insert sequence 


1.7 


2399 


AE000032 


Mycoplasma pneumoniae M129 section 48 of 63 of the complete 
genome 


0.55 


2400 


AF333234 


Homo sapiens receptor activator of nuclear factor kappa B ligand 
(RANKL) gene, promoter region 


0.22 


2401 


BC010652 


Homo sapiens, clone MGC:9753 1MAGE:3855206, mRNA, complete 
cds 


1.8 


2402 


AK007856 


Mus musculus 10 day old male pancreas cDNA, RIKEN full-length 
enriched library, clone: 1810054D07, foil insert sequence 


2E-94 


2403 


AF172723 


Homo sapiens integrin alpha 10 subunit (TTGA10) gene, exons 7 
through 17 and partial cds 


0.21 


2404 


NM 026570 


Mus musculus glioma-amplified sequence-41 (Gas41-pending), 
mRNA 


1.1 


2405 


U78770 


Mus musculus spasmolytic polypeptide (mSP) gene, complete cds 


1.8 
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2406 


U26729 


Human neprilysin gene, non-coding exon 3 


0.92 | 


2407 


AE006346 


Lactococcus lactis subsp. lactis IL1403 section 108 of 218 of the 
complete genome 


2 


2408 


AF258783 


Felis catus Niemann-Pick type CI disease protein (NPC1) mRNA, 
complete cds 


0.23 


2409 


AC007280 


Homo sapiens BAC clone RP11-332C7 from 7pll.2-p21, complete 
sequence 


0.39 


2410 


D13065 


Mycoplasma capricolum mcs4 gene for MCS4 RNA 


0.062 


2411 


AF196058 


Drosophila pseudoobscura strain PSU606 period (per) gene, intron 4, 
partial sequence 


1.5 


2412 


AF370534 


Arabidopsis thaliana Putative L-ascorbate peroxidase (AT4g09010) 
mRNA, complete cds 


1.8 


2413 


AF084197 


Pongo pygmaeus gamma-aminobutyric acid receptor A5 subunit 
duplicated gene, 5'UTR region 


0.00009 


2414 


Y17045 


Plasmodium falciparum gltS gene 


0.063 


2415 


AL021177 


Caenorhabditis elegans cosmid Yl A5A, complete sequence 


0.22 


2416 


AJ238847 


Drosophila melanogaster mRNA for drosophila dodeca-satellite ! 
protein 1 (DDP-1) 


0.22 


2417 


AB056333 


Macaca fascicularis brain cDNA, clone:QflA- 13486 


e-126 


2418 


AF104821 


Danio rerio Hlx2 gene, promoter region 


0.056 


2419 


AF138960 


Neospora caninum yc£24 protein (yc£24) gene, partial cds; DNA 
dependent RNA polymerase beta subunit (rpoB) gene, complete cds; 
and DNA dependent RNA polymerase beta subiimf (rpoCl) gene, 
partial cds, plastid genes for plastid products 


0.21 


2420 


AF338712 


Emeus crassus mitochondrion, partial genome 


1.3 


2421 


Z82625 


R.prowazekii genomic DNA fragment (clone A396F) 


0.46 


2422 


XM 015945 


Homo sapiens hypothetical protein FLJ13842 (FLJ13842), mRNA \ 


6E-62 


2423 


X57010 


Human COL2A1 gene for collagen II alpha 1 chain, exons E2-E15 


0.63 


2424 


AF283525 


Homo sapiens FIP2 gene, exons 1 1, 12 and 13 j 


4E-71 


2425 


AK013530 


Mus musculus adult male hippocampus cDNA, RIKEN full-length 
enriched library, clone:2900012M01, full insert sequence 


5.3 


2426 


D86077 


Homo sapiens DNA for cyclin G, partial cds 


0.02 


2427 


X94208 


Rsapiens TPR gene (1 121bp) 


0.19 


2428 


AF211528 


Nicotiana tabacum Avr9/Cf-9 rapidly elicited protein 4 (ACRE4) 
mRNA, complete cds 


0.22 


2430 


L35688 


Homo sapiens (subclone H9 10_dl0 from PI LBL#3) DNA sequence 


5E-08 


2431 


U59711 


Cavia porcellus seminal vesicle secretory protein SYP-1/-3/-4 gene, 
complete cds 


0.14 


2432 


AK023305 


Homo sapiens cDNA FLJ13243 fis, clone OVARC1000622 


0.002 


2433 


AB048868 


Macaca fascicularis brain cDNA, clone:QnpA-10055 


0.044 


2434 


AF167585 


Homo sapiens clone C13 immunoglobulin heavy chain variable region 
gene, partial cds 


0.13 


2435 


AB013796 


Ceratopteris richardii mRNA for CRHB6, complete cds 


0.024 
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2436 


AF071481 


Homo sapiens potassium channel (KCNQ3) gene, exon 4 


L9 


2437 


AF282242 


Plasmodium falciparum strain Dd2 chloroquine resistance marker 
protein gene, complete cds 


0.22 ! 


2438 


AY018307 


Oryza sativa microsatellite MRG0632 containing (AT)X14, genomic 
sequence 


0.073 


2439 


Z74273 


S.cerevisiae chromosome IV reading frame ORF YDL225w 


0.028 


2440 


AY021257 


Oryza sativa microsatellite MRG3582 containing (TA)X22, genomic 
sequence 


0.018 


2441 


XM_012164 


Homo sapiens STE20-like kinase (JIK), mRNA I 


0.27 


2442 


XM_036587 


Homo sapiens hypothetical protein DKFZp564B 1023 
(DKFZP564B1023), mRNA 


0.008 


2443 


AL1 12743 


Botrytis cinerea strain T4 cDN A library under conditions of nitrogen 
deprivation 


0.069 


2444 


AB020575 


Conocephalum conicum psbA gene, complete cds, strain J type 


0.074 


2445 


BC001772 


Homo sapiens, Similar to glutaminyl-tRNA synthetase, clone 
MGC:1525 IMAGE:3543728, mRNA, complete cds 


5.5 


2446 


AF289113 


Drosophila montana clone mon 10 transposon Penelope gene 
sequence 


0.064 


2447 


XM_0 15806 


Homo sapiens hypothetical protein FLJ22621 (FU22621), mRNA 


0 


2448 


AY047559 


Drosophila melanogaster GH07383 full length cDNA 


0.92 


2449 


XM_052662 


Homo sapiens ammo-terminal enhancer of split (AES), mRNA 


0.59 


2450 


AL138723 


Human DNA sequence from clone RP1 1-446N1 on chromosome 6, 
complete sequence [Homo sapiens] 


0.008 


2451 


AF151389 


Dennatobia hominis strain Ponta Grossa tRNA-Ile gene, partial 
sequence; D-loop, complete sequence; and 12S ribosomal RNA, 
partial sequence; mitochondrial genes for mitochondrial products 


0.46 


2452 


NM_018123 


Homo sapiens hypothetical protein FLJ105 17 (ELJ10517), mRNA 


0.21 


2453 


XM_042803 


Homo sapiens hypothetical gene supported by AL3 59651 
(LOC92091), mRNA 


1.9 


2454 


AE001397 


Plasmodium falciparum chromosome 2, section 34 of 73 of the 
complete sequence 


5 


2455 


AF017996 


Mus musculus lysosomal acid lipase gene, promoter region 


0.022 


2456 


XM 028409 


Homo sapiens KIAA1871 protein (KIAA1871), mRNA 


0.0003 


2457 


XM 051692 


Homo sapiens KIAA1705 protein (KIAA1705), mRNA 


0.0003 


2458 


U00037 


Caenorhabditis elegans cosmid T20H4 


0.073 


2459 


AL138748 


Hinnan DNA sequence from clone RP4-732E19 on chromosome 
Xq21.1-21.33 Contains GSSs, complete sequence [Homo sapiens] 


0.2 


2460 


AF052744 


Homo sapiens transcriptional repressor p54 gene, promoter region and 
partial cds 


1.8 


2461 


XM 007006 


Homo sapiens KIAA0985 protein (KIAA0985), mRNA 


1.7 


2462 


AF303046 


Homo sapiens prostinogen mRNA, complete cds 


0.44 


2463 


AK023562 


Homo sapiens cDNA FLJ13500 fis, clone PLACE1004693 


e-116 
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2464 


AF076692 


Aspergillus fumigatus aureobasidin-resistance protein (aurl) mRNA, 
complete cds 


3.2 


2465 


AF054616 


Glossina morsitans morsitans thrombin inhibitor precursor ( i l l) 
mRNA complete cds 


0.44 


2466 


Z54147 


Human DNA sequence from cosmid L129H7, Huntington's Disease 
Region, chromosome 4pl6.3 contains CpG island 


4E-09 


2467 


XM 040290 


Homo sapiens glutamate receptor, ionotropic, AMPA 1 (GRIA1), 
mRNA 


4 


2468 


AY026512 


Rattus norvegicus dynein-associated protein RKM23 (km23) mRNA, 
complete cds 


1.5 


2469 


AY033091 


Plasmodium berghei Pbs36 gene, complete cds 


0.0008 


2470 


XM 051572 


Homo sapiens heat shock transcription factor 2 (HSF2), mRNA 


2E-42 


2471 


AF287003 


Floriceps minacanthus 18S ribosomal RNA gene, complete sequence 


1.8 


2472 


X99872 


Sulfolobus acidocaldarius pyrB gene 


0.2 


2473 


U49451 


Bombina orientalis SAP bombesin preprohormone mRNA, complete 
cds 


0.56 


2474 


AC024819 


Caenorhabditis elegans cosmid Y55B1AL, complete sequence 


6.9 


2475 


AF282602 


Litoria subglandulosa strain SAMAJR51051 NADH dehydrogenase 
subunit 4 (ND4) gene, partial cds; mitochondrial gene for 
mitochondrial product 


0.073 


2476 


AL591343 


Human DNA sequence from clone RP1 1-260H5 on chromosome 6, 
complete sequence [Homo sapiens] 


0.24 


2477 


Z69375 


Human DNA sequence from cosmid L174G8, Huntington's Disease 
Region, chromosome 4pl6.3 


3E-16 


2478 


U32174 


Dictyostelium discoideum spore lysis A (splA) gene, complete cds j 


0.25 


2479 


XM_045095 


Homo sapiens KIAA1320 protein (KIAA1320), mRNA 


0.53 


2480 


AY035224 


Gallus gallus cardiac muscle factor 1 gene, partial cds 


0.78 


2481 


D14567 


Penicillium urticae mitochondrial 1-rRNA (large rRNA) gene and its 
flanking region 


0.25 


2482 


AF119218 


HIV-1 strain 96CG38. 10 from Republic of the Congo, envelope 
glycoprotein, C2-C4 region (env) gene, partial cds 


0.002 


2483 


AF386963 


Arabidopsis thaliana Unknown protein mRNA, complete cds 


0.057 


2484 


Z19595 


C.elegans her-1 gene 


0.19 


2485 


AE006341 


Lactococcus lactis subsp. lactis IL1403 section 103 of 218 of the 
complete genome 


6.1 j 


2486 


AL583831 


Human DNA sequence from clone RP11-469L10 on chromosome 6, 
complete sequence [Homo sapiens] 


0.23 


2487 


NC 001617 


Human rhinovirus 89 


0.078 


2488 


X65859 


H. sapiens pseudogene for the low affinity IL-8 receptor 


5.7 


2489 


AE007508 


Streptococcus pneumoniae section 191 of 194 of the complete genome 


0.63 


2490 


AF199367 


Entamoeba histolytica cdc48-like protein gene, complete cds 


0.56 


2491 


AF211160 


Pseudonaja textilis tissue-type liver phospholipase A2 inhibitor beta 
subunit isoform PTI-2B mRNA, complete cds 


2.2 
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2492 


] 

AE000813 : 


Methanobacterium thermoautotrophicum from bases 206991 to 
217929 (section 19 of 148) of the complete genome 


5.5 


2493 


U67604 


Vlethanococcus jannaschii section 146 of 150 of the complete genome 


1 


2494 


D16253 


Pichia canadensis mitochondrial gene for NADH dehydrogenase 
subunit 5, complete cds 


0.006 


2495 


NC 001530 


Human papillomavirus type 47, complete genome j 


5.4 


2496 


AF286437 


Pseudavenzoaria ochropodis from host Tringa ochropus 16S 
ribosomal RNA gene, partial sequence; mitochondrial gene for 
mitochondrial product 


0.4 


2497 


XM 006069 


Homo sapiens weel+ (S. pombe) homolog (WEE1), mRNA 


3E-26 


2498 


AF3 14019 


Rana boylii NADH dehydrogenase subunit 1 (ND1) gene, partial cds; 
tRNA-Qe, tRNA-Gln, and tRNA-Met genes, complete sequence; 
NADH dehydrogenase subunit n (ND2) gene, complete cds; tRNA- 
Trp, tRNA-Ala, tRNA-Asn, tRNA-Cys, and tRNA-Tyr genes, 
complete se> 


1.4 


2499 


AJ3 14911 


Dictyostelium discoideum DrnA gene for putative RNasein, exons 1-3 


0.51 


2500 


M35228 


Vesicular stomatitis virus (strain 86-GM-B) glycoprotein gene, 
complete cds 


0.45 


2501 


AB021290 


Homo sapiens gene for chondromodulin-1, promoter and partial cds 


0.72 


2502 


AJ307807 


Astrammina rara partial 18S rRNA gene, isolate A44 


3.5 


2503 


AB005148 


Bos taurus mRNA for IL-1 receptor antagonist, complete cds 


0.84 


2504 


AE002687 


Drosophila melanogaster genomic scaffold 142000013385743, 
complete sequence 


0.23 


2505 


AF053496 


Caenorhabditis elegans beta chain spectrin homolog Smal (smal) 
mRNA, complete cds 


5.9 


2506 


X59611 


E.coli IncFV plasmid pED208 oriT region with genes traM, traJ and 
traY (part.) 


0.043 


2507 


XM 051897 


Homo sapiens phospholipase A2, group IVA (cytosolic, calcium- 
dependent) (PLA2G4A), mRNA 


3E-66 


2508 


AJ132890 


Bos taurus mRNA for acetyl-CoA-cafboxylase 


0.18 


2509 


Y13537 


Homo sapiens LAG-3 gene, 3' region 


1.7 


2510 


AE002149 


Ureaplasma urealyticum section 50 of 59 of the complete genome 


0.027 


2511 


AF170573 


Influenza C virus Cyjohannesburg/1/66 segment 5 nucleoprotein (NP) 
gene, partial cds 


2.4 


2512 


AK016507 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4931439C15, full insert sequence 


6.8 


2513 


AL3 54664 


Human DNA sequence from clone RP1 1-209D8 on chromosome 6. 
Contains ESTs and GSSs. Contains part of the COL12A1 gene for 
collagen type XII, alpha 1, complete sequence [Homo sapiens] 


0.082 


2514 


AB042531 


Pisum sativum PsEXT mRNA for xyloglucan endotransglycosylase, 
complete cds 


2E-12 



307 



3NSDOCID: <WO 021 4500A2_I_> 



WO 02/14500 



PCT/US01/25840 





Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
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2515 


AL590371 


Human DNA sequence from clone RP11-295H23 on chromosome 6, 
complete sequence [Homo sapiens] 


1.5 


2516 


AF334780 ! 


Homo sapiens pregnancy-induced growth inhibitor OKL38 gene, 
partial cds 


0.8 


2517 


AF362371 


Dictyostelium discoideum histidine kinase Dhkl (dhkl) gene, 
complete cds 


0.021 


2518 


NM 012608 


Rattus norvegicus Membrane metallo-endopeptidase (neutral 
endopeptidase/enkephalinase) (Mme), mRNA 


2 


2519 


BC007173 


Mus musculus, clone IMAGE:3493127, mRNA, partial cds 


5E-53 


2520 


Y13870 


Homo sapiens mRNA containing (CAG)6 repeat, clone CZ-CAG-12 


0.06 


2521 


AF051769 


Homo sapiens hyaluronidase-like pseudogene 1 (HYALP1), partial 
sequence 


0.009 


2522 


AF309947 


Dictyostelium discoideum Racl A (racl A) gene, complete cds; and 
unknown gene 


0.77 


2523 


AF128392 


Arabidopsis thaliana BAC F15P23 


0.22 


2524 


AK020739 


Mus musculus adult male spinal cord cDNA, RIKEN full-length 
enriched library, clone:A330106L17, full insert sequence 


7.2 


2525 


Y09472 


B.taurus gene encoding preprododecapeptide 


0.24 


2526 


D87922 


Rat DNA for 3TJTR of skeletal muscle sodium channel, partial 
sequence 


0.4 


2527 


U70865 


Carassius auratus CCK8 precursor, mRNA, complete cds 


0.67 


2528 


AB012624 


Homo sapiens FLU gene for ERGB transcription fuctor, intron 4 and 
partial cds 


0.054 


2529 


AF180492 


Crocuta crocuta microsatellite Ccrl2 sequence 


0.083 


2530 


AL078640 


Human DNA sequence from clone 536P6 on chromosome 22. 
Contains an EST, complete sequence [Homo sapiens] 


2E-11 


2531 


Z63184 


Ksapiens CpG island DNA genomic Msel fragment, clone 7al, 
forward read cpg7al.ftld 


6E-22 


2532 


AL162055 


Homo sapiens mRNA; cDNA DKFZp761M2223 (from clone 
DKFZp761M2223) 


0.022 


2533 


AE001392 


Plasmodium falciparum chromosome 2, section 29 of 73 of the 
complete sequence 


6.5 


2535 


AF202903 


Sporopachydermia cereana var. D77-321B internal transcribed spacer 
1, 5.8S ribosomal RNA gene and internal transcribed spacer 2, 
complete sequence; and large subunit ribosomal RNA gene, partial 
sequence 


1 0.14 


2536 


U59487 


Borrelia burgdorferi P35 antigen protein gene, and 7.5 kDa 
lipoprotein gene, complete cds 


0.029 


2537 


XM_030999 


Homo sapiens KIAA0444 protein (KIAA0444), mRNA 


0.012 


2538 


AF302838 


Mus musculus putative tumor suppressor LEU2 gene, exons 1 and 2 


0.63 


2539 


AF125423 


Bresiiiidae gen. sp. LSP21-1 cytochrome oxidase I gene, partial cds; 
mitochondrial gene for mitochondrial product 


0.66 


2540 


AP001348 


Homo sapiens genomic DNA, chromosome 21q21.1-q21.2 
clone:f39G8, LL56-APP region, complete sequence 


0.42 
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2541 


XM 028324 


Homo sapiens hypothetical protein FLJ14195; KIAA1714 protein 
(FLJ14195), mRNA 


8E-98 


2542 


AF238860 


Streptococcus mutans mutacin I biosynthesis gene cluster, partial 
sequence 


4.6 


2543 


AE005391 


Escherichia coli 0157:H7 EDL933 genome, contig 3 of 3, section 10 
of 290 


1.6 


2544 


AK024518 


Homo sapiens cDNA: FLJ20865 fis, clone ADKA01850 


0.06 


2545 


AK026595 


Homo sapiens cDNA: FEJ22942 fis, clone KAT08170, highly similar 
to HUMMHBW62 Homo sapiens (clone pMF18) MHC class I HLA- 
Bw62 mRNA 


4E-39 


2546 


AL449163 


Human DNA sequence from clone RP1 1-533E16 on chromosome 6, 
complete sequence [Homo sapiens] 


2.3 


2547 


AJ305053 


Caprine arthritis-encephalitis virus proviral partial pol gene for Pol 
protein (reverse transcriptase), isolate CAEVBr/UFRGS-2/C767 


0.69 


2548 


AF283067 


Homo sapiens clone 3ptel_cllbt3 sequence 


1E-41 


2549 


AF286431 


Bychovskiata intermedia from host Charadrius leschenaulti 16S 
ribosomal RNA gene, partial sequence; mitochondrial gene for 
mitochondrial product 


0.94 


2550 


L35232 


Schizosaccharomyces pombe deoxyribonuclease (exo2) gene, exons 1- 
3, complete cds 


0.93 


2551 


AF3 19044 


Methanosarcina acetivorans strain C2 A MtaF (mtaF) and MtaG 
(mtaG) genes, complete cds 


0.034 


2552 


XM_018334 


Homo sapiens hypothetical protein FLJ22418 (FU22418), mRNA 


0.013 


2553 


U35862 


Visna virus clone 678 env3.2 envelope polyprotein (env) gene, partial 
cds 


0.17 


2554 


M20005 


N.crassa conidation-specific protein genes con- 10 (complete cds.) and 
con-13 (3 1 flank) 


0.079 


2555 


AB050531 


Macaca fascicularis brain cDNA, clone:QnpA-20907 


1.4 


2556 


AB009693 


Mus musculus gene for mafG, complete cds 


5.6 


2557 


U67593 


Methanococcus jannaschii section 135 of 150 of the complete genome 


0.024 


2558 


XM 028292 


Homo sapiens x 009 protein (MDS009), mRNA 


4 


2559 


U67578 


Methanococcus jannaschii section 120 of 150 of the complete genome 


1.8 


2560 


AE001759 


Thermotoga maritima section 71 of 136 of the complete genome 


1.9 


2561 


AF363230 


Phascolion strombi homeodomain transcription factor Gsx (Gsx) 
gene, partial cds 


0.59 


2562 


NM019211 


Rattus norvegicus RAS guanyl releasing protein 1 (Rasgrp), mRNA 


2.5 


2563 


U60070 


Solanum tuberosum disease resistance homolog (Stl21) gene, partial 
cds 


0.68 


2564 


AE006593 


Streptococcus pyogenes Ml GAS strain SF370, section 122 of 167 of 
the complete genome 


0.062 
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2565 


AF147231 


Wilkesia gymnoxiphium ASAPETALA1-A (ASAP1-A) gene, exons 3 
through 8, partial cds 


0.026 


2566 


AK018656 


Mus musculus adult male cecum cDNA, RIKEN full-length enriched 
library, clone:9130402C12, full insert sequence 


0.74 


2567 


X65859 


H.sapiens pseudogene for the low affinity IL-8 receptor 


5.4 


2568 


XM_030S55 


Homo sapiens similar to KIAA1361 protein (H. sapiens) (LOC90315), 
mRNA 


0.65 


2569 


XM 031726 


Homo sapiens hypothetical protein FLJ12577 (FLJ12577), mRNA 


2.1 


2570 


Z68749 


Caenorhabditis elegans cosmid F56H11, complete sequence 


0.025 


2571 


AE000020 


Mycoplasma pneumoniae M129 section 60 of 63 of the complete 
genome 


0.17 


2572 


U63091 


Oryctolagus cuniculus beta-globin-like gene cluster locus control 
region 


0.13 


2573 


BC001339 


Homo sapiens, clone IMAGE:3461987, mRNA, partial cds 


1E-31 


2574 


U86962 


Dictyostelium discoideum MigA (migA) gene, complete cds 


0.9 


2575 


AY021629 


Oryza sativa microsatellite MRG3954 containing (TA)X30, genomic 
sequence 


6.5 


2576 


AB063077 


Macaca fascicularis brain cDNA clone:QtrA-13349, full insert 
sequence 


0.59 


2577 


XM 044332 


Homo sapiens piwi (Drosophila)-like 1 (PIWIL1), mRNA 


4E-28 


2578 


Z97185 


S.pombe chromosome I cosmid cl6A10 


1.5 


2579 


L43603 


Gallus gallus Na+/K+-ATPase alpha-1 subunit gene, promoter region 


0.7 


2580 


AC016645 


Homo sapiens chromosome 5 clone RP11-54A24, complete sequence 


0.0001 


2581 


AE001408 


Plasmodium falciparum chromosome 2, section 45 of 73 of the 
complete sequence 


0.25 


2582 


AF067699 


Ischnura prognata cytochrome b (cytb) gene, mitochondrial gene 
encoding mitochondrial protein, partial cds 


0.45 


2583 


AC026061 


Homo sapiens BAC clone RP11-223K9 from Y, complete sequence 


2E-15 


2584 


Y13537 


Homo sapiens LAG-3 gene, 3* region 


! 0.24 


2585 


AE005977 


Caulobacter crescentus section 303 of 359 of the complete genome 


5.5 


2586 


U13190 


Angiostrongylus cantonensis clone G-2 retrotransposable element 
AcT-1 


0.077 


2587 


AF364550 


Drosophila melanogaster transposon cruiser, complete sequence 


0.07 


2588 


XM_034146 


Homo sapiens CD36 antigen (collagen type I receptor, 
thrombospondin receptor) (CD36), mRNA 


0.62 


2589 


AF014580 


nnninrtpn* viminflliR 175s rihnsomal RNA gene 3 mitochondrial gene 
for mitochondrial RNA, partial sequence 


2.1 


2590 


XM_042643 


Homo sapiens hypothetical protein DKFZp434G131 
(DKFZP434G131), mRNA 


0.68 


2591 


BC004324 


Homo sapiens, ribosomal protein S16, clone MGC: 10931 
IMAGE:3628799, mRNA, complete cds 


0.006 
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2592 


U42430 


Mesocricetus auratus CD36 mRNA, complete cds 


0.71 


2593 


U16137 


Saccharomyces cerevisiae Mvplp (MVP1) gene, complete cds 


4.7 


2594 


AB047954 


Macaca fascicularis brain cDNA, clone:QnpA-12630 


0.65 


2595 


AF261697 


Homo sapiens clone 17ptel_c39t3 sequence 


0.024 


2596 


AF162598 


Drosophila virilis strain wl59 zeste protein (z) gene, partial cds 


0.063 


2597 


U40255 


Peromyscus sejugis ND3 and ND4L genes, complete cds, tRNA(arg) 
gene, complete sequence, tRNA(gly) gene, partial sequence, and ND4 
gene, partial cds, mitochondrial genes encoding mitochondrial 
products 


0.67 


2598 


Z63251 


Rsapiens CpG island DNA genomic Msel fragment, clone 7hl0, 
forward read cpg7hl0.fla 


0.003 


2599 


U68150 


Chelonus sp. 16S mitochondrial ribosomal RNA, mitochondrial gene, 
partial sequence 


0.003 


2600 


AF138281 


Arabidopsis thaliana phospholipase D-gamma-2 mRNA, complete cds 


0.071 


2601 


AF069985 


Mus musculus nitrilase homolog 1 (Nitl) gene, alternatively spliced 
product, complete cds 


0.39 


2602 


M83134 


Human germline IgH chain (hv3005) Y3-region 


0.35 


2603 


AF275245 


Trama rara 12S small subunit ribosomal RNA gene, partial sequence; 
tRNA-Val gene, complete sequence; and 16S large subunit ribosomal 
RNA gene, partial sequence; mitochondrial genes for mitochondrial 
products 


0.59 


2604 


X93302 


S.cerevisiae mRNA for Msn5 protein 


0.05 


2605 


AK015536 


Mus musculus adult male testis cDNA, RUCEN full-length enriched 
library, clone:4930470H18, full insert sequence 


0.7 1 


2606 


AC087735 


Caenorhabditis elegans cosmid Y35H6, complete sequence 


0.022 


2607 


AJ271002 


Mus musculus TFFl/pS2 gene for Trefoil Factor l/pS2, exons 1-3 


0.73 


2608 


NM_019664 


Mus musculus potassium inwardly-rectifying channel, subfamily J, 
member 15 (Kcnjl5), mRNA 


5.8 


2609 


AL450330 


Human DNA sequence from clone RP11-162H13 on chromosome 6 
Contains GSSs, complete sequence [Homo sapiens] 


0.25 


2610 


Z24671 


H. sapiens (D1S514) DNA segment containing (CA) repeat; clone 
AFMal51za5; single read 


0.007 


2611 


AE000773 


Aquifex aeolicus section 105 of 109 of the complete genome 


0.34 


2612 


X93508 


E. histolytica mRNA for 170kDa subunit of galactose/N- 
acetylgalactosamine binding lectin 


0.054 


2613 


AK003780 


Mus musculus 18 days embryo cDNA, RDCEN full-length enriched 
library, clone: 11 10018121, full insert sequence 


0.15 


2614 


L81687 


Homo sapiens (subclone l_cl2 from PI H54) DNA sequence, 
complete sequence 


5.2 


2615 


I M10090 


Human myoglobin gene, exon 1 


0.13 


2616 


NC 002670 


Bacteriophage bIL3 11, complete genome 


0.061 


2617 


AB016259 


Oryzias latipes gene for cardiac muscle actin, complete cds 


0.19 


2618 


U67578 


Methanococcus jannaschii section 120 of 150 of the complete genome 


2 
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2619 


AF186258 


Homo sapiens suifbtransferase 1C1 (SULT1C1) gene, exons 2 and 3 


0.002 


2620 


X79777 


P.furiosus aor, cmo and ado-hcy genes ! 


6.1 


2621 


XM 042013 


Homo sapiens cell division cycle 34 (CDC34), mRNA 


1.5 


2622 


AB044879 


Equus caballus DNA, microsatellite TKY379 


0.18 


2623 


AF1 17951 


Mus musculus lysyl oxidase-like protein 2 mRNA, partial cds 


0.015 


2624 


AF3 10887 


Dictyostelium discoideum RacB (racB) gene, partial cds; and 
unknown gene 


0.17 


2625 


XM 034353 


Homo sapiens KIAA1500 protein (KIAA1500), mRNA 


4E-89 


2626 


AK021859 


Homo sapiens cDNA FIJI 1797 fis, clone HEMBA1006182 


0 


2627 


AF280894 


Pongo pygmaeus clone 3458 chromosome Xq genomic sequence 


0.0002 


2628 


AE001396 


Plasmodium falciparum chromosome 2, section 33 of 73 of the 
complete sequence 


0.074 


2629 


XM_046097 


Homo sapiens similar to RIKEN cDNA 2610307121 gene; 
hypothetical protein, MNCb-4273 (M. musculus) (LOC92606), 
mRNA 


0.58 


2630 


X02438 


Leishmania tarentolae maxicircle DNA fragment ] 


• 6.2 


2631 


AF181582 


Bombus terrestris 16S large subunit ribosomal RNA gene, partial 
sequence 


0.2 


2632 


M34149 


S.cerevisiae mitochondrial gap 12 sequence 


0.64 


2633 


AJ400612 


Trioxys angelicae partial mitochondrial ATPase 6 gene for ATPase 6 
subunit 


0.65 


2634 


AB021290 


Homo sapiens gene for chondromodulin-1, promoter and partial qds 


0.75 


2635 


AJ133756 


Drosophila melanogaster mRNA for jim zinc finger protein 


0.028 


2636 


AL121589 


Human DNA sequence from clone RP5-1140M3 on chromosome 20. 
Contains GSSs, complete sequence [Homo sapiens] 


0.002 


2637 


XM 040430 


Homo sapiens hypothetical protein FLJ10599 (FLJ10599), mRNA 


0.24 


2638 


ABO 15484 


Dugesia japonica mRNA for myosin heavy chain, complete cds 


0.2 


2639 


XM 008161 


Homo sapiens hypothetical protein FLJ20345 (FLJ20345), mRNA 


0.075 


2640 


AF024621 


Mus musculus gamma-aminobutyric acid receptor rho2 subunit 
mRNA, complete cds 


6.8 


2641 


L78777 


Homo sapiens (subclone 2 b8 from PI H49) DNA sequence 


8E-11 


2642 


AE001028 


Archaeoglobus fulgidus section 79 of 172 of the complete genome 


2 


2643 


AB013796 


Ceratopteris richardii mRNA for CRHB6, complete cds 


0.028 


2644 


X03848 


ChloreDa eUipsoidea chloroplast 16S-23S DNA spacer and ORF1 to 
ORF4 DNA 


0.086 


2645 


Z21938 


B.aphidicola trpE and trpG genes for anthranilate synthase, 
components I and n 


0.74 


2646 


XM 039805 


Homo sapiens hypothetical protein MGC3040 (MGC3040), mRNA 


2.1 
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2647 


AF267217 


Candidate Carsonella raddii natural-host Pachypsylla celtidis ATP 
synthase alpha subunit (atpA) gene, partial cds; ATP synthase gamma 
subunit (atpG) gene, complete cds; and ATP synthase beta subunit 
(atpD) gene, partial cds 


0.71 


2648 


L39786 


Lupinus angustifolius conglutin gamma gene, complete cds 


0.084 


2649 


AB049879 


Macaca fascicularis brain cDNA, clone:QnpA-18812 


0.74 


2650 


AE002648 


Drosophila melanogaster genomic scaffold 142000013385436, 
complete sequence 


2.1 


2651 


XM 032228 


Homo sapiens hypothetical gene supported by AL080068 
(LOC90510),mRNA 


2.4 


2652 


XM_044529 


Homo sapiens calcium and integrin binding protein (DNA-dependent 
protein kinase interacting protein) (SIP2-28), mRNA 


0.76 


2653 


AB035494 


Alternaria alternata gene for Akt3-2, complete cds 


0.7 


2654 


AE001381 


Plasmodium falciparum chromosome 2, section 18 of 73 of the 
complete sequence 


0.27 


2655 


XM_013145 


Homo sapiens Alport syndrome, mental retardation, midface 
hypoplasia and elliptocytosis chromosomal region, gene 1 
(AMMECR1), mRNA 


0.65 


2656 


NM 007976 


Mus musculus coagulation factor V (F5), mRNA 


0.077 


2657 


U47913 


Tribolium freemani Woot retrotransposon, partial sequence and 3' 
insertion junction F3(l) 


0.087 


2658 


D10331 


Pneumonia virus of mice mRNA for nonstructural proteins and major 
nucleocapsid protein, complete cds 


0.084 


2659 


AL390874 


Human DNA sequence from clone RP1 1-238K16 on chromosome 9, 
complete sequence [Homo sapiens] 


0.003 


2660 


AF020282 


Dictyostelium discoideumDG2033 gene, partial cds 


2.3 


2661 


AF393505 


Mus musculus potential meiotic DNA repair locus 


2.2 


2663 


U12186 


Oryctolagus cuniculus sodium/dicarboxylate cotransporter mRNA, 
partial cds 


2.3 


2664 


AJ305053 


Caprine arthritis-encephalitis virus proviral partial pol gene for Pol 
protein (reverse transcriptase), isolate CAEV Br/UFRGS-2/C767 


0.78 


2665 


AE006628 


Streptococcus pyogenes Ml GAS strain SF370, section 157 of 167 of 
the complete genome 


2.4 


2666 


L76504 


Cronartium arizonicum isolate FSprP-1 18S ribosomal RNA gene, 
partial sequence; internal transcribed spacer 1, 5.8S ribosomal RNA 
gene and internal transcribed spacer 2, complete sequence; and 26S 
ribosomal RNA gene, partial sequence 


0.79 


2667 


XM 032307 


Homo sapiens neuronal thread protein (AD7C-NTP), mRNA 


7.2 


2668 


XM 010126 


Homo sapiens stromal antigen 2 (STAG2), mRNA 


e-123 


2670 


AB018798 


Plasmodium yoelii gene for P-Type ATPase, complete cds 


j 0.029 


2671 


Z24358 


H. sapiens (D2S377) DNA segment containing (CA) repeat; clone 
AFM319zf9; single read 


0.0004 


2672 


AE000878 


Methanobacterium thermoautotrophicum from bases 976801 to 
992232 (section 84 of 148) of the complete genome 


6.2 j 
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2673 


AL137717 


Homo sapiens mRNA; cDNA DKFZp434J1630 (from clone 
DKFZp434J1630) 


2E-31 


2674 


X61377 


S.cerevisiae XYK gene for xylulokinase 


2.4 


2675 


XM_034353 


Homo sapiens KIAA1500 protein (KIAA1500), mRNA 


4E-89 


2676 


JO 1268 


Slime mold (D.discoideum) actin 2 (sub 2) pseudogene 5' end 


0.085 


2677 


AF113932 


Carios puertoricensis 16S ribosomal RNA, partial sequence 


0.079 


2678 


XM_028752 


Homo sapiens ribophorin n (RPN2), mRNA 


2.1 


2679 


BC003254 


Mus musculus, Similar to dentatorubral pallidoluysian atrophy, clone 
MGC:5758, mRNA, complete cds 


0.78 


2680 


NC_001871 


Feline foamy virus, complete genome 


0.27 


2681 


AF188115 


Simian immunodeficiency virus Gag protein gene, complete cds; Pol 
protein gene, partial cds; Vif protein (Vif), Vpr protein (Vpr), 
truncated Tat protein (Tat), Rev protein (Rev), Env protein (Env), and 
Nef protein (Nef) genes, complete cds; and long te> 


0.77 


2682 


AE007382 


Streptococcus pneumoniae section 65 of 194 of the complete genome 


0.73 


2683 


AF190869 


Gorilla gorilla X chromosome clone 1, partial sequence 


0.082 


2685 


AK004792 


Mus musculus adult male lung cDNA, RIKEN full-length enriched 
library, clone: 1200015107, full insert sequence 


1E-15 


2686 


AE001387 


Plasmodium falciparum chromosome 2, section 24 of 73 of the 
complete sequence 


0.063 


2687 


AB051438 


Homo sapiens mRNA for KIAA165 1 protein, partial cds 


e-141 


2688 


AY047509 


Drosophila melanogaster GH01721 full length cDNA 


0.21 


2689 


AKO 16678 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4933405P16, full insert sequence 


0.21 


2690 


Z19055 


B.aphidicola tryptophan operon 


0.077 


2691 


XM_047737 


Homo sapiens ubiquitin specific protease 25 (USP25), mRNA 


9E-56 


2692 


U13190 


Angiostrongylus cantonensis clone G-2 retrotransposable element 
AcT-1 


0.26 


2693 


X00858 


H.sapiens DHFR gene, exon 5 


0.27 


2695 


AK026620 


Homo sapiens cDNA: FLJ22967 fis, clone KAT10573, highly similar 
to AF151892 Homo sapiens CGI-134 protein mRNA 


0.077 


2696 


D87001 


Homo sapiens immunoglobulin lambda gene locus DNA, clone:47H9 
downstream contig 


0.25 


2697 


U93276 


Mus musculus glucokinase gene, intron 1 and hypersensitive site W 
region 


0.73 


2698 


AE006705 


Sulfolobus solfataricus section 64 of 272 of the complete genome 


1.2 


2699 


AE001423 


Plasmodium falciparum chromosome 2, section 60 of 73 of the 
complete sequence 


2.2 


2700 


X76799 


H.sapiens genomic DNA dinucleotide (AC) repeat clone alO 


6.4 


2701 


Z93324 


L.major DNA containing open reading frame 


0.23 


2702 


AJ251163 


Dictyostelium discoideum dymB gene for dynamin B homologue 


6.3 


2703 


AKO20562 


Mus musculus adult male urinary bladder cDNA, RIKEN full-length 
enriched library, clone:9530018H14, full insert sequence 


0.76 
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2704 


AF 116983 


Stenomesson variegatum ribulose 1,5-bisphosphate carboxylase large 
subunit (ibcL) gene, partial cds; chloroplast gene for chloroplast 
product 


6.4 


2706 


XM_051524 


Homo sapiens archain 1 (ARCN1), mRNA 


0.66 


2707 


AF351620 


Homo sapiens lipocalin-1 interacting membrane receptor (LIMR) 
gene, complete cds 


0.084 


2708 


AF180472 


Homo sapiens aryl-hydrocarbon interacting protein-like 1 (A1PL1) 
gene, complete cds 


5.9 


2709 


AY008848 


Ictalurus punctatus clone Icpu-UA/3 MHC class I antigen gene, 
complete cds 


0.025 


2710 


NM_016862 


Mus musculus vesicle transport through interaction with t-SNAREs 1 
homolog (Vtil-pending), mRNA 


0.65 


2711 


AE004023 


Xylella fastidiosa 9a5c, section 169 of 229 of the complete genome 


3.9 


2712 


XM 037239 


Homo sapiens actin, beta (ACTB), mRNA 


1E-49 


2713 


AE001404 


Plasmodium falciparum chromosome 2, section 41 of 73 of the 
complete sequence 1 


0.7 


2714 


XM_050114 


Homo sapiens TERA protein (TERA), mRNA 


5.9 


2715 


AK001082 


Homo sapiens cDNA FLJ10220 fis, clone HEMBA1007342 


0.56 


2716 


AF380155 


Ctenopharyngodon idella c-Fos (c-fos) mRNA, complete cds 


3.2 


2717 


AB027352 


Cordyceps sp. 97003 mitochondrial DNA for small subunit ribosomal 
RNA 


0.47 


2718 


AB033220 


Lactococcus lactis gadB gene, untranslated by upstream frameshift, 
strain:01-l 


1.9 


2719 


U01841 


Mus musculus B6/CBAF1J peroxisome proliferator activated receptor 
gamma (ppar) mRNA, complete cds 


0.009 


2720 


AL023814 


Caenorhabditis elegans cosmid H08J19, complete sequence 


0.021 


2721 


XM_047737 


Homo sapiens ubiquitin specific protease 25 (USP25), mRNA 


5E-56 


2723 


AF389403 


Takifugu rubripes pericentriolar material 1 protein (PCM1) mRNA, 
complete cds 


1.9 


2724 


AE006369 


Lactococcus lactis subsp. lactis IL1403 section 131 of 218 of the 
complete genome 


0.21 


2726 


AF367865 


Mesembiyanthemum ciystallinum potassium transporter HAK2p 
(HAK2) mRNA, complete cds 


0.39 


2727 


XM 043040 


Homo sapiens hypothetical protein MGC2718 (MGC2718), mRNA 


1E-09 


2728 


AE001392 


Plasmodium falciparum chromosome 2, section 29 of 73 of the 
complete sequence 


0.0000004 


2729 


U31371 


Bovine herpesvirus 4 1.7-kb late RNA (L1.7) mRNA, repeat region 


1.9 


2730 


AF195728 


Libellula flavida 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


0.025 


2731 


j AK024658 


Homo sapiens cDNA: FLJ21005 fis, clone CAE03745 


0.64 


2732 


| AK025613 


Homo sapiens cDNA: FLJ21960 fis, clone HEP05517 


1.4 


2733 


L11016 


Homo sapiens iymphotoxin-beta gene, complete cds 


0.0001 


2734 


Y14422 


M. musculus DNA for retinal protein 


1.2 
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2735 


X83757 


E.histolytica 170 kDa lectin gene, 5' region 


0.64 


2736 


AK017535 


Mus musculus 8 days embiyo cDNA, RIKEN full-length enriched 
library, clone:5730409K12, full insert sequence 


0.64 


2737 


AY047509 


Drosophila melanogaster GH01721 foil length cDNA 


0.21 . 


2738 


ABO 19433 


Homo sapiens gene for T-cell nuclear receptor NOT (Nurrl), complete 
cds 


0.024 


2739 


U58731 


Caenorhabditis elegans cosmid W09B12 


0.61 


2740 


U35620 


Nicotiana tabacum ribulose-1,5 bisphosphate carboxylase/oxygenase 
large subunit N-methyltransferase (rbcmtT) mRNA, complete cds 


1 


2741 


AF181720 


Homo sapiens RU2AS (RU2) gene, complete cds; and RU2S (RU2) 
gene, partial cds 


5.4 


2742 


AE006560 


Streptococcus pyogenes Ml GAS strain SF370, section 89 of 167 of 
the complete genome 


0.24 


2743 


AF346287 


Gonioctena pallida haplotype 61 mitochondrial control region, partial 
sequence 


5 


2744 


L05617 


Dictyostelium purpureum (Dpp5) DNA sequence, repeat region j 


0.021 


2745 


AK021104 


Mus musculus adult male corpus striatum cDNA, RIKEN full-length 
enriched library, clone:C030026M15, full insert sequence 


1 


2746 


L08174 


Romanomermis culicivorax mitochondrial NADH dehydrogenase 
subunits 6 and 3 (ND6, ND3) cytochrome P450-like protein genes, 
complete CDS 


1.6 


2747 


AB006130 


Mesocricetus auratus mRNA for haptoglobin, complete cds 


0.24 


2748 


AK001448 


Homo sapiens cDNA FLJ10586 fis, clone NT2RP2003986 


0.0008 


2749 


M95192 


Drosophila melanogaster hairless protein (H) mRNA, complete cds 


3.6 


2750 


X53426 


Nicotiana sylvestris NySS41 and partial NySS42 genes for RUBISCO 
small subunit (SSU) 


0.63 


2751 


NC 001336 


Methanbbacteritun thermoautotrophicum plasmid pFVl, complete 
sequence 


0.074 


2752 


NM 011595 


Mus musculus tissue inhibitor of metalloproteinase 3 (Timp3), mRNA 


0.012 


2753 


! AF299340 


Homo sapiens CD 164 gene, complete cds, alternatively spliced 


0.074 


2754 


AL121765 


S.pombe chromosome I cosmid cl786 


0.025 


2755 


U27078 


Trichaptum abietinum mitochondrial small subunit ribosomal RNA, 
mitochondrial gene, partial sequence 


0.44 


2756 


AF279292 


Mycoplasma hyopneumoniae strain ISU 232 YX1 (yxl) gene, partial 
cds; pi 10 membrane protein precursor, gene, complete cds; and YX2 
(yx2) gene, partial cds 


0.024 


2757 


AL590371 


Human DNA sequence from clone RP1 1-295H23 on chromosome 6, 
complete sequence [Homo sapiens] 


0.61 


2758 


XM_041904 


Homo sapiens cathepsin S (CTSS), mRNA 


6 


2759 


D63170 


Gallus domesticus DNA for the terminal heterochromatic region of 
the Z chromosome 


1.4 


2760 


NM 023197 


Mus musculus RIKEN cDNA 2310008H09 gene (2310008H09Rik), 
nxRNA 


0.025 



316 



INSDOCID: <WO 0214500A2_I_> 



WO 02/14500 



PCT/US01/25840 





Table 3A Nearest Neighbor (BlastN vs. Geribank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


2761 


BC007150 


Mus musculus, dystroglycan 1, clone MGC:6651 IMAGE:3496914, 
mRNA, complete cds 


0.18 


2762 


BC000480 


Homo sapiens, amyloid beta precursor protein-binding protein 1, 
59kD, clone MGC:8690 IMAGE:2964638, mRNA, complete cds 


8E-65 


2764 


XM 036683 


Homo sapiens a disintegrin-like and metalloprotease (reprolysin type) 
with thrombospondin type 1 motif, 3 (ADAMTS3), mRNA 


3E-46 


2765 


AJ005581 


Encephalitozoon cuniculi complete rDNA unit (16S, 5.8S, 23S, ITS1 
andIGS) 


0.42 


2766 


AE005227 


Escherichia coli 0157:H7 EDL933 genome, contig 1 of 3, section 51 
of 155 


0.007 


2767 


XM 048172 


Homo sapiens hypothetical protein DKFZp434F1017 
(DKFZP434F1017), mRNA 


3.8 


2769 


AF129087 


Medicago sativa mitogen-activated protein kinase homologue (TDY1) 
gene, complete cds 


0.66 


2770 


XM_008176 


Homo sapiens MYB binding protein (P160) la (MYBBP1 A), mRNA 


5.2 


2771 


AC024865 


Caenorhabditis elegans cosmid Y74C10AL, complete sequence 


0.039 


2772 


XM_036164 


Homo sapiens RAN binding protein 2 (RANBP2), mRNA 


1E-59 


2773 


AL023816 


Caenorhabditis elegans cosmid T05G11, complete sequence 


0.64 


2774 


XM_004980 


Homo sapiens cystic fibrosis transmembrane conductance regulator, 
ATP-binding cassette (sub-family C, member 7) (CFTR), mRNA 


0.016 


2775 


XM_032209 


Homo sapiens hypothetical gene supported by AK023337 
(TLOC90509), mRNA 


0.018 


2776 


Z69650 


Human DNA sequence from cosmid L69F7B, Huntington's Disease 
Region, chromosome 4pl6.3 contains Huntington Disease (HD) gene 


0.64 


2777 


AJ305053 


Caprine arthritis-encephalitis virus proviral partial pol gene for Pol 
protein (reverse transcriptase), isolate CAEVBr/UFRGS-2/C767 


0.46 


2778 


AK017531 


Mus musculus 8 days embryo cDNA, RIKEN full-length enriched 
library, clone:5730409F18, full insert sequence 


0.67 


2779 


AF028784 


Rattus norvegicus glial fibrillary acidic proteins alpha and delta 
(GFAP) gene, alternatively spliced products, complete cds 


0.11 


2780 


AF101236 


Naja sputatrix neutral, phospholipase A2 (NPLA2) gene, complete cds 


0.054 


2781 


U71280 


Mesocricetus auratus cytochrome P450 aldosterone synthase 
(CYP11B2) gene, complete cds 


0.023 


2782 


X80007 


P. sativum gene for chalcone synthase 


0.15 


2783 


AF171040 


Anopheles funestus clone AF10 microsatellite sequence 1 


2 


2784 


XM_011464 


Homo sapiens hypothetical protein FLJ20170 (FLJ20170), mRNA 


0.21 


2785 


Y18929 


Homo sapiens wild type AF-4/FEL gene, part of intron 3, individual 
PB 


3 


2786 


L81581 


Homo sapiens (subclone 4_a4 from PI Hll) DNA sequence, complete 
sequence 


0.024 
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2787 


NM_024381 


Rattus norvegicus ATP-stimulated glucocorticoid-receptor 
translocaton promoter (Gyk), mRNA 


0.17 


2788 


X72688 


L.stagnalis LS-rabl mRNA 


0.2 


2789 


XM_033154 


Homo sapiens syntrophin, gamma 1 (SNTG1), mRNA 


0.021 


2790 


AB017186 


Clostridium perfringens genes of hem operon, complete cds > 


L4 


2791 


U47416 


Solanum allophyllum NADH dehydrogenase subunit (ndhF) gene, 
chloroplast gene encoding chloroplast protein, partial cds 


0.2 


2792 


M21258 


Human small nuclear ribonucleoprotein (snRNP) E gene, exon 5 and 
Alu repeats copy N and O 


0.051 


2793 


L05004 


Staphylococcus aureus dehydroquinate synthase (aroB) gene, 3* end 
cds; 3-phosphoshikimate-l-carboxyv^ (aroA) gene, 
complete cds; ORF3, complete cds 


0.21 


2794 


AF302005 


Uma scoparia isolate ROM3406 cytochrome b gene, partial cds; 
mitochondrial gene for mitochondrial product 


2 


2795 


AF288209 


Homo sapiens beta galactosyltransferase bGalT7 mRNA, complete cds 


2 


2796 


AB048897 


Macaca fascicularis brain cDNA, clone:QnpA-17253 


1.7 


2797 


AK025054 


Homo sapiens cDNA: FLJ21401 fis, clone COL03678 


0.071 


2798 


U39886 


Bos taurus Y-chromosome specific genomic sequence 


0.002 | 


2799 


AJ277649 


Caenorhabditis elegans mRNA for CHE-14 protein 


0.6 


2800 


AF058764 


Xenopus laevis bone morphogenetic protein 4 (BMP-4) gene, 
complete cds 


0.56 


2801 


AE001398 


Plasmodium falciparum chromosome 2, section 35 of 73 of the 
complete sequence 


0.15 


2802 


AF079456 


O'nyong-nyong virus strain SG650, complete genome 


0.003 


2803 


AF087913 


Human endogenous retrovirus HERV-P-T47D 


! 6 


2804 


1 AB005053 


Anomala cuprea entomopoxvirus gene for spheroidin, complete cds 


1.8 


2805 


AF027656 


Homo sapiens cholesteryl ester transfer protein gene, promoter region 


0.004 


2806 


XM 031478 


Homo sapiens vacuolar protein sorting protein 18 (VPS18), mRNA 


1.7 


2807 


XM 001718 


Homo sapiens KIAA0475 gene product (KIAA0475), mRNA 


0.023 


2808 


L21906 


Human troponin I, slow-twitch isoform (TNNI1) gene, exons 2-4 


1.5 


2809 


NM 006633 


Homo sapiens IQ motif containing GTPase activating protein 2 
(IQGAP2), mRNA 


5E-39 


2810 


AE001541 


Helicobacter pylori, strain J99 section 102 of 132 of the complete 
genome 


0.2 


2811 


M63577 


S.cerevisiae SFP1 gene, complete cds 


0.023 


2812 


XM 004980 


Homo sapiens cystic fibrosis transmembrane conductance regulator, 
ATP-binding cassette (sub-family C, member 7) (CFTR), mRNA 


0.013 


2813 


AJ305053 


Caprine arthritis-encephalitis virus proviral partial pol gene for Pol 
protein (reverse transcriptase), isolate CAEV Br/UFRGS-2/C767 


0.5 
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2814 


AK014450 


Mus musculus 18 days pregnant adult female placenta and extra 
embryonic tissue cDNA, RJKEN full-length enriched library, 
clone:3830422A13, full insert sequence 


0.075 


2815 


X99691 


B.taurus DNA for agouti gene 


2 


2816 


Z64630 


H.sapiens CpG island DNA genomic Msel fragment, clone 152h5, 
forward read cpgl52h5.ftla 


0.14 | 


2817 


AF357971 


Ovine pulmonary adenocarcinoma virus, complete sequence; and 
flanking Ovis aries sequence 


4.9 


2818 


AE003336 


Drosophila melanogaster genomic scaffold 142000013385800, 
complete sequence 


0.16 


2819 


U72236 


Dictyostelium discoideum ModA (modA) gene, complete cds 


0.61 


2820 


XM 018110 


Homo sapiens gap junction protein, beta 3, 31kD (connexin 31) 
(GJB3), mRNA 


5.1 


2821 


AJ005588 


Capsicum annuum gene encoding 5-epi-aristolochene synthase, exons 
lto7 


0.23 


2822 


AK022842 


Homo sapiens cDNA FU12780 fis, clone NT2RP2001813 


0.074 


2823 


AF138281 


Arabidopsis thaliana phospholipase D-gamma-2 mRNA, complete cds 


0.071 


2824 


AB040882 


Homo sapiens mRNA for KIAA1449 protein, partial cds 


0.18 


2825 


AB022156 


Mus musculus Cctb gene for chaperonin containing TCP-1 beta 
subunit, complete cds 


0.5 


2826 


BC005516 


Mus musculus, RIKEN cDNA 1200007D18 gene, clone MGC:7601 
IMAGE:3494155, mRNA, complete cds 


0.51 


2827 


AK025207 


Homo sapiens cDNA: FLJ21554 fis, clone COL06330 


4.4 


2828 


NM 018798 


Mus musculus ubiquilin 2 (Ubqln2), mRNA 


0.043 


2829 


Z23770 


H. sapiens (D17S842) DNA segment containing (CA) repeat; clone 
AFM240xe5; single read 


0.00004 


2830 


BC005762 


Mus musculus, Similar to proteasome (prosome, macropain) subunit, 
alpha type 1, clone MGC:6546 1MAGE:2655483, mRNA, complete 
cds 


0.67 


2831 


AB047932 


Macaca fascicularis brain cDNA, clone:QnpA-10158 


0.68 


2832 


U19765 


Human nucleic acid binding protein gene, complete cds 


0.18 


2833 


AF200327 


Plasmodium chabaudi unknown genes 


0.028 


2834 


U67544 


Methanococcus jannaschii section 86 of 150 of the complete genome 


0.14 


2835 


BC010357 


Homo sapiens, Similar to hypothetical protein, clone MGC: 13367 
IMAGE:4249317, mRNA, complete cds 


2E-55 


2836 


AE007471 


Streptococcus pneumoniae section 154 of 194 of the complete genome 


5.2 


Zo3 / 


AJ320185 


Solanum tuberosum snl gene for snakin-1, exons 1-2 


0.44 


2838 


AF039954 


Homo sapiens CC chemokine LCC-1 precursor, gene, complete cds 


0.002 


2839 


Y13631 


Clostridium botulinum P-21, P-47 ntnh, bonT genes 


0.14 


2840 


AL390181 


Homo sapiens mRNA; cDNA DKFZp547 J125 (from clone 
DKFZp547J125) 


0.019 
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2841 


AF366200 


Diadema antillarum isolate dmid3 tRNA-Lys gene, partial sequence; 
ATPase 8 gene, complete cds; and ATPase 6 gene, partial cds; 
mitochondrial genes for mitochondrial products 


6.1 


2842 


AJ237688 


Canis familiaris polymorphic marker clone XA07 


0.53 


2843 


AF244979 


Mus musculus galectin-1 1 mRNA, variant b, complete cds, 
alternatively spliced 


0.25 


2844 


AL390174 


Homo sapiens mRNA; cDNA DKFZp547J184 (from clone 
DKFZp547J184) 


0.21 


2845 


U80230 


Human clotting factor VIII gene, junction regions of the deletion of 
exons 15 through 20 and LINE- 1 -like repeat region 


2E-09 


2846 


U31086 


Gallus gallus neuron-glia adhesion molecule (Ng-CAM) gene, partial 
cds, exons 1 and 2 


0.064 


2847 


D 16685 


Rice gene for lactate dehydrogenase, complete cds 


0.003 


2848 


AB007459 


Chara corallina ccml mRNA for myosin, complete cds 


0.067 | 


2849 


U63328 


Pissodes strobi vitellogenin (Vtg) mRNA, partial cds 


0.68 


2850 


U50727 


Human p40-phox (NCF4) gene, exon 8 


0.023 


2851 


NM_017190 


Rattus norvegicus Myelin-associated glycoprotein (Mag), mRNA 


6.5 


2852 


AF361105 


Homo sapiens interleuktn 9 (IL9) gene, complete cds 


* 0.37 


2853 


AP001416 


Homo sapiens genomic DNA, chromosome 21q22.2, clone:D34, 
LB7T-ERG region, complete sequence 


1.5 ! 


2854 


Y16771 


Phylica pubescens chloroplast partial traL gene & intergenic spacer 


0.6 


2855 


AE005624 


Escherichia coli 0157:H7 EDL933 genome, contig 3 of 3, section 243 
of 290 


0.075 


2856 


AF159568 


Microbotryum violaceum ammonium transporter MEPa (MEPa) 
mRNA, complete cds 


0.7 


2857 


M34149 


S.cerevisiae mitochondrial gap 12 sequence 


0.51 


2858 


D16541 


Chicken DNA for connectin(titin), partial sequence 


0.064 


2859 


AC007039 


Homo sapiens BAC clone RP1 1-263 A15 from Y, complete sequence 


e-149 


2860 


AJ294714 


Pichia anomala leu2 gene for beta-isopropylmalate dehydrogenase 


0.056 


2861 


XM_051160 


Homo sapiens KIAA1463 protein (KIAA1463), mRNA 


0.22 


2862 


XM 031527 


Homo sapiens leucine-rich PPR-motif containing (LRPPRC), mRNA 


0.0003 


2863 


AF323443 


Plasmodium yoelii yoelii clone 1 1 235 kDa rhoptry protein (IIa.2) 
gene, partial cds 


0.02 


2864 


Y14324 


Staphylococcus aureus partial ORF292 and ORF271, ORF331 and 
ORF314 


0.62 


2865 


BC007568 


Homo sapiens, clone IMAGE:3028427, mRNA partial cds 


0.16 


2866 


I U67518 


Methanococcus jannaschii section 60 of 150 of the complete genome 


0.67 


2867 


AJ133742 


Sus scrofa ryr3 gene (partial), exon 2 


0.68 


2868 


i AF1 16520 


Mus musculus periplakin (ppl) gene, exons 2-14 


0.083 


2869 


D38044 


Human gene for Ah-receptor, exon 7-9 


0.25 
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NO 
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2870 


XM010162 


Homo sapiens methyl CpG binding protein 2 (Rett syndrome) 
(MECP2), mRNA 


1.7 


2871 


AF303030 


Lavatera phoenicea isolate 804 internal transcribed spacer 1, partial 
sequence, 5.8S ribosomal RNA, complete sequence and internal 
transcribed spacer 2, partial sequence 


0,01 


2872 


AE002274 


Chlamydia muridamm, section 6 of 85 of the complete genome 


1.8 


2873 


AB062056 


Homo sapiens p53DINPl gene for p53DINPla, p53DINPlb, complete 
cds, alternative splicing 


8E-26 


2874 


AF150965 


Bacillus anthracis plasmid pXOl resolvase X (resX) and transposase 
X (traX) genes, complete cds 


1.6 


2875 


U90292 


Human chromosome 16 duplicated adrenoleukodystrophy (ALD) 
gene, 5* breakpoint sequence 


3E-41 


2876 


U75361 


Rattus norvegicus Muncl3-3 mRNA, complete cds 


3E-18 


2877 


AF006686 


Plecotus auritus clone Paur06 microsatellite sequence 


5E-10 


2878 


NM 009694 


Mus musculus apolipoprotein B editing complex 2 (Apobec2), mRNA 


0.00009 


2879 


AF315316 


Vigna radiata ACC oxidase gene, complete cds 


0.003 


2880 


AE005714 


Caulobacter crescentus section 40 of 359 of the complete genome 


1.5 


2881 


AL451135 


Human DNA sequence from clone RP11-45G20 on chromosome 6, 
complete sequence [Homo sapiens] 


1.9 


2882 


AF315316 


Vigna radiata ACC oxidase gene, complete cds 


0.003 


2883 


NMJ)09694 


Mus musculus apolipoprotein B editing complex 2 (Apobec2), mRNA 


0.00008 


2884 


AB047932 


Macaca fascicularis brain cDNA, clone:QnpA-10158 


0.16 


2885 


U18671 


Human Stat2 gene, complete cds 


0.23 


2886 


AB040670 


Bombyx mori copg2 mRNA for nonclathiin coat protein gamma2- 
COP, partial cds 


0.074 


2887 


AK017987 


Mus musculus adult male thymus cDNA, RIKEN full-length enriched 
library, clone: 5830443 J22, full insert sequence 


0.035 


2889 


AE001419 


Plasmodium falciparum chromosome 2, section 56 of 73 of the 
complete sequence 


0.12 


2890 


AF275225 


Eriosoma lanieenim 12S small subunit ribosomal RNA eene nartial 
sequence; tRNA-Val gene, complete sequence; and 16S large subunit 
ribosomal RNA gene, partial sequence; mitochondrial genes for 
mitochondrial products 


0.35 


2891 


AF372971 


Arabidopsis thaliana AT3g23580/MDB19_7 mRNA, complete cds 


0.53 


2892 


AJ404851 


Homo sapiens partial ARNT gene for aryl hydrocarbon receptor 
nuclear translocator, exon 4 


3.7 


2893 


AJ012380 


Fugu rubripes partial rara gene, exons 4-10 


1.4 


2894 


AB064593 


Escherichia coli icdA gene for isocitrate dehydrogenase, complete cds 


0.027 


2895 


AF 182444 


Drosophila melanogaster disco-interacting protein (dipl) gene, 
alternatively spliced products,jcomplete cds; and TO42 gene, partial 
cds 


1.6 
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2896 


AY020760 


Oryza sativa microsatellite MRG3085 containing (TA)X13, closest to 
marker R2976, genomic sequence 


0.007 


2897 


AE006318 


Lactococcus lactis subsp. lactis IL1403 section 80 of 218 of the 
complete genome 


0.53 


2898 


X01973 


Mouse gene for interferon alpha 4 (Mu IFN-alpha 4) 


3.9 


2899 


AF176677 


Candida albicans gamma-glutamylcysteine synthetase (GCS1) gene, 
complete cds 


0.07 


2900 


AF109009 


Prolaupala kukui clone 1092 transposon Laul sequence 


3.6 


2901 


D49739 


Chicken mRNA for leucine zipper protein, complete cds 


2.4 


2902 


U34610 


Mus musculus alpha-l(XVm) collagen (COL18A1) gene, exon 6 and 
7 


0.006 


2903 


AK026763 


Homo sapiens cDNA: FLJ23 1 10 fis, clone LNG07802 


0.34 


2904 


AF343914 


Campylobacter jejuni NCTC 11828 LOS biosynthesis cluster, partial 
sequence 


0.024 


2905 


AE002722 


Drosophila melanogaster genomic scaffold 142000013385665, 
complete sequence 


0.007 


2906 


XM 027242 


Homo sapiens KIAA0805 protein (KIAA0805), mRNA 


1.1 


2907 


Y15082 


Homo sapiens pl4.5 gene, partial exon 1 and promoter; hPOPl gene 
exon 1 and promoter and bidirectional promoter region 


0.67 


2908 


XM 009536 


Homo sapiens GTPase GES; REM protein (REM), mRNA 


5.9 


2909 


XM 036351 


Homo sapiens nuclear transcription factor Y, alpha (NFYA), mRNA 


1E-32 


2910 


AB037851 


Homo sapiens mRNA for KIAA1430 protein, partial cds 


0.58 


2911 


XM 002943 


Homo sapiens similar to PRO0327 protein (H. sapiens) (LOC92570), 
mRNA 


2.4 


2912 


AF069179 


Lysiphlebus cardui NADH dehydrogenase 1 gene, mitochondrial gene 
encoding mitochondrial protein, partial cds 


0.008 


2913 


AJ243545 


Arabidopsis thaliana gene for 67 kD chloroplastic RNA-binding 
protein, P67 


0.54 


2914 


AJ295795 


Isolepis producta chloroplast trnL gene intron 


2.2 


2915 


AJ3 11050 


Medicago sativa pr 10.2 gene for class 10 PR protein 


0.26 


2916 


AK021512 


Homo sapiens cDNA FLJ1 1450 fis, clone HEMBA1001432 


0.25 


2917 


AK002979 


Mus musculus adult male brain cDNA, RIKEN full-length enriched 
library, clone:0710001P07, full insert sequence 


0.023 


2918 


AY007505 


Streptococcus mitis phage SMI PblA (pblA) and PblB (pblB) genes, 
complete cds; lysin (lys) gene, partial cds; and unknown genes 


0.026 


2919 


AK022043 


Homo sapiens cDNAFLJ11981 fis, clone HEMBB1001317 


0.69 


2920 


AF162969 


Mycoplasma capricolum capricolum strain 960038 fructose 
biphosphate aldolase gene, partial cds 


0.064 


2921 


XM 051364 


Homo sapiens similar to immediate early protein (H. sapiens) 
(LOC93434), mRNA 


0.086 


2922 


XM 027642 


Homo sapiens KIAA0349 protein (KIAA0349), mRNA 


0.24 


2923 


XM 045819 


Homo sapiens cofilin 2 (muscle) (CFL2), mRNA 


1.9 


2924 


AF146362 


Drosophila melanogaster Huntington disease protein homolog 
(Hsap\HD) mRNA, complete cds 


0.23 
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NO 
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P VALUE 


2925 


U91993 | 


Solanum chacoense transposable element Sch2, sequence 


0.027 


2926 


AK021782 


Homo sapiens cDNA FLJ11720 fis, clone HEMBA1005293 


2.3 


2927 


AF274444 


Carsonella ruddii ribosomal protein LI 1 (rpLl 1), ribosomal protein 
LI (rpLl), ribosomal protein L10 (ipLlO), ribosomal protein L7/L12 
(rpL7/L12), beta subunit of RNA polymerase (rpoB), beta-prime 
subunit of RNA polymerase (ipoC), ribosomal protein S12 > 


2 


2928 


AF003047 


Simian immunodeficiency virus isolate P2S1 patient P2, gpl20 (env) 
gene, partial cds 


0.064 


2929 


AK022102 


Homo sapiens cDNA FLJ12040 fis, clone HEMBB 1001944 


0 


2930 


AC004239 


Homo sapiens BAC clone CTB-34F8 from 7pl5-p21, complete 
sequence 


. 0.001 


2931 


AF104819 


Strumaria truncata tmL gene, partial sequence; chloroplast gene for 
chloroplast product 


0.25 


2932 


AF171916 


Trimeresurus flavomaculatus cytochrome b (cytb) gene, partial cds; 
mitochondrial gene for mitochondrial product 


0.21 


2933 


AJ011596 


Homo sapiens trapped 3' terminal exon, clone B2E8 


0.0003 


2934 


AF269137 


Homo sapiens GABAA receptor gamma 3 subunit (GABRG3) gene, 
exon 3 


0.8 


2935 


AB030586 


Arabidopsis thaliana AATL1 gene for amino acid transporter-like 
protein 1, complete cds 


1.9 


2936 


L76484 


Cronartium appal achianum isolate Ca-1 18S ribosomal RNA gene, 
partial sequence; internal transcribed spacer 1, 5.8S ribosomal RNA 
gene and internal transcribed spacer 2, complete sequence; and 26S 
ribosomal RNA gene, partial sequence 


0.71 


2937 


AF366570 


Trypanosoma congolense calreticulin mRNA, complete cds 


0.084 


2938 


NM 004682 


Homo sapiens PC4 and SFRS1 interacting protein 1 (PSIP1), mRNA 


2 


2939 


AL050219 


Homo sapiens mRNA; cDNADKFZp586J1623 (from clone 
DKFZp586J1623) 


1.8 


2940 


AF275687 


Plasmodium falciparum knob-associated kistidine-rich protein 
(KAHRP) gene, complete cds 


0.009 


2941 


AF142701 


Vigna umbellata maturase-like protein (matK) gene, complete cds; 
chloroplast gene for chloroplast product 


0.56 


2942 


AF226775 


Mitthyridium undulatum small ribosomal protein 4 (rps4) gene, 
partial cds; chloroplast gene for chloroplast product 


0.073 


2943 


XM 034735 


Homo sapiens similar to hypothetical protein FLJ21463 (H. sapiens) 
(LOC90883), mRNA 


0 


2944 


AK017899 


Mus musculus adult male thymus cDNA, RIKEN full-length enriched 
Hbrary, clone:5830405M20, full insert sequence 


0.28 


2945 


Y10159 


D.discoideum racGAP gene 


6.2 


2946 


XM 011562 


Homo sapiens hypothetical gene supported by NM_000181 
(LOC92707), mRNA 


0.025 


2947 


AF191257 


Trichuris trichiura microsatellite ttstrl4 


0.078 


2948 


Y08449 


E.caballus DNA segment containing CA microsatellite VHL137 


0.00003 
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SEQID 
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2949 


AK003959 


Mus musculus 18 days embryo cDNA, RJKEN full-length enriched 
library, clone: 1110029G12, full insert sequence 


0.077 


2950 


NM_0 12779 


Rattus norvegicus Aquaporin 5 (Aqp5), mRNA 


2.2 


2951 


AJ404295 


Borelis schlumbergeri 18S rRNA gene, isolate 191 


0.25 


2952 


AE001365 


Plasmodium falciparum chromosome 2, section 2 of 73 of the 
complete sequence 


0.49 


2954 


Y13434 


Ovis aries mRNA for thyroid stimulating hormone receptor 


1.8 


2955 


AB026906 


Homo sapiens SDHD gene for small subunit of cytochrome b of 
succinate dehydrogenase, complete cds 


0.001 


2956 


BC010367 


Homo sapiens., hypothetical protein FLJ20534, clone MGC: 13432 
IMAGE:4334172, mRNA, complete cds 


4 


2957 


AJ251957 


Ciona intestinalis mRNA for nuclear lamin (lamin L2 gene) 


2 


2958 


AK016695 


Mus musculus adult male testis cDNA, RJKEN foil-length enriched 
library, clone:4933406J08, full insert sequence 


2E-14 


2959 


Z13985 


M.musculus control region sequence DNA 


0.072 


2960 


AL591590 


Human DNA sequence from clone CTD-2053G21 on chromosome 
Xql3.3-21.2, complete sequence [Homo sapiens] 


0.007 


2961 


AB002807 


Glycine max DNA for nodulin 35, partial cds 


0.24 


2962 


AE001399 


Plasmodium falciparum chromosome 2, section 36 of 73 of the 
complete sequence 


0.003 


2963 


AF298224 


Homo sapiens RPCI-II 289E13 CD21 gene, partial cds 


0.00001 


2964 


XM 018458 


Homo sapiens similar to hypothetical protein PR02822 (R sapiens) 
(LOC93537), mRNA 


0.21 


2965 


t AF288778 


Homo sapiens erythroleukemia translocation breakpoint region 


0.068 


2966 


AF073994 


Drosophila melanogaster gamma-tubulin (GTUB) gene, partial cds; 
and recombination repair protein 1 (RRP1) and thymidylate synthase 
(TS) genes, complete cds; and RNA-binding protein (RBP9) gene, 
partial sequence 


0.14 


2967 


AL121822 


Human DNA sequence from clone 346P1 1 on chromosome Xq21.2- 
21.33, complete sequence [Homo sapiens] 


0.0000001 


2968 


U31284 


Mesotaenium caldariorum clone mesphylb phytochrome gene, 
complete cds 


0.59 


2969 


XMJ)08806 


Homo sapiens mitogen-activated protein kinase 4 (MAPK4), mRNA 


1.8 


2970 


XM 027307 


Homo sapiens KIAA1209 protein (KIAA1209), mRNA 


0.001 


2971 


AF179711 


Hylobates sp. RH50 glycoprotein (RHAG) gene, intron 9 


0.56 


2972 


BC007133 


Mus musculus, apoptosis inhibitory protein 5, clone MGC:5830 
IMAGE:3594358, mRNA, complete cds 


5.4 


2973 


AF282018 


Homo sapiens clone 15qtel_ctg0082 c262bt3_c290bt3 sequence 


2E-53 


2974 


AE004790 


Pseudomonas aeruginosa PA01, section 351 of 529 of the complete 
genome 


1.8 


2975 


XM 003050 


Homo sapiens similar to seven transmembrane domain orphan 
receptor (H. sapiens) (LOC63108), mRNA 


0.27 


2976 


AF353728 


Mus musculus myo-inositol monophosphatase 1 (Impal) gene, exons 
1 through 7 


1.2 
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NO 
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P VALUE 


2977 


1 

AK005535 . 


Mus musculus adult female placenta cDNA, RIKEN foil-length 
enriched library, clone:1600025H15, foil insert sequence 


0.55 


2978 


AJ002479 : 


Medicago truncatula ENBP1 gene, exons 1 to 12 


0.64 


2979 


AF154836 


Homo sapiens Ras-like GTP-binding protein (RAB27 A) gene, exons 
lb and 2 


0.000001 


29S1 


AF260819 


Plasmodium falciparum CGI protein gene, complete cds 


0.73 


2982 


AE006714 


Sulfolobus solfataricus section 73 of 272 of the complete genome 


0.02 


2984 


NM_004114 


Homo sapiens fibroblast growth factor 13 (FGF13), mRNA 


6.7 


2985 


AF155369 


Buthus martensii putative potassium ion channel blocker TXKsl 
mRNA, complete cds 


0.72 


2986 


XMJ)41836 


Homo sapiens similar to MHC class I region ORF (H. sapiens) 
(LOC91955), mRNA 


0.18 


2987 


AF153668 


Microcephala discoidea NADH dehydrogenase (ndhF) gene, partial 
cds; chloroplast gene for chloroplast product 


1.7 


2988 


AF216211 


Drosophilabuzzatii alpha-esterase 3 (aE3) and alpha-esterase 4b 
(aE4b) genes, complete cds 


0.073 


2989 


AF352244 


Dermatophagoides farinae paramyosin-like allergen mRNA, partial 
cds 


0.025 


2990 


XM_007409 


Homo sapiens alkylation repair, alkB homolog (ABH), mRNA 


0.21 


2991 


AF005779 


Sus scrofa neuropeptide Y Yl receptor mRNA, complete cds 


0.66 


2992 


AF264749 


Arabidopsis thaliana hydrophobic protein RCI2B (RCI2B) and 
hydrophobic protein RCI2 A (RCI2 A) genes, complete cds 


0.022 


2994 


AL162033 


Homo sapiens mRNA; cDNA DKFZp434F1872 (from clone 
DKFZp434F1872) 


0.69 


2995 


AK013593 


Mus musculus adult male hippocampus cDNA, RIKEN full-length 
enriched library, clone:2900024O09, foil insert sequence 


0.7 


2996 


AF222716 


Plasmodium falciparum clone 3D7 unconventional myosin PfM-B 
gene, complete cds 


0.027 


2997 


AK006599 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone: 1700034J06, foil insert sequence 


2.2 


2998 


AF01O509 


Plasmodium falciparum microsatellite TA39 sequence 


0.001 


2999 


AF070566 


Homo sapiens clones 24611 and 24705 mRNA sequence 


0.000003 


3000 


AF181676 


Homo sapiens clone 204112 polymorphic microsatellite sequence 


0.074 


3001 


M17169 


Chinese hamster glucose-regulated protein GRP78 mRNA, complete 
cds 


0.008 


3002 


NM 031236 


Rattus norvegicus alpha 1,2-focosyltransferase (Futl), mRNA 


0.69 


3003 


XM 016321 


Homo sapiens region containing TBX3-iso protein; T-box 3 (ulnar 
mammary syndrome) (LOC82353), mRNA 


0.73 


3004 


BC005379 


Homo sapiens, Similar to plasminogen, clone MGC: 12496 
IMAGE:3934860 9 mRNA, complete cds 


0.000001 


3005 


AF276176 


Scaphirhynchus platorynchus microsatellite Spl-107 sequence 


0.009 


3006 


XM 010048 


Homo sapiens like mouse brain protein E46 (E46L), mRNA 


0.47 


3007 


NC 001807 


Human mitochondrion, complete genome 


1E-10 


3008 


AE006103 


Pasteurella multocida PM70 section 70 of 204 of the complete genome 


; 0.4 
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P VALUE 


3009 


AE006266 


Lactococcus lactis subsp. lactis IL1403 section 28 of 218 of the 
complete genome 


0.2 


3010 


AF022812 


Dehalospirillum multivorans GTP cyclohydrolase II (rib A) gene, 
partial cds; and 3,4-dihydroxy-2-butanone-4-phosphate synthase 
(ribB), riboflavin synthase (ribH), tetrachloroethene reductive 
dehalogenase catalytically active subunit (pceA), and tetrachl> 


0.21 


3011 


AJ243901 


Mycoplasma hominis p75 gene, strain PG21 


1.4 


3013 


XM 027894 


Homo sapiens hypothetical protein DKFZp761C121 
(DKFZp761C121), mRNA 


5E-33 


3014 


AJ390525 


Candida albicans ORF DNA, clone 64gB 


0.52 


3015 


AJ006873 


Buchnera aphidicola plasmid pBUsl, repA2 gene, leuA gene and 
ORF1 


0.013 


3016 


L07898 


Entamoeba histolytica alpha- 1 tubulin (tubl) gene, complete cds 


0.024 


3017 


L25107 


Human LIS mRNA 


1.2 


3018 


AL122122 


Homo sapiens mRNA; cDNA DKFZp434L098 (from clone 
DKFZp434L098) 


0.007 


3019 


V01270 


Rattus norvegicus genes for 18S, 5.8S, and 28S ribosomal RNAs 


2E-10 


3020 


AK023061 


Homo sapiens cDNA FLJ12999 fis, clone NT2RP3000324 


0.12 


3021 


U80449 


Caenorhabditis elegans cosmid R10A10 


0.29 


3022 


AF355473 


Oryzias latipes germ cell-less protein (gel) mRNA, complete cds 


'2.3 


3023 


XM 037811 


Homo sapiens hypothetical gene supported by AL1 17650 
(LOC91349), mRNA 


0.39 


3024 


AK016492 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:493 1430N09, full insert sequence | 


0.004 


3025 


XM 011189 


Homo sapiens similar to DKFZP564K247 protein (H. sapiens) 
(LOC91537), mRNA 


4 


3026 


BC010710 


Mus musculus, Similar to ribosomal protein L19, clone MGC:6500 
IMAGE.2648593, mRNA, complete cds 


5.1 


3027 


XM_017923 


Homo sapiens hypothetical protein FLJ12660 (FLJ12660), mRNA 


4.1 


3028 


AC016671 


Homo sapiens BAC clone RP11-12J24 from Y, complete sequence 


0.012 


3029 


L13662 


Campylobacter jejuni major cell-binding fector (pebl) and receptor 
genes, complete cds 


0.044 


3030 


X77172 


M.musculus pF2 genomic DNA, homology with mouse repetitive 
elements Bl, B2 and human Alu 


0.008 


3031 


AF074841 


Campylobacter lari tRNA-Ala and tRNA-Ile genes, complete sequence 


0.016 


3032 


AF272846 


Homo sapiens Fanconi anemia complementation group D2 protein 
(FANCD2) gene, exon 1 


0.065 


3033 


D50090 


Drosophila americana texana G-3-P dehydrogenase (alphaGpdh) 
gene, exonl-8, complete cds 


0.13 


3034 


AE001417 


Plasmodium falciparum chromosome 2, section 54 of 73 of the 
complete sequence 


0.15 


3035 


AK002007 


Homo sapiens cDNA FLJ11 145 fis, clone PLACE1006626, highly 
similar to Homo sapiens mRNA for KIAA0928 protein 


0.026 
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3036 


AL5 12432 


Human DNA sequence from clone RP1 1-245H22 on chromosome 6 
Contains GSSs, complete sequence [Homo sapiens] 


0.00003 


3037 


AB030490 


Glycine max SG-05 gene for thiamin biosynthetic enzyme, complete 
cds 


0.003 


3038 


U05820 


Saccharomyces cerevisiae DA-box protein Smc2p (SMC2) gene, 
complete cds, and assimilatory sulfite reductase (MET10) gene, 
partial cds 


1.7 


3039 


AF098761 


Gracilaria gracilis microsatellite Ggl55R sequence 


0.07 


3041 


AF045595 


Carassius ainratus clone gf-40 glutamic acid decarboxylase isoform 67 
(GAD67) mRNA, complete cds 


2 


3042 


AJ404308 


Sorites marginalis 18S rRNA gene, isolate 836 


0.009 


3043* 


AE006495 


Streptococcus pyogenes Ml GAS strain SF370, section 24 of 167 of 
the complete genome 


6 


3044 


XM_030781 


Homo sapiens enigma (LIM domain protein) (ENIGMA), mRNA 


3E-18 


3045 


AC024748 


Caenorhabditis elegans cosmid Y110A2AR, complete sequence 


0.15 


3046 


AL512282 


Human DNA sequence from clone XXyac-BA141F18 on chromosome 
X, complete sequence [Homo sapiens] 


0.06 


3047 


AF009980 


Otolemur garnetti color vision opsin gene, exons 5 and 6, and 
complete cds 


0.17 


3048 


AF284953 


Carex exsiccata tRNA-Thr, partial sequence; trnT-trnL intergenic 
spacer, tRNA-Leu, and trnL-tniF intergenic spacer, complete 
sequence; and tRNA-Phe, partial sequence; chloroplast genes for 
chloroplast products 


0.23 


3049 


AE004036 


Xylella fastidiosa 9a5c, section 182 of 229 of the complete genome 


4.3 


3050 


X94767 


Hsapiens RP3 gene (XLRP gene 2) 


0.002 


3051 


AF279135 


Dictyosielium discoideum spore coat structural protein SP65 (cotE) 
gene, complete cds 


0.003 


3052 


AF277452 


Rattus norvegicus Pkdl mRNA, partial cds 


0.12 


3053 


AF327114 


Homo sapiens clone 8qtel cl9t7 sequence 


8E-17 


3054 


AB052201 


Macaca fascicularis brain cDNA, clone:QnpA-20394 


0.59 


3055 


AJ235314 


Antilocapra americana mitochondrial DNA for D-Ioop 


0.17 


3056 


AF361481 


Homo sapiens mitochondrial GTP-binding protein 1 (GTPBP3) gene, 
complete cds; nuclear gene for mitochondrial product 


0.009 


3057 


AE006879 


Sulfolobus solfataricus section 238 of 272 of the complete genome 


1.6 


3058 


AJ277756 


Legionella pneumophila ORF X (partial), recA gene, recX gene and 
sya gene (partial) 


U.Zl 




U73044 


Schizosaccharomyces pombe ARS binding protein 2 (abp2) mRNA, 
complete cds 


0.058 


3060 


AF043123 


Homo sapiens interleulrin-7 receptor precursor (BL7R) gene, exon 1 


0.2 


3061 


L43603 


Gallus gallus Na+/K+-ATPase alpha- 1 subunit gene, promoter region 


0.48 
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NO 
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P VAT TTF 


S062 


X68976 


N.viridescens mRNA for Hoxd-10 


0.092 


3063 


AF001454 


Helianthus annuus Dc3 promoter-binding lactor-2 (Urlir-z; nuuNA, 
complete cds 


v. AO 




AF086062 


Homo sapiens ftul length insert cDNA clone yzajodh 




jUDj 


U67234 


Human clone HS5.29 Alu-Ya5 sequence 


0.087 


jUOO 


AF030199 


Mus musculns type 1 sigma receptor gene, complete cds 






J05544 


Plasmodium falciparum glucosephosphate isomerase (ggi.Rl) gene, 
complete cds 




3068 


X17455 


Rat Bmyc gene 5 '-fragment 






AF298207 


Dictyostelium discoideum transposon thug-S, complete sequence 


0.003 


3070 


NM 030238 


Mus musculus dynein, cytoplasmic, heavy chain 1 (Dnchcl), mRNA 


1.7 


3071- 


AB032151 


Homo sapiens BABP gene for bile acid-binaing protein |AKit iczj, 
exon 1 to 7 


0.28 




NC_001481 


Feline calicivirus, complete genome 


1.9 


3073 


U49266 


Mus musculus growth hormone receptor/binding protein gene, exons 
9-10, and partial cds 




jU /** 


Z24279 


R sapiens (D1S489) DNA segment containing (CA) repeat; clone 
AFM309ve9; single read 




3075 


AF228724 


Rhagoletis conversa NADH-dehydrogenase subunit 4 (ND4) gene, 
partial sequence; NADH-dehydrogenase subunit 4L (ND4L) gene, 
complete sequence; and tRNA-Thr gene, partial sequence; 
mitochondrial genes for mitochondrial products 


0.048 


3076 


U72723 


Oiyza longistaminata receptor kinase-like protein (Xa21) gene, 
complete cds and family member C, pseudogene 


0.21 


^077 

j\j / / 


U55184 


Human G protein Golf alpha gene, exon 12 and complete cds 


XJZr— IO 


jU / o 


AC006119 


Mus musculus clone UWGCimbaclOgap from 14D1-DZ ( l -ceil 
Receptor Alpha Locus), complete sequence 


0.00004 


3079 


AK015900 


Mus musculus adult male testis cDNA, RIKEN iull-lengtn enncnea 
library, clone:4930526r 13, lull insert sequence 


0 76 


JUOv 


AL133637 


Homo sapiens mRNA; cDNA DKFZp434J1618 (from clone 
DKFZp434J1618); partial cds 


0.071 


3081 


S74063 


c-rai^raf protem [Xenopus laevis, mRNA, 2/36 ntj 


V/. JVJ 




NM 014498 


TT • a TT « - » - - - - - 1 — anl. j M ^^"*|>TJT *3rt\ ttiOI^ A 

Homo sapiens type II Golgi membrane protem (OPrl 3u;, mKJNA 


1 Q 

X . 37 


3083 


AF238234 


Entamoeba histolytica diaphanous protein (dia) gene, partial cds 


2.1 






Homo sapiens SWI/SNF related, matrix associated, actin dependent 
rpoiiljitnr nf chrnmjitin oiihfhmilv f member 1 ( SMARCF 1 ) » mRNA 


0.65 


3085 


AK007542 


Mus musculus 10 day old male pancreas cDNA, RIKEN full-length 
enriched library, clone: 18 100 19D05, full insert sequence 


0.62 


3086 


AF228977 


Human T-cell lymphotropic virus type 1 isolate ATLL long terminal 
repeat, partial sequence 


0.0003 


3087 


i AC091698 


Homo sapiens clone RP11-596J6, complete sequence 


0.073 


3088 


AF233882 


Homo sapiens plakogiobin (JUP) gene, exons 1 through 7 


4E-88 
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3089 


U29468 


Drosophila koepferae K04 San Luis, Argentina, Gandalf transposable 
element DNA< partial sequence 


0.2 


3090 


AF279292 


Mycoplasma hyopneumoniae strain ISU 232 YX1 (yxl) gene, partial 
cds; pi 10 membrane protein precursor, gene, complete cds; and YX2 
(yxl) gene, partial cds 


0.2 


3091 


1 AF391284 


Homo sapiens 1 lpl5.5 clone LOH1 1 A, partial sequence 


0.007 


3092 


AF259737 


Ovis aries microsatellite MNS-64 sequence 


0.66 


3093 


NM 030261 


Mus musculus hypothetical protein MGC7182 (MGC7182), mRNA 


6E-36 


3094 


AB047879 


Macaca fascicularis brain cDNA, clone:QnpA~14270 


0.00003 


3095 


X65607 


Ksapiens MT1X gene for metallothionein IX 


0.023 


3096- 


AF231164 


Pseudosymblepharis schimperiana tRNA-Leu, partial sequence; trnL- 
trnF intergenic spacer, complete sequence; and tRNA-Phe, partial 
sequence; chloroplast genes for chloroplast products 


0.06 


3097 


AF290431 


Mus musculus sequence downstream of Hyall; and Fus2 gene, 5* UTR 
and exon 1 


1.3 


3098 


X96762 


C.elegans mRNA for DNA topoisomerase I 


0.028 


3099 


AB036229 


Salganea nigrita mitochondrial COII gene for cytochrome oxidase n, 
partial cds, collection_site:Lake Danao, Leyte, Philippines 


0.53 


3100 


AJ309183 


Staphylococcus aureus partial coa gene for coagulase, type F, isolate 
rs59 


0.07 


3101 


AF240785 


Homo sapiens glutathione-S-transferase GSTT1 deletion/junction 
region sequence 


0.000004 


3102 


XM_031737 


Homo sapiens synaptosomal-associated protein, 23kD (SNAP23), 
mRNA 


0.0000004 


3103 


AY035097 


Arabidopsis thaliana putative proline-rich protein (F23H24.15) 
mRNA, complete cds 


0.18 


3104 


AF121069 


Anopheles farauti 3 12S small subunit ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


0.065 


3105 


NM 016701 


Mus musculus nestin (Nes), mRNA 


0.23 


3106 


L41893 


Homo sapiens retinoblastoma susceptibility protein (RBI) gene, exon 
5, bases 44452-44939 in LI 1910 


0.008 


3107 


Z78623 


Ksapiens flow-sorted chromosome 6 TaqI fragment, SC6pA12B7 


2E-21 


3108 


XMJ)41367 


Homo sapiens NADH dehydrogenase (ubiquinone) 1 beta subcomplex, 
4 (15kD, B15) (NDUFB4), mRNA 


2E-24 


3109 


AL117617 


Homo sapiens mRNA; cDNA DKFZp564H0764 (from clone j 
DKFZp564H0764) 


0.17 


3110 


AK023907 


Homo sapiens cDNA FLJ13845 fis, clone THYRO1000815 


7E-42 


3111 


M95396 


Yeast putative transcriptional activator (ADA2) gene, complete cds 


3.8 


3112 


NC 002670 


Bacteriophage bIL3 1 1, complete genome 


0.073 


3113 


XMJ)37870 


Homo sapiens desmoplakin (DPI, DPII) (DSP), mRNA 


2.2 


3114 


NM 032276 


Homo sapiens hypothetical protein DKFZp547E052 
(DKFZp547E052), mRNA 


0.43 



329 



3NSDOC1D: <WO 021 4500A2J_> 



WO 02/14500 PCT/US01/25840 





Table 3 A Nearest Neighbor (BlastN vs. Geribank) 


QT7/^\ TTi 
OJCfV^ JJJ 

NO 


ACCESSN 


DESCR1P 


P VALUE 


"*1 1 ^ 

3 LID 


AB048877 


Macaca fasciculans brain cDNA, clone:QnpA-16414 


3E-30 


J 1 XO 


AK014209 


Mus musculus 13 days embryo head cDNA, RDCEN full-length 
enriched library, clone:3110053B16, full insert sequence 


0.029 


1117 
.511 / 


XM 037053 


Homo sapiens rhabdoid tumor deletion region protein 1 (RTDR1), 
mRNA 


9E-47 


3118 


AF156197 


Cinara pseudotaxifolia cytochrome oxidase II gene, partial cds; 
mitochondrial gene for mitochondrial product 


0.18 


3119 


AK022006 


[Homo sapiens cDNA FLJ1 1944 lis, clone HEMBB1000684 


0.003 




U23175 


Caenorhabditis elegans cosmid F57F10, complete sequence 


0.022 


j 1Z1 


AF267198 


Candidates Carsonella ruddii natural-host Arytaina genistae ATP 
synthase alpha subunit (atpA) gene, partial cds; ATP synthase gamma 
subunit (atpG) gene, complete cds; and ATP synthase beta subunit 
(atpD) gene, partial cds 


0.019 


1 TOO 


AY026461 


Homo sapiens CUB domain containing protein 1 (CDCP1) mRNA, 
complete cds 


0.5 


3123 


NC 002471 


Typhlonectes natans mitochondrion, complete genome 


0.45 


j 1ZD 


AF328531 


Homo sapiens clone 9qtel_cl0t7 sequence 


0.00001 


J 1ZO 


AF397158 


Homo sapiens clone 11 pur alpha-associated ribosomal RNA gene, 
partial sequence 


2E-37 


3127 


AF1 10970 


HTV-1 isolate C-96BW1 1.06 country Botswana, complete genome ; 


5.3 


JlZo 


AJ237690 


Canis familiaris polymorphic marker clone XC08 


1.3 


3129 


AE001176 


Borrelia burgdorferi (section 62 of 70) of the complete genome 


0.084 


J1.5U 


AJ271333 


Campylobacter hyoilei ORF1, ORF2, ORF3 and ORF4 


6.7 


3131 


AF1 12247 


Merluccius productus Mpro-1 mitochondrial control region, partial 
sequence 


0.0008 


3132 


XM 002378 


Homo sapiens uncharacterized hypothalamus protein HT010 (HT010), 
mRNA 


. 4E-28 


3133 


AJ242726 


Xenopus laevis mRNA for p21 activated kinase 2 (PAK2 gene) 


0.25 


5 134 


AF049711 


Petunia x hybrida pollen-specific 1-aminocyclopropane-l-carboxylate 
synthase (ACS2) gene, complete cds 


0.24 


3135 


XM 003091 


Homo sapiens KIAA0001 gene product; putative G-protein-coupled 
receptor, G protein coupled receptor for UDP-glucose (KIAA0001), 
mRNA 


2.2 


JUo 


Z33368 


M.capricolum DNA for CONTIG MCABF 


0.001 


31.57 


AE001433 


Plasmodium falciparum chromosome 2, section 70 of 73 of the 
complete sequence 


0.0U1 




AK.U172J5 


Mus musculus adult male pituitary gland cDNA, RIKEN full-length 
enricneu uorary, cione.j-jju^ujjLJiu, nut lnsen sequence 


0 7 


3139 


Z57308 


H.sapiens CpG island DNA genomic Msel fragment, clone 170g9, 
reverse read cpgl70g9.rtla 


0.023 


3140 


AC005485 


Homo sapiens PAC clone RP5-998M2 from 7q33-q35, complete 
sequence 


5.4 


3141 


AF1 18654 


Leishmania donovani ATPase 6 gene; kinetoplast gene for kinetoplast 
product 


0.66 
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3142 


AJ271879 


Uncultured eubacterium plasmid pIEl 130 DNA for repA gene, repC 
gene, repB gene, mobA gene, mobB gene, mobC gene, cac gene, catm 
gene, suin gene, strA gene, strB gene and aph(3*)-I gene 


0.26 


3143 


Y00165 


Sus scrofa mRNA for link protein 


0.079 


3144 


AF283982 


Caenorhabditis elegans transcription enhancer factor-l-like protein 
EGL-44 (egl-44) gene, complete cds 


0.0007 


3145 


AE004926 


Pseudomonas aeruginosa PA01, section 487 of 529 of the complete 
genome 


0.44 


3146 


AE004738 


Pseudomonas aeruginosa PA01, section 299 of 529 of the complete 
genome 


1.4 


3147 


AC083824 


Homo sapiens clone RP1 1-91J18, complete sequence 


0.59 


3148 


AY023127 


Oryza sativa microsatellite MRG5452 containing (GCA)X9, closest to 
marker C402, genomic sequence 


0.77 


3149 


XM 002943 


Homo sapiens similar to PRO0327 protein (H sapiens) (LOC92570), 
mRNA 


0.073 


3150 


D88151 


Clostridium perfringens DNA for D-alanine:D-alanine ligase, cortical 
fragment-lytic enzyme 


0.057 


3151 


XM 009013 


Homo sapiens similar to tumor necrosis factor (ligand) superfamily, 
member 14 (H. sapiens) (LOC63472), mRNA 


1.7 


3152 


NM 012702 


Rattus norvegicus Carcinoembryonic antigen gene family (CGM1) 
(Cgml), mRNA 


1.5 


3153 


AJ131041 


Clogmia albipunctata mRNA for Hunchback protein, partial 


0.019 


3154 


AE000720 


Aquifex aeolicus section 52 of 109 of the complete genome 


5.3 


3155 


X74789 


Potato leaf roll virus RNA sequence 


0.47 


3156 


AF064478 


Homo sapiens natural resistance-associated macrophage protein 2 
(NRAMP2) gene, exons 4 and 5 


0.021 


3157 


M84761 


Rat fatty acid synthase gene, complete cds 


0.25 


3158 


AF239566 


Polycycnis oraata tRNA-Leu (trnL) gene, partial sequence; and trnL-F 
intergenic spacer region, complete sequence; chloroplast gene for 
chloroplast product 


0.007 


3159 


AF343914 


Campylobacter jejuni NCTC 11828 LOS biosynthesis cluster, partial 
sequence 


0.025 


3160 


AL137443 


Homo sapiens mRNA; cDNA DKFZp566G184 (from clone 
DKFZp566G184) 


0.99 


3161 


X54452 


D.discoideum culmination spiA (Dd31) gene 


0.71 


3162 


X80908 


L.esculentum gene for fruit ripening polygalacturonase 


0.069 


3163 


XM 007838 


Homo sapiens nuclear factor of activated T-cells 5, tonicity-responsive 
(NFAT5), mRNA 


0.5 


3164 


AF1 19385 


Mus musculus KvLQTl gene, KvLQTl-AS allele, partial intron 
sequence 


0.17 


3165 


X86449 


Exaballus DNA segment containing microsatellite 


1E-25 


3166 


Z22800 


H.sapiens microsatellite repeat 


0.000004 


3167 


U22842 


Lagos bat virus nucleoprotein gene, partial cds 


0.027 
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3168 


AF108897 


Campylobacter jejuni PglB (pglB) gene, partial cds; and PglA (pglA), 
PglC (pglC), PglD (pglD), PglE (pglE), PglF (pglF), and PglG (pglG) 
genes, complete cds 


0.73 


3169 


AE000556 


Helicobacter pylori 26695 section 34 of 134 of the complete genome 


0.016 


3170 


AK024295 


Homo sapiens cDNA FLJ14233 fis, clone NT2RP4000049 


0.46 


3171 


U13835 


Mus musculus c-abl protein (c-abl) gene, exons 1 and IB, 
alternatively spliced 


4.5 


3172 


AJ132887 


Homo sapiens telomere-junction DNA sequence, haplotype B 


6E-13 


3173 


XM 007277 


Homo sapiens hypothetical protein DKFZp761F2014 
(DKFZp761F2014), mRNA 


1.9 


3174 


L40470 


Pisum sativum (clone HW39-4) gene fragment 


0.05 


- 

3175 


AF206594 


Lacerta vivipara 12S ribosomal RNA gene, partial sequence; tRNA- 
Val gene, complete sequence; and 16S ribosomal RNA gene, partial 
sequence; mitochondrial genes for mitochondrial products 


0.69 


3176 


XM_037847 


Homo sapiens neurexophilin 3 (NXPH3), mRNA 


6.7 


3177 


AY026945 


Rattus norvegicus T:G mismatch thymine glycosylase mRNA, 
complete cds 


L6 


3178 


XM_028009 


Homo sapiens KIAA1563 protein (KIAA1563), mRNA 


2E-71 


3179 


AF250386 


Carsonella ruddii RNA polymerase beta subunit (rpoB) and RNA 
polymerase beta-prime subunit (rpoC) genes, partial cds 


0.65 


3180 


AB038968 


Homo sapiens CFLAR gene, exon 5, exon 6 


0.24 


3182 


AK023569 ( 


Homo sapiens cDNAFLT13507 fis, clone PLACE1005085 


2E-64 


3183 


AF240176 


Mus Mus musculus MRP13 mRNA 


2 


3184 


U20230 


Human guanyl cyclase C gene, partial cds 


0.2 


3185 


L18959 


Ovis aries T cell receptor gene V-region, exons 1 (3* end) and 2 (5* 
end) 


0.64 


3186 


U36785 


Rattus norvegicus putative pheromone receptor VN1 mRNA, complete 
cds 


0.73 


3187 


AF270614 


Homo sapiens clone 5qtel_c93t7 sequence 


0.075 


3188 


NM_004114 


Homo sapiens fibroblast growth factor 13 (FGF13), mRNA 


6.7 


3189 


AJ403239 


M.musculus DNA for vimentin-binding fragment VimE7 


0.076 


3190 


BC010716 


Mus musculus, clone MGC:6551 IMAGE:2655861, mRNA, complete 
cds 


4.9 


3191 


AF270162 


Staphylococcus epidermidis strain SRI clone step. 105 le05 genomic 
sequence 


0.56 


3192 


XM_046992 


Homo sapiens hypothetical gene supported by AL137734 
(LOC92742), mRNA 


1.5 


3193 


AE003909 


Xylella fastidiosa 9a5c, section 55 of 229 of the complete genome 


6.3 \ 


3194 


AF240176 


Mus Mus musculus MRP13 mRNA 


1.5 | 


3195 


L08174 


Romanomermis culicivorax mitochondrial NADH dehydrogenase 
subunits 6 and 3 (ND6, ND3) cytochrome P450-like protein genes, 
complete CDS 


0.5 
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3196 


AK021O98 


Mus musculus adult male corpus striatum cDNA, REKEN full-length 
enriched library, cione:C030021G16, full insert sequence 


6.5 


3197 


AF3 11637 


Eimcria separata ORF470 gene, partial cds; apicoplast gene for 
apicoplast product 


0.079 


3198 


D84549 


Candida tropicalis DNA for carnitine acetyltransferase, complete cds 


0.057 


3199 


XM 045864 


Homo sapiens KIAA1804 protein (KIAA1804), mRNA \ 


0 


3200 


XM_010334 


Homo sapiens transducin (beta)-like 1 (TBL1), mRNA 


1.9 


3201 


AL133154 


S.pombe chromosome I cosmid cl751 


0.15 


3202 


S75898 


aromatase [Poephila guttata=zebra finches, ovary, mRNA, 3188 nt] 


0.49 


3203 


AF047669 


Homo sapiens dystrophin (DMD) gene, deletion junction 8 between 
introns 7 and 20 


0.001 


3204 


AB042652 


Bos taurus mRNA for zona pellucida glycoprotein ZPB, complete cds 


0.58 


3205 


AF147082 


Homo sapiens gamma-glutamyl hydrolase gene, exons 3 through 7 


0.028 


3206 


AE006252 


Lactococcus lactis subsp. lactis IL1403 section 14 of 218 of the 
complete genome 


0.07 


3207 


Z99773 


Caenorhabditis elegans cosmid H06A10, complete sequence 


0.7 i 


3208 


NC_001328 


Caenorhabditis elegans, mitochondrion, complete genome 


0.025 


3209 


AY037251 


Arabidopsis thaliana AT4g30750/T10C21_100 mRNA, complete cds 


5.9 j 


3210 


Z12834 


L.esculentum gene for polyphenol oxidase 


0.75 


3211 


AL391057 


Human DNA sequence from clone RP1 1-533B2 on chromosome 10, 
complete sequence {Homo sapiens] 


0.07 


3212 


AF001454 


Helianthus annuus Dc3 promoter-binding factor-2 (DPBF-2) mRNA, 
complete cds 


0.21 


3213 


AF295925 


Streptococcus pneumoniae transposon Tn5252 Orf28, Orf27, Oi£26, 
Orf25, Or£24, Orf23, Orf22, Orf21, and Orf20 genes, complete cds 


0.019 


3214 


AC024821 


Caenorhabditis elegans cosmid Y55B1AR, complete sequence 


1.7 


3215 


AC018655 


Homo sapiens 12q BAC RP11-946G22 (Roswell Park Cancer Institute 
Human BAC Library) complete sequence 


0.0000001 


3216 


XM 043524 


Homo sapiens cadherin 17, LI cadherin (liver-intestine) (CDH17), 
mRNA 


2E-52 


3217 


XM 051566 


Homo sapiens phosphoinositide-3-kinase, catalytic, beta polypeptide 
(PIK3CB), mRNA 


0.025 


3218 


X84742 


Mmusculus GPX3 gene 


0.079 


3219 


AL445527 


Human DNA sequence from clone RP1 1-1 15C2 on chromosome 1, 
complete sequence [Homo sapiens] 


5.4 


3220 


AF254792 


Danio rerio Period3 circadian clock protein (Per3) mRNA, complete 
cds 


3.8 


3221 


AE002225 


Chlamydophila pneumoniae AR39, section 53 of 94 of the complete 
genome 


0.12 
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1000 
Dill 


XM_048692 


Homo sapiens similar to hypothetical protein (H. sapiens) 
(LOC92998), mRNA 


2E-15 


5115 


AF335456 


Clavulina cinerea 18S ribosomal RNA gene, partial sequence; internal 
transcribed spacer 1, 5.8S ribosomal RNA gene and internal 
transcribed spacer 2, complete sequence; and 28S ribosomal RNA 
gene, partial sequence 


0.037 




AF335589 


Glycine max chitinase class I (Chial) and hypothetical protein genes, 
complete cds 


3.3 


nor 
511 J 


AE005230 


Escherichia coli 0157:H7 EDL933 genome, contig 1 of 3, section 54 
of 155 


5.2 


511ty 


AB010822 


Homo sapiens MASP gene for mannose binding protein-associated 
protease, exon 1 


7E-13 


100"P 


XM 052139 


Homo sapiens interleukin 13 receptor, alpha 1 (BL13RA1), mRNA 


0.007 


izzo 


AF293654 


Toxoplasma gondii TgMICIO precursor (MIC10) mRNA, complete 
cds 


0.15 


3229 


AJ388514 


Canis familiaris mRNA for partial Ribosomal protein S14 


0.38 


3230 


U80814 


Caenorhabditis elegans cosmid B0547, complete sequence 


0.22 


3231 


NM 013890 


Mus musculus f-box and WD-40 domain protein 2 (Fbxw2), mRNA 


0.29 


3232 


AB042411 


Homo sapiens strg gene for striatum-specific G potein-coupled 
receptor, complete cds 


0.000009 


3233 


AF151527 


Hyphantria cunea serpin gene, complete cds 


0.001 


3234 


X78305 


M musculus (SRP9) signal recognition particle subunit mRNA, 
1116bp 


0.047 


3235 


NM 018830 


Mus musculus N-acylsphingosine amidohydrolase 2 (Asah2), mRNA 


0.18 


3Z3o 


AF302060 


Apocryptophagus sp. MSPAR4 cytochrome oxidase I (COI) gene, 
partial cds; mitochondrial gene for mitochondrial product 


0.007 


JZ3 / 


AB024025 


Arabidopsis thaliana genomic DNA, chromosome 5, TAC 
clone:K16F13 


0.012 




U25810 


Bos taurus lysozyme (LZ) gene, complete cds 


0.004 


3239 


X95363 


C.annuum gamma thionin gene 


0.15 


3Z4U 


AF286897 


Plasmodium berghei carbamoyl phosphate synthetase II gene, partial 
cds 


0.15 


3241 


AF290920 


Procambarus clarkii microsatellite PclG-03 sequence 


3.7 


3242 


AE006380 


Lactococcus lactis subsp. lactis BL1403 section 142 of 218 of the 
complete genome 


2.2 


3243 


U66913 


Dictyostelium discoideum ORF DG1040 gene, partial cds 


0.22 


3244 


Y00460 


Mouse genes for transfer RNA-Ala, tRNA-He, tRNA-Pro and tRNA- 
Lys 


0.026 


3245 


L08052 


Fruitfly myosin alkali light chain (Mlcl) gene 
*supercontractile/tubular muscle and indirect flight muscle isofonns 1 ; 
complete cds's 


0.67 


3246 


AF076597 


Dictyostelium discoideum ubiquitin-conjugating enzyme protein UbcC 
(ubcC) mRNA, complete cds 


0.58 
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NO 
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DESCRIP 


P VALUE 


3247 


U67518 ] 


Methanococcus jannaschii section 60 of 150 of the complete genome 


0.003 


3248 


] 

AF362370 


Dictyostelium discoideum histidine kinase DhkH (dhkH) gene, partial 
cds 


6 


3249 


AF146688 


Fugu rubripes sex comb on midleg-like 2 protein (SCML2) gene, 
complete cds 


0.026 


3250 


AK009952 


Mus musculus adult male tongue cDNA, RIKEN full-length enriched 
library, clone:2310057C01, fall insert sequence 


0.07 


3251 


AB0O58O3 


Homo sapiens DNA for histidine-rich glycoprotein, complete cds 


0.078 | 


3252 


AY021484 


Oryza sativa microsatellite MRG3809 containing (TA)X27, closest to 
marker G227, genomic sequence 


2.2 


3253 


AD000833 


Homo sapiens DNA from chromosome 19-cosmid fl9399 (~17 kb 
EcoRI restriction fragment) 


0.024 


3254 


AB048982 


Macaca fascicularis brain cDNA, clone:QnpA-13067 


0.38 


3255 


AL023839 


Caenorhabditis elegans cosmid Y39A1C, complete sequence 


0.026 


3256 


NC 002052 


Tomato spotted wilt virus RNA-L, complete sequence 


0.22 


3257 


AK022137 


Homo sapiens cDNA FLJ12075 fis, clone HEMBB1002425 


0.64 


3258 


U44047 


Xenopus laevis MCM2 mRNA, complete cds 


0.24 


3259 


Z61385 


Rsapiens CpG island DNA genomic Msel fragment, clone 51d3, 
forward read cpg51d3.ftla 


5E-21 


3260 


AF163151 


Homo sapiens dentin sialophosphoprotein precursor (DSPP) gene, 
complete cds ' 


0.003 


3261 


L11194 


Montinia caiyophyllacea ribulose 1,5-bisphosphate carboxylase large 
subunit (ibcL) gene, partial cds; chloroplast gene for chloroplast 
product 


6.4 


3262 


BC002928 


Homo sapiens, clone MGC:11357 IMAGE:3954507, mRNA, 
complete cds 


3E-46 


3263 


XM 002837 


Homo sapiens acid phosphatase, prostate (ACPP), mRNA 


2.3 


3264 


AF189787 


Hordeum vulgare putative c-myb-like transcription factor (MYB3R-1) 
gene, partial cds 


0.61 


3266 


AB048353 


Paramecium caudatum mRNA for hsp60, partial cds 


1.8 


3267 


U75274 


Arabidopsis thaliana acyl-CoA binding protein (ACBP) gene, 
complete cds 


0.69 


3268 


AF034077 


Equus caballus alpha- 1 -antitrypsin (Spi2) gene, complete cds 


0.077 


3269 


AL031426 


Human DNA sequence from clone CTA-191D12 on chromosome 
22ql3. 1 Contains two exons of the APOL2 gene for apolipoprotein L 
2, ESTs and GSSs, complete sequence [Homo sapiens] 


0.000001 


3270 


AJ007958 


Thiobacillus sp. plasmid pT3.2I including adh and rep A genes, partial 


U.UU1 


3271 


NM 025978 


Mus musculus RIKEN cDNA 27000 16E08 gene (27000 16E08Rik), 
mRNA 


1E-17 


3272 


XM 018403 


Homo sapiens similar to gene with multiple splice variants near HD 
locus on 4pl6.3 (H. sapiens) (LOC91016), mRNA 


5.5 


3273 


AJ238613 


Gallus gallus mRNA for teneurin- 1 


0.57 


3274 


XM 033841 


Homo sapiens FGFR1 oncogene partner (FOP), mRNA 


0.55 
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3275 


NM 014601 


Homo sapiens EH-domain containing 2 (EHD2), mRNA 


0.065 


^11 fx 
J A* 1 \J 


AK021659 


Homo sapiens cDNA FLJ1 1597 fis, clone HEMBA1003856 


9E-13 




AB054644 


Sarcophaga peregrina gene for lectin, 5* upstream region and partial 
cds 


0.045 


/o 


AK0 10362 


Mus musculus ES cells cDNA, RIKEN full-length enriched library, 
clone:2410003J06, full insert sequence 


5.5 


/y 


AF282974 


Plasmodium falciparum cysteine protease falcipain-3 gene, complete 
cds 


1.8 


JZOW 


U60135 


Arabidopsis thaliana serine/threonine protein phosphatase 2 A-3 
catalytic subunit gene, complete cds 


0.024 




AF365086 


Martiodendron parviflorum tRNA-Leu (trnL) gene, partial intron 
sequence; chloroplast gene for chloroplast product 


6.5 


3282* 


AE002140 


Ureaplasma urealyticum section 41 of 59 of the complete genome 


0.22 




Z49124 


S.oleracea chloroplast secA mRNA 


0.074 


3284 


AE002353 


Chlamydia muridarum, section 80 of 85 of the complete genome 


4.4 




XM_0 11989 


Homo sapiens ELKL motif kinase (EMK1), mRNA 


7E-54 




AJ252287 


Plasmodium falciparum partial msp3 gene for merozoite surface 
protein 3 


0.68 


jZo / 


AK0 18224 


Mus musculus adult male medulla oblongata cDNA, RIKEN full- 
length enriched library, clone:6330531I01, full insert sequence 


2 




X91123 


Homo sapiens partial SLC6 A2 gene for norepinephrine transporter, 
exon5 


7.5 


3289 


AB045007 


Mus musculus GNB2 gene for guanine nucleotide binding protein 
beta2 subunit, complete cds 


0.67 


19QO 


AY017272 


Peromyscus difficilis isolate TK32541 mys-9 retrotransposon and 
LTR, 5* sequence 


5.4 


3291 


AL590902 


S.pombe chromosome I cosmid pB8E5 


0.077 




XM_003311 


Homo sapiens surfactant protein A binding protein (SPAR), mRNA 


0.38 


3293 


AK027388 


Homo sapiens cDNA FLJ14482 fis, clone MAMMA1002362 


0.007 


**9Qd. 


AB043885 


Homo sapiens BHLHB2 gene for bHLH transcriptional factor DEC1, 
complete cds 


0.068 




XM 028213 


Homo sapiens up-regulated by BCG-CWS (LOC64116), mRNA 


0.073 




XM 039789 


Homo sapiens solute carrier family 2 (facilitated glucose transporter), 
member 2 (SLC2A2), mRNA 


9.2 


3297 


AK021841 


Homo sapiens cDNA FLJ11779 fis, clone HEMBA1005921 


0.65 




M11815 


Plasmid pGKLl from killer yeast (K.lactis), complete 


0.008 


1900 


AF166527 


Zea mays flowering-time protein isoforms alpha and beta (ZmLD) 
gene, alternatively spliced products, complete cds 


0.074 


3300 


XM 040556 


Homo sapiens similar to hypothetical protein FLJ20127 (H. sapiens) 
(LOC91775), mRNA 


0.22 


3301 


AB001569 


Carrot DNA for transposon Tdcl 


0.17 


3302 


AF326737 


Mus musculus sclerostin gene, complete cds 


4.7 


3303 


AF272001 


Ebola virus subtype Zaire strain Mayinga complete genome 


1.9 
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NO 
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DESCRIP 


P VALUE 


3304 


XM 034863 


Homo sapiens similar to hypothetical protein FLJ22294 (H. sapiens) 
(LOC90918), mRNA 


0.2 


3305 


AC006654 


Caenorhabditis elegans cosmid H09I01, complete sequence 


0.023 


3306 


AK008812 


Mus musculus adult male stomach cDNA, RIKEN full-length 
enriched library, clone:2210403E17, full insert sequence 


6.1 


3307 


AF151387 


Lucilia eximia tRNA-He gene, partial sequence; D-loop, complete 
sequence; and 12S ribosomal RNA, partial sequence; mitochondrial 
genes for mitochondrial products 


0.077 


3308 


S53497 


immunoglobulin epsilon chain constant region=secreted form {3 1 
region} [human, B cell myeloma U-266, Genomic, 3198 nt] 


5.5 


3309 


XM 010126 


Homo sapiens stromal antigen 2 (STAG2), mRNA 


e-115 


3310 


D 10500 


Homo sapiens MAT gene for mitochondrial acetoacetyl-CoA thiolase, 
exon 1 


1.9 


3311 


AF293900 


ParaQabellula reniformis small subunit ribosomal RNA gene, 
complete sequence 


0.092 


3312 


NC 001530 


Human papillomavirus type 47, complete genome 


0.67 


3313 


Z85396 


H.sapiens Ig lambda light chain variable region gene (34-34SWDF32) 
rearranged; Ig-Light-Lambda; VLambda 


0.71 


3314 


NC_001566 


Apis mellifera ligustica mitochondrion, complete genome 


0.015 


3315 


AF103869 


Plasmodium yoelii blood-stage membrane protein Ag-1 mRNA, 
complete cds 


1.5 


3316 


XM 007957 


Homo sapiens hypothetical protein (LOC57019), mRNA 


3E-27 


3317 


XMJU2723 


Homo sapiens chromosome 18 open reading frame 1 (C18orfl), 
mRNA 


2.7 


3318 


XM 046956 


Homo sapiens Kallmann syndrome 1 sequence (KALI), mRNA 


4.5 


3319 


AE000614 


Helicobacter pylori 26695 section 92 of 134 of the complete genome 


0.068 


3320 


NM 017932 


Homo sapiens hypothetical protein FLJ20700 (FLJ20700), mRNA 


1.7 


3321 


AK017541 


Mus musculus 8 days embryo cDNA, RIKEN full-length enriched 
library, clone:5730410F24, full insert sequence 


4.8 


3322 


AK016519 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4932411G06, full insert sequence 


1.9 


3323 


U38184 


Trypanosoma cruzi kinetoplast minicircle ATPase subunit 6 mRNA, 
complete cds 


0.0003 


3324 


Z54181 


M.catarrhalis bla gene for BRO-2 


0.069 


3325 


U43961 


Oryza sativa 3-hydroxy-3-methylglutaiyl-CoA reductase gene, 
promoter region and complete cds 


0.17 




AK025166 


Homo sapiens cDNA: FLJ21513 fis, clone COL05778 


0.23 


3327 


AF158727 


Homo sapiens truncated properdin (PFC) gene, exons 1 and 2 


0.73 


3328 


AL1 16536 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen 
deprivation 


0.57 


3329 


NC 001712 


Locusta migratoria mitochondrion, complete genome 


0.061 
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P VALUE 


3330 


AY028779 


Staphylococcus epidermidis plasmid pST6 beta-lactamase repressor 
Blal (blal) gene, partial cds; resolvase/integrase BinR (binR), 
recombinase Sin (sin), repressor QacR (qacR), and transport protein 
QacB (qacB) genes, complete cds; and unknown gene 


0.23 


in 1 


AF346992 


Homo sapiens mitochondrion, complete genome 


1.7 


3332 


NMJ)31389 


Mus musculus nbonuclease/angiogemn inhibitor 2 (Rnh2), mRNA 


1.9 


1111 


AC079918 


Homo sapiens BAC clone RP11-114K13 from 7, complete sequence 


0.0002 


iiig. 
335j 


AF298207 


Dictyostelium discoideum transposon thug-S, complete sequence 


0.15 | 


5530 


L36903 


Saccharomyces cerevisiae mitochondrion 21S ribosomal RNA (21S 
rRNA) gene, 21S R2 ribosomal RNA (21S R2 rRNA) gene, and ORF 
RI 


0.024 


1111 


AF239663 


Prunus persica ACC synthase gene, partial cds 


0.066 


3338 


AJ223690 


H. sapiens D2-l-K2G7np gene for immunoglobulin kappa chain 
variable region 


0.026 


3339 


AC084157 


Caenorhabditis elegans cosmid Y46E12BR, complete sequence 


4.4 | 


3340 


M58606 


B.subtilis threonine deaminase (ILVA) gene, complete cds 


0.62 


3341 


NC 001648 


Cassava vein mosaic virus, complete genome 


0.008 


3342 


U09277 


Caenorhabditis elegans Bristol N2 synaptic vesicle acetylcholine 
transporter (unc-17) gene, complete cds 


0.024 


3343 


U52821 


Mus musculus prion (PrP) gene, exons 1 and 2 


1.9 [ 


"5 "5 A A 

3344 


AK014936 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4921521D15, full insert sequence 


2 


3345 


AE007447 


Streptococcus pneumoniae section 130 of 194 of the complete genome 


5 


3346 


AK005820 


Mus musculus adult male testis cDNA, RIKEN foil-length enriched 
library, clone: 1700010F03, full insert sequence 


0.18 


11A1 


ABO 11681 


Danio rerio mRNA for ZOR-2, complete cds 


1.9 


11 >io 
3348 


AB030616 


Mus musculus Chetk-alpha gene for choline/ethanolamine kinase- 
alpha, exon 1 and 5 f -flanking region 


2E-10 


3349 


AF269359 


Staphylococcus epidermidis strain SRI clone step.l002el2 genomic 
sequence 


0.53 


3350 


AF063004 


Trypanosoma rotatorium trans-spliced leader gene, partial sequence 


1.9 


3351 


AE001411 


Plasmodium falciparum chromosome 2, section 48 of 73 of the 
complete sequence 


0.008 


3352 


AF240496 


Mus musculus clone 148d4 Naip5 gene, partial sequence 


0.56 


3353 


AM UU4/ZZ 


riomo sapiens nypoineucai protein ri-»oo4zi \jrXA3o £ *zi) 9 iiuuna 




3354 


AF100324 


Mycoplasma fermentans DNA polymerase EI subunit (dnaH), YabC 
protein (yabC), orf550, lipoprotein MALP-404 precursor (malP), 
insertion sequence IS1630 transposase (tnpA) genes, complete cds; 
P78 ATP-binding cassette transport operon, complete sequence;> 


2.1 > 


3355 


NMJU9684 


Mus musculus serine/threonine kinase 23 (Stk23), mRNA 


0.23 
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NO 
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P VALUE 


3356 


AB004856 


Buchnera aphidicola mRNA for homoserine kinase, partial cds 


1.9 


3357 


AF328428 | 


Homo sapiens clone lptel_2109Bf&Br sequence 


2E-11 


3358 


XM 041166 


Homo sapiens hypothetical gene supported by AF308287; BC009516 
(LOC91860), mRNA 


0.024 


3359 


AE006429 


Lactococcus lactis subsp. lactis IL1403 section 191 of 218 of the 
complete genome 


0.24 


3360 


U95455 


HIV-1 clone 8 isolate BV from Milan, envelope glycoprotein, V1-V5 
region (env) gene, partial cds 


0.026 


3361 


XMJ)01775 


Homo sapiens PEF protein with a long N-tenninal hydrophobic 
domain (peflin) (PEF), mRNA 


0.45 


3362 


AF100670 


Caenorhabditis elegans cosmid M4 


0.072 


3363 


AB036737 


Homo sapiens mRNA for RERE, complete cds 


0.21 


3364* 


AE001030 


Archaeoglobus fulgidus section 77 of 172 of the complete genome 


1.5 


3365 


XM 046602 


Homo sapiens hypothetical gene supported by AK027658 
(LOC92679), mRNA 


2 


3366 


XM 005889 


Homo sapiens kinesin-like 1 (KNSL1), mRNA 


0.21 


3367 


AF275345 


Lycopersicon esculentum MADS-box transcription factor jointless 
gene, complete cds 


1.2 


3368 


NC 000857 


Ceratitis capitata complete mitochondrial genome 


0.022 


3369 


AC002048 


Human PCR Fragment of Cosmid gl346a274 from 7q3 1.3, complete 
sequence [Homo sapiens] 


1E-16 


3371 


AB016611 


Enterobacter cloacae ampC gene, complete cds 


0.62 


3372 


NM_026040 


Mus musculus RIKEN cDNA 2810036K01 gene (2810036K01Rik), 
mRNA 


e-138 


3373 


AJ252860 


Human immunodeficiency virus type 1, partial proviral POL gene, rt 
region, isolate CL 


0.25 


3374 


XM 043434 


Homo sapiens villin-like (VILL), mRNA 


1.8 


3375 


AE002776 


Drosophila melanogaster genomic scaffold 142000013385952, 
complete sequence 


0.19 


3376 


XM 030198 


Homo sapiens tumor necrosis factor receptor superfamily, member 8 
(TNFRSF8), mRNA 


0.76 


3377 


X65450 


C.canis mRNA for chloride channel 


6.3 


3378 


NMJ)32248 


Homo sapiens hypothetical protein DKFZp434F1719 
(DKFZp434F1719), mRNA 


1E-47 


3379 


L47121 


Carnobacterium piscicola transposase, bacteriocin, histidine protein 
kinase, ATP dependent transloactor, accessory protein, and 
carnobacteriocin B2 genes, complete cds 


0.18 


3380 


L26287 


Schistosoma mansoni SMDR2 gene, complete cds 


0.66 


3381 


AF125130 


Nemadactylus macropterus clone 15.1 microsatellite sequence 


0.072 


3383 


AB040099 


Oryzias latipes olgc4 gene for membrane guanylyl cyclase 01GC4, 5 1 
flanking sequence and partial cds 


0.22 


3384 


AF375398 


Arabidopsis thaliana Atlg51660/F19C24_26 gene, complete cds 


6 


3385 


AF391256 


HIV-1 clone TV004G24 from South Africa gag protein (gag) gene, 
complete cds 


0.23 


3386 


X91865 


A. thaliana asp5 gene 


2.1 
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3387 


XM 047084 


Homo sapiens human immunodeficiency virus type I enhancer- 
binding protein 2 (HTVEP2), mRNA 


0.21 


3388 


AF1 15469 


Arabidopsis thahana cultivar Landsberg erecta Arac2 (Arac2) gene, 
complete cds 


2.3 


3389 


Z48051 


Ksapiens gene for myelin oligodendrocyte glycoprotein (MOG) 


0.008 


3390 


U49259 


Arabidopsis thaliana isopentenyl diphosphaterdimethylallyl 
diphosphate isomerase (IPP2) mRNA, complete cds 


2.2 


3391 


AF032124 


Homo sapiens RET proto-oncogene (RET) gene, 5' flanking region 
and partial cds 


0.079 


3392 


AE001420 


Plasmodium falciparum chromosome 2, section 57 of 73 of the 
complete sequence 


0.067 


3393 


XM 005555 


Homo sapiens UDP-glucose ceramide glucosyltransferase (UGCG), 
mRNA 


0.076 


3394 


Z81014 


Human DNA sequence from cosmid U65A4, between markers 
DXS366 and DXS87 on chromosome X * 


2.1 


3395 


XM 002056 


Homo sapiens hypothetical protein FLJ11006 (FLJ11006), mRNA 


0.26 


3396 


AF339787 


Homo sapiens clone IMAGE:205688, mRNA sequence 


2 


3397 


AP000145 


Homo sapiens genomic DNA, chromosome 21q2L2, LL56-APP 
region, clone B2291C14-R44F3, segment 10/10, complete sequence 


0.026 


3398 


AF261936 


Homo sapiens clone Ilptel_c3t7 sequence 


0.00002 


3399 


AF193615 


Homo sapiens G-protein-coupled inward rectifying K+ channel 
KCNJ9 (KCNJ9) gene, complete cds 


0.009 


3400 


AK009952 


Mus musculus adult male tongue cDNA, RIKEN full-length enriched 
library, clone:2310057C01, full insert sequence 


0.079 


3401 


AF091779 


Oncorhynchus mykiss non-classical MHC class I antigen (Onmy- 
UAA) gene, Onmy-UAA*0101 allele, complete cds 


2.3 


3402 


AB001735 


Mus musculus DNA for ADAMTS-1, complete cds 


2.3 


3403 


AK000589 


Homo sapiens cDNA FLJ20582 fis, clone KAT12156 


1E-13 


3404 


U55243 


Dictyostelium discoideum glyceraldehyde-3 -phosphate dehydrogenase 
(GAPDH) gene, partial cds 


0.24 


3405 


AF1 10612 


Rrvonhiln^ Tnirmnln^ cytochrome b anoenzvme fCvfb^ £ene Dartial 
cds; tRNA-Ser, tRNA-Leu, tRNA-Cys, and tRNA-Met genes, 
complete sequence; and NADH dehydrogenase subunit 2 (ND2) gene, 
partial cds, mitochondrial genes for mitochondrial products 


0.077 


3406 


AF060542 


Haemophilus influenzae biogroup aegyptius HaelV 
restriction/modification system (HaelVRM) gene, complete cds 


0.75 


3407 


AF303391 


Homo sapiens meiotic recombination 11 (MRE11A) gene, exon 16 


1.7 


3408 


AF269359 


Staphylococcus epidermidis strain SRI clone step.l002e!2 genomic 
sequence 


0.22 


3409 


AF327975 


Hentzia palmarum 16S ribosomal RNA, partial sequence; 
mitochondrial gene for mitochondrial product 


0.57 
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NO 
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3410 


XM 047898 


Homo sapiens variable charge protein on X with eight repeats (VCX- 
8r), mRNA 


5.9 


3411 


XM 047003 


Homo sapiens TATA box binding protein (TBP)-associated factor, 
RNA polymerase II, B, 150kD (TAF2B), mRNA 


0.7 


3412 


D86566 


Human DNA for NOTCH4, partial cds 


0.025 


3413 


AF360301 


Arabidopsis thaliana unknown protein (T8B10_70/AT3g60410) 
mRNA, complete cds 


0.64 


3414 


AJ404228 


Saccharomyces douglasii mitochondrial tRNA gene cluster \ 


0.002 


3415 


XM 037369 


Homo sapiens hypothetical protein FU20693 (FLJ20693), mRNA 


9E-14 


3416 


AE005230 


Escherichia coli 0157:H7 EDL933 genome, contig 1 of 3, section 54 
of 155 ! 


6.2 


- 

3417 


AL121778 


Human DNA sequence from clone RP5-839B1 1 on chromosome 20. 
Contains the first coding exon of the gene for a novel protein with a 
Kunitz/Bovine pancreatic trypsin inhibitor domain and WAP-type 
(Whey Acidic Protein) Tour-disulfide core' domains and an e> 


0.08 


3418 


AY020760 


Oryza sativa microsatellite MRG3085 containing (TA)X13, closest to 
marker R2976, genomic sequence 


6.8 


3419 


AL122122 


Homo sapiens mRNA; cDNA DKFZp434L098 (from clone 
DKFZp434L098) 


1.8 


3420 


AF250346 


Xenopus la&vis early growth response protein 1 (egrl) gene, partial 
cds 


0.077 


3421 


XM_044852 


Homo sapiens KIAA1572 protein (KIAA1572), mRNA 


0.7 


3422 


XM_049057 


Homo sapiens G protein-coupled receptor 49 (GPR49), mRNA 


7 


3423 


AJ223690 


HLsapiens D2-l-K2G7np gene for immunoglobulin kappa chain 
variable region 


0.028 


3424 


AJ314752 


Anas piatyrhynchos mlgM gene (partial), mlgA gene (partial) and 
IgA gene (partial), clone 13.1 


0.072 


3425 


U51739 


Ipomoea trifida secreted glycoprotein 2 (ISG2) mRNA, complete cds 


0.64 


3426 


S66752 


coagulation factor JX{3* region} [human, Genomic Mutant, 3 130 nt] 


0.026 


3427 


AE001122 


Borrelia burgdorferi (section 8 of 70) of the complete genome 


0.72 | 


3428 


AJ233850 


Rattus norvegicus microsatellite sequence clone 27E9 


2.1 


3429 


L35664 


Homo sapiens (subclone H8 8 f5 from PI 35 H5 C8) DNA sequence 


0.079 


3430 


U97029 


Callithrix geoffroyi epsilon-globin gene, partial cds 


2.1 


3431 


Z82181 


Human DNA sequence from clone LL22NC01-86D10 on chromosome 
22 Contains part of the SYN3 gene for synapsin m and ESTs, 
complete sequence [Homo sapiens] 


0.71 


3432 


AB066544 


Macaca fascicularis brain cDNA clone:QtrA-10780, full insert 
sequence 


0.62 


3433 


L77039 


Homo sapiens (subclone 2_e8 from PI H22) DNA sequence 


0.0000002 


3434 


AL590650 


Human DNA sequence from clone RP11-334F20 on chromosome 6, 
complete sequence [Homo sapiens] 


1.9 
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3435 


U93519 


Kladothrips rugosus cytochrome oxidase subunit I (COI) gene, 
mitochondrial gene encoding mitochondrial protein, partial cds 


0.77 


3437 


U23451 


Caenorhabditis elegans cosmid ZK75 


0.48 


3438 


BC003415 


Homo sapiens, clone IMAGE:3451161, mRNA 


0.41 


3439 


AK024526 


Homo sapiens cDNA: FLJ20873 fis, clone ADKA02669 


3.3 


3440 


AE002548 


Neisseria meningitidis serogroup B strain MC58 section 190 of 206 of 
the complete genome 


0.6 


3441 


AF1 16832 


Anopheles minimus strain PT1 cytochrome oxidase subunit I (COI) 
gene, partial cds; mitochondrial gene for mitochondrial product 


1.5 


3442 


AB055323 


Macaca fascicularis brain cDNA, clone:QflA- 10308 


1.6 


3443 


AK014667 


Mus musculus 0 day neonate head cDNA, RIKEN full-length enriched 
library, clone:4833408P15, full insert sequence 


7E-25 


3444 


U80753 


Homo sapiens CAGL79 mRNA, partial cds 


0.048 


3445 


AJ275304 | 


Cicer arietinum mRNA for putative AB A-responsive protein 


i.4 


3446 


AF336131 


Lumpy skin disease virus strain Neethling, 3' partial sequence 


0.3 


3447 


U11476 


HIV-1 isolate LBLA010 from Liberty City, Florida, envelope 
glycoprotein (env) gene, partial cds 


1.6 


3448 


XM 045582 


Homo sapiens hypothetical gene supported by AL137476 
(LOC92530), mRNA 


6E-13 


3449 


AF030199 


Mus musculus type 1 sigma receptor gene, complete cds 


0.63 


3450 


AF345926 


Myzopoda aurita 12S ribosomal RNA gene, complete sequence; tRNA 
Val gene, complete sequence; and 16S ribosomal RNA gene, complete 
sequence; mitochondrial genes for mitochondrial products 


0.13 


3451 


NM 008715 


Mus musculus Notch24ike (Notch21), mRNA 


0.17 


3452 


AF067794 


Nicotiana tabacum meristem-specific promoter sequence 


0.1 


3453 


AJ287175 


Mucor indicus partial act-1 gene for actin 


2 


3454 


U89439 


Bos taurus ubiquitin-like protein mRNA, complete cds 


0.000003 


3455 


AF250346 


Xenopus laevis early growth response protein 1 (egrl) gene, partial 
cds 


0.072 


3456 


Z73587 


S.cerevisiae chromosome XVI reading frame ORF YPL23 lw 


0.19 


3457 


AB055310 


Macaca fascicularis brain cDNA, clone:QflA- 14233 


0.21 


3458 


Y16090 


Daucus carota Susy*Dcl gene 


0.13 


3459 


AE003868 


Xylella fastidiosa 9a5c, section 14 of 229 of the complete genome 


0.16 


3460 


AF281912 


Homo sapiens clone 16qtel_c89bt7 sequence 


5E-36 


3461 


AF126541 


Ateline herpesvirus 3 H-DNA terminal repeat unit, complete sequence 


0.023 


3462 


AK011178 


Mus musculus 10 days embryo cDNA, RIKEN full-length enriched 
library, clone:2600010E01, full insert sequence 


1.5 


3463 


AF370226 


Arabidopsis thaliana unknown protein (F1B16.11) mRNA, complete 
cds 


0.4 


3464 


1 X94768 


H sapiens RP3 gene (XLRP gene 3) 


0.21 


3465 


AB029068 


Mus musculus gene for Ser/Thr kinase KKIAMRE, exon 1, 2, 3, 4, 5 


0.57 
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3466 


X84742 


M.musculus GPX3 gene 


0.038 


j*fO / 


BC010011 


Homo sapiens, Similar to RIKEN cDNA 4931428D14 gene, clone 
MGC: 15407 IMAGE.4309613, mRNA, complete cds 


0.22 


J4oo 


AF288536 


Legionella longbeachae spectinomycin 3* adenylyltransferase (aadA), 
possible transcriptional regulatory protein (lrpR), and possible sensor 
kinase protein (lskS) genes, complete cds; and unknown genes 


1 




AF288536 


Legionella longbeachae spectinomycin 3* adenylyltransferase (aadA), 
possible transcriptional regulatory protein (lrpR), and possible sensor 
kinase protein (lskS) genes, complete cds; and unknown genes 


1 o 

l.o 




AB01315O 


Papiho machaon mitochondnal ND5 gene for NADH dehydrogenase 
subunit 5, partial cds 


5.9 


~> t r / 1 


XM 028011 


Homo sapiens CGI- 145 protein (LOC51028), mRNA 


0.63 


3472 


XM 052069 


Homo sapiens actin related protein (MGC15664), mRNA 


5.7 


3473 


AL449163 


Human DNA sequence from clone RP11-533E16 on chromosome 6, 
complete sequence [Homo sapiens] 


0.00001 


1A7A 


X96685 


B.burgdorferi cell division genes 


1.9 


j*f /-) 


AF179589 


Rana tigrina rugulosa glucagon receptor (GluR) mRNA, complete cds 


0.19 


3476 


AJ297549 


Homo sapiens partial PIK3CB gene for phosphatidylinositol 3-kinase 
catalytic subunit pi lObeta, exons 1-2 and joined CDS 


0.87 


1/177 


XM_032707 


Homo sapiens KIAAO630 protein (KIAA0630), mRNA 


0.2 


3478 


U25032 


Caenorhabditis elegans par-3 mRNA, complete cds 


1.3 


34 /y 


U89283 


Biomphalaria glabrata myoglobin gene, complete cds 


3.2 


34oU 


AY029303 


Strongylocentrotus purpuratus ADAM precursor, mRNA, complete 
cds 


5.6 


34© I 


AB043223 


Mus musculus V186.2 gene for immunoglobulin heavy chain, 
clone:LlG6001 


7.5 


3482 


AF304130 


Danio rerio transmembrane receptor Roundabout 1 (robol) mRNA, 
complete cds 


1.8 


3483 


AF386077 


Homo sapiens interleukin 17B (BL17B) gene, complete cds 


0.65 


3484 


U39889 


Bos taurus Y-chromosome specific genomic sequence 


0.65 


3485 


XM 032347 


Homo sapiens region containing TLS-associated serine-arginine 
protein 1; TLS-associated serine-arginine protein 2; TLS-associated 
serine-arginine protein 2; TLS-associated serine-arginine protein 2; 
TLS-associated serine-arginine protein 1; TLS-associa> 


0 


J4oO 


AF120475 


Mus musculus myelin-associated oligodendrocyte basic proteins 
MsjDr l /u ana JYiijJtSr^oy ^jviODp^ gene, exons o inrougn d ana 
complete cds, alternatively spliced 


0.0008 


3487 


U81676 


Unidentified eubacterium clone vadinBC27 16S ribosomal RNA gene, 
partial sequence 


0.31 


3488 


AL596023 


Human DNA sequence from clone RP1 1-142 J18 on chromosome X, 
complete sequence [Homo sapiens] 


0.016 


3489 


U67210 


Human clone HS2.16 Alu-Ya5 sequence 


0.056 
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3490 


XM 007196 


lomo sapiens similar to ret finger protein 2; candidate tumor 
suppressor involved in B-CLL (H. sapiens) (LOC93519), mRNA 


0.0000008 


3491 


AL049568 


Human DNA sequence from clone SC22CB-23F1 on chromosome 
22ql3.3 1-13.33 Contains a GSS, complete sequence [Homo sapiens] 


0.074 


3492 


X94755 


A.thaliana mRNA for SNFl-related ser/thr protein kinase (1852 bp) 


0.15 


3493 


AL392109 


Human DNA sequence from clone RP13-459H7 on chromosome 10, 
complete sequence [Homo sapiens] 


1.8 


3494 


S53497 


immunoglobulin epsilon chain constant region=secreted form {3' 
region} [human, B cell myeloma U-266, Genomic, 3198 nt] 


3.2 


3495 


X16027 


ILnorvegicus gene encoding alkaline phosphatase, exon 2 


0.0009 


3496 


AF213918 


Buchnera aphidicola plasmid pTrpEG isolate Mor26ps anthranilate j 
synthase component I (trpE) pseudogene, partial sequence 


0.004 


3497 


AE006491 


Streptococcus pyogenes Ml GAS strain SF370, section 20 of 167 of 
the complete genome 


0.33 


3498 


Z73587 


S.cerevisiae chromosome XVI reading frame ORF YFL23 lw 


0.089 


3499 


AK023040 


Homo sapiens cDNA FLJ12978 fis, clone NT2RP2006321 


2E-62 


3500 


AF062039 


Homo sapiens integrin alpha-2 subunit (TTGA2) gene, ITGA2-1 
allele, partial cds 


0.65 


3501 


Z77974 


ELsapiens flow-sorted chromosome 6 HindTTT fragment, SC6pA5D12 


0.036 


3502 


M13934 


Human ribosomal protein S14 gene, complete cds 


0.54 


3503 


NM 023220 


Mus musculus RIKEN cDNA 2010106G01 gene (2010106G01Rik), 
mRNA 


1.1 


3504 


AF 1405 13 


Homo sapiens clone 254LL2 genomic sequence 


0.0004 


3505 


AL359558 


Homo sapiens mRNA; cDNA DKFZp76201615 (from clone 
DKFZp76201615) 


0.019 


3506 


XM 048768 


Homo sapiens tetratricopeptide repeat domain 3 (TTC3), mRNA 


0.00009 


3507 


AF074994 


Homo sapiens full length insert cDNA YH91C06 


0.65 


3508 


NC 001338 


Sulfolobus virus 1, complete genome 


0.008 


3509 


U67502 


Methanococcus jannaschii section 44 of 150 of the complete genome 


0.21 


3510 


AY035821 


Schizosaccharomyces pombe Byrl-binding protein Bobl (bobl) gene, 
complete cds 


0.19 


3511 


AF275842 


Plasmodium falciparum clone b.2 PfEMPl (var) mRNA, partial cds 


2 


3512 


AJ294714 


Pichia anomala leu2 gene for beta-isopropylmalate dehydrogenase 


0.19 


3513 


AL391240 


Human DNA sequence from clone RP3-397P18 on chromosome 6, 
complete sequence [Homo sapiens] 


3.4 


3514 


AE002304 


Chlamydia muridarum, section 35 of 85 of the complete genome 


1.8 


3515 


AF083830 


Homo sapiens dihydropyridine receptor alpha 2 subunit 
(CACNA2D1) gene, exon 14 


0.2 
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NO 


ACCESSN 


DESCRIP 


P VALUE 


3516 


AE005222 


Escherichia coli 0157:H7 EDL933 genome, contig 1 of 3, section 46 
of 155 


1.7 


3517 


U04208 


Escherichia coli 9142-88 cytolethal distending toxin (cdtA, cdtB, and 
cdtC) genes, complete cds 


0.008 


3518 


Y07738 


Mmusculus gene for vimentin 


0.65 


3519 


AF304091 


Sotalia fluviatilis alpla lactalbumin gene, partial cds 


2 


3520 


AF270356 


Staphylococcus epidermidis strain SRI clone step.4025c09 genomic 
sequence 


0.52 


3521 


AF017516 


Bombus pascuorum cytochrome b (cytb) gene, mitochondrial gene 
encoding mitochondrial protein, partial cds 


1.8 


3523 


AF098919 


Gallus gallus alpha-globin gene domain 5 ! region 


2 


3524 


XM_047517 


Homo sapiens Kruppel-like factor 4 (gut) (KLF4), mRNA 


1E-39 


3525 


NC 001626 


Petromyzon marinus mitochondrion, complete genome 


0.021 


3526 


AF246647 


Bemisia tabaci biotype Q 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


1.9 


3527 


AL391240 


Human DNA sequence from clone KP3-397P18 on chromosome 6, 
complete sequence [Homo sapiens] 


3.1 


3528 


M32779 


AcNPV with an S.frugiperda insertion element IFP2.2 


1.9 


3529 


L44121 


Homo sapiens (clone pHK2. ID) CMT1 A gene, repeat unit 


6E-44 


3530 


U52350 


Arabidopsis thaliana GTP-binding protein (ARAC5) mRNA, 
complete cds 


0.007 


3531 


X80908 


L.esculentum gene for fruit ripening polygalacturonase 


0.65 


3532 


AF267203 


Candidates Carsonella ruddii natural-host Blastopsylla occidentals 
ATP synthase alpha subunit (atpA) gene, partial cds; ATP synthase 
gamma subunit (atpG) gene, complete cds; and ATP synthase beta 
subunit (atpD) gene, partial cds 


0.22 


3533 


AE001379 


Plasmodium falciparum chromosome 2, section 16 of 73 of the 
complete sequence 


0.25 


3534 


XM 034129 


Homo sapiens hypothetical protein DKFZp761J1523 
(DKFZp761J1523), mRNA 


6 


3535 


AE002304 


Chlamydia muridarum, section 35 of 85 of the complete genome 


1.8 


3536 


AK024082 


Homo sapiens cDNA FLJ14020 fis, clone HEMBA1002508 


1.7 


3537 


U67547 


Methanococcus jannaschii section 89 of 150 of the complete genome 


0.22 


3538 


AF215629 


Crypteronia paniculata chloroplast rpll6 gene, intron sequence 


0.024 


3539 


X03853 


Pea chloroplast gene for ribulose 1,5-bisphosphate carboxylase (rbcl) 


0.008 


3540 


[ X85055 


B.taurus cosmid-derived microsatellite DNA (clone IDVGA-40) 


0.0009 


3541 


AK005508 


Mus musculus adult female placenta cDNA, RIKEN full-length 
enriched library, clone: 160002 1C16, full insert sequence 


0.63 


3542 


Y16262 


Daucus carota mRNA for neutral invertase 


0.21 


3543 


AL513491 


Human DNA sequence from clone RP1 1-79 A21 on chromosome X, 
complete sequence [Homo sapiens] 


0.56 


3544 


XMJ)36298 


Homo sapiens hypothetical gene supported by AL442095 
(LOC91 124), mRNA 


0.02 


3545 


XM 049045 


Homo sapiens KIAA0118 protein (KIAA0118), mRNA 


0.46 
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SEQID 
NO 


ACCESSN 


DESCR1P 


P VALUE 


3546 


AY020760 


Oryza sativa microsatellite MRG3085 containing (TA)X13, closest to 
marker R2976, genomic sequence 


5.4 


3547 


AF198037 


Mycoplasma pulmonis LipB (lipB) gene, complete cds; and VsaG 
(vsaG) and VsaE (vsaE) genes, partial cds 


0.008 


3548 


X62658 


E.faecalis plasmid pADl seal gene and orfy 


1.7 


3549 


AF083468 


Emericella nidulans putative zinc finger protein (flbC) gene, complete 
cds 


0.18 


3550 


AF315801 


Homo sapiens chromosome Y AHCP pseudogene, complete sequence 


1.9 


3551 


AC007042 


Homo sapiens clone RP11-399H17, complete sequence 


6 


3552 


AF198037 


Mycoplasma pulmonis LipB (lipB) gene, complete cds; and VsaG 
(vsaG) and VsaE (vsaE) genes, partial cds 


0.008 


3553 


AK025090 


Homo sapiens cDNA: FLJ21437 fis, clone COL04285 


0 


3554 


X76027 


KJactis APA2 gene for tetraphosphatase, QCR7 gene, bcl complex 
subunit VII 


0.4 


3555 


Z78713 


H.sapiens flow-sorted chromosome 6 HindTTI fragment, SC6pA15All 


0.61 


3556 


U50357 


Streptococcus zooepidemicus zoocin A immunity factor (zif) and 
zoocin A endopeptidase (zooA) genes, complete cds 


0.13 


3557 


U93721 


Homo sapiens green cone photoreceptor pigment gene, 5' flanking 
region 


1.4 


3558 


AJ291489 


Paracentrotus lividus mRNA for fibrosurfin (surfin2656 gene) 


3.7 


3559 


M13820 


Plasmid ColIb-p9 colicin lb structural and immunity genes, complete 
cds 


0.2 


3560 


Z54147 


Human DNA sequence from cosmid L129H7, Huntington's Disease 
Region, chromosome 4pl6.3 contains CpG island 


2 


3561 


AJ301616 


Homo sapiens DNC gene for deoxynucleotide carrier, exons 1-9 


0.17 


3562 


U29403 


Human immunodeficiency virus type 1 defective gag gene, clone 3-13, 
truncated cds 


0.63 


3563 


AF090118 


Plasmodium falciparum heat shock protein hspl04 (hspl04) gene, 
partial cds 


0.21 


3564 


Z22707 


S.scrofa spl IFN gene 


0.57 


3565 


| AF090889 


Homo sapiens clone HQ0092 


0.62 


3566 


XM_050705 


Homo sapiens GalNAc-4-sulfotransferase 2 (GALNAC4ST-2), mRNA 


0.15 


3567 


XM_016611 


Homo sapiens similar to 8-oxoguanine DNA glycosylase (H. sapiens) 
(LOC93577), mRNA 


0.052 


3568 


Z35914 


S.cerevisiae chromosome II reading frame ORF YBR045c 


0.59 


3569 


S69278 


CYP1 A2=3-methylcholanthrene responsive gene [Oncorhynchus 
mykiss=rainbow trout, liver, Genomic, 5023 nt] 


1.5 


| 3570 


X56832 


H.sapiens EN03 gene for muscle specific enolase 


0.21 


3571 


XM 011264 


Homo sapiens KIAA0372 gene product (KIAA0372), mRNA 


0 


3572 


NM_022628 


Rattus norvegicus nephrin (Nphsl), mRNA 


0.017 


3573 


AJ299718 


Homo sapiens partial MASP2 gene, intron 7 


6E-33 


3574 


Z69651 


Human DNA sequence from cosmid L75B9, Huntington's Disease 
Region, chromosome 4pl6.3 


0.022 
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NO 


ACCESSN 


DESCRIP 


P VALUE 


3575 


AF327739 


Streptococcus thermophilic Pebl (pebl), GlnQ (glnQ), response 
regulator Rrl (rrl), Hpk2 (hpk2), YycJ-like protein (yycJ), and MurM 
(murM) genes, complete cds 


0.59 


3576 


AC001656 


Drosophila melanogaster (PI DS00397 (D18)) DNA sequence, 
complete sequence 


0.068 


3577 


AF262620 


Periplaneta americana 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


0.2 


3578 


L08425 


Zea mays auxin-binding protein (abpl) gene, exons 1-5 and complete 
cds 


0.64 


3579 


M97702 


Drosophila melanogaster glutathione S-transferase gene 


1.8 


3580 


AF268059 


Candidates Carsonella ruddii natural-host Arytaina geniotae RNA 
polymerase beta subimit (rpoB) and RNA polymerase beta-prime 
subunit (rpoC) genes, partial cds 


0.063 


3581 


U30149 


Entamoeba histolytica pyruvate:ferredoxin oxidoreductase (pfol) 
gene, complete cds 


1.9 


3582 


X88893 


C.jacchus intron 4 of visual pigment gene (green allele) 


0.00008 


3584 


AK000939 


Homo sapiens cDNA FU10077 fis, clone HEMBA1001864 


0.19 


3585 


AL1 14356 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen 
deprivation 


5.6 


3586 


AF126992 


Oxytropis besseyi var. ventosa chloroplast tRNA-Leu (trnL) gene, 
intron sequence 


0.21 


3587 


Z47046 


Human cosmid QLL2C9 from Xq28 


5.7 


3588 


AE006246 


Lactococcus lactis subsp. lactis IL1403 section 8 of 218 of the 
complete genome 


5.3 


3589 


AY018921 


Oryza sativa microsatellite MRG1246 containing (AT)X23, genomic 
sequence 


0.71 


3590 


AL589705 


Human DNA sequence from clone RP1 1-64M7 on chromosome 6, 
complete sequence [Homo sapiens] 


0.069 


3591 


NM_010250 


Mus musculus gamma-aminobutyric acid (GABA-A) receptor, subunit 
alpha 1 (Gabral), mRNA 


3.8 


3592 


AC024748 


Caenorhabditis elegans cosmid Y110A2AR, complete sequence 


0.017 


3593 


AJ002240 


Mus musculus minisatellite tandem repeat (MMS9) 


1.7 


3594 


AJ293574 


Brassica napus gene for putative corticosteroid binding protein and 
partial gene for hypothetical protein, cultivar N-o-9 


0.21 


3596 


AF324889 


Homo sapiens myosin phosphatase target subunit 2 (MYPT2) gene, 
exon 1 


0.22 


3597 


S62623 


IGF2=insulin-like growth foctor 2 {clone PPA1, exon 9} [human, 
Genomic, 811 nt] 


5.4 


3598 


Z22707 


S.scrofo spl IFN gene 


0.57 


3600 


AB030490 


Glycine max SG-05 gene for thiamin biosynthetic enzyme, complete 
cds 


0.023 


3602 


S69278 


CYP 1 A2=3 -methylcholanthrene responsive gene [Oncorhynchus 
mykiss=rainbow trout, Uver, Genomic, 5023 nt] 


1.7 


3603 


XM 029148 


Homo sapiens hypothetical protein BC004923 (LOC85865), mRNA 


0.002 
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CCA TT\ 

Sty ID 
NO 
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P VALUE 


3604 


AJ002979 1 


L. ooncinnum chloroplast DNA for intergenic spacer between tRNA- 
Thr (UGU) and tRNA-Leu (UAA) 5'exon, coUected from W. 
Pakawau, Cape Farewell, South Island, New Zealand 


0.067 


3605 


U36927 


Plasmodium yoelii rhoptry protein gene, complete cds 


0.2 


3606 


AK021502 


Homo sapiens cDNA FLJ11440 fis, clone HEMBA1001319 


0.00001 


3607 


AB033604 


Capra hircus Hsp70-3 gene for 70 kDa heat shock protein, complete 
cds 


0.18 


3608 


M81651 


Human semenogelin n (SEMGII) gene, complete cds 


1.8 


3609 


AB054512 


Berardius bairdii DNA, SINE flanking sequence Tuti35 locus 


0.49 


3610 


AF057740 


Homo sapiens 17-beta-hydroxysteroid dehydrogenase IV (HSD17B4) 
gene, exon 24 and complete cds 


0.018 


3611 


AJ295158 


Homo sapiens gene for thymosin beta-4 


1E-48 


3612 


X79402 


B.taurus mRNA for plasminogen 


0.21 


3613- 


NC 001807 


Human mitochondrion, complete genome 


3E-20 


3614 


X92204 


P.hybrida NAM gene 


0.049 


3616 


AB032016 


Homo sapiens PDNP1 gene for phosphodiesterase I/nucleotide 
pyrophosphatase 1, 5' flanking region and partial cds 


0.009 


3617 


AF339159 


Macropus rufogriseus mannose-6-phosphate/insulin-like growth factor 
n receptor (m6p/igf2r) mRNA, partial cds 


0.47 


3618 


AB025285 


Homo sapiens c-ERBB-2 gene, exons 1', 2\ 3', 4 f 


0.0000001 


3619 


AJ002550 


Homo sapiens MMP-1 gene, promoter region 


3.8 


3620 


AB004856 


Buchnera aphidicola mRNA for homoserine kinase, partial cds 


0.79 


3621 


AB010426 


Phytoplasma sp. gene for AL1 like protein, complete cds 


1.8 


3622 


L36903 


Saccharomyces cerevisiae mitochondrion 21S ribosomal RNA (21S 
rRNA) gene, 21S R2 ribosomal RNA (2 IS R2 rRNA) gene, and ORE 
RI 


0.022 


3623 


AF267642 


Filobasidiella neoformans var. neoformans Myo2p-like protein gene 


0.49 


3624 


XM 044499 


Homo sapiens KIAA0484 protein (KIAA0484), mRNA 


0.0001 


3625 


L04550 


Homo sapiens DNA fragment 


2.6 


3626 


AF210054 


Homo sapiens bone morphogenetic protein 7 (BMP7) gene, promoter 
region and partial cds 


2.5 


3627 


AL591505 


Human DNA sequence from clone RP1 1-67112 on chromosome X, 
complete sequence [Homo sapiens] 


0.001 


3628 


AL049969 


Homo sapiens mRNA; cDNA DKFZp564A072 (from clone 
DKFZp564A072) 


0.35 


3629 


AB055284 


Macaca fascicularis brain cDNA, clone :QflA-l 1654 


0.15 


3630 


AF304448 


Homo sapiens vitamin D receptor-interacting protein complex 
component DRIP150 (DRIP150) mRNA, complete cds 


1.3 


3631 


AL133157 


S.pombe chromosome I cosmid clB2 


0.6 


3632 


AC002475 


Homo sapiens Chromosome 22qll.2 PCR Product dgcr-gapl In 
DGCR Region, complete sequence 


0.005 ' 


3633 


AK003943 


Mus musculus 18 days embryo cDNA, RIKEN full-length enriched 
library, clone: 1110028F 11, full insert sequence 


4E-18 


3634 


S78712 


hemolin=48 kda C2-type immunoglobulin-like protein (Hyalophora 
cecropia=giant silkmoths, Genomic, 6622 nt] 


3.5 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE , 


3635 




Hnmn Qaniem cDNA FLT13835 fis clone THYRO1000712 


2E-98 


3636 


X55978 


T. brucei ESAG 8 gene for a leudne-rich repeat family protein 


0.14 


3637 


XM 031534 


tiomo sdpieiib bin iii di kj uieriiie protein ^ivi. iiiuouuiuaj \i-f\j\^>z?\J t * iu j 9 
mRNA 


5E-47 


3638 


AF044287 


LVIUoUpillicl iiiwicUivgctolCi ixciia auaL/ixii auuuiui ui nr — ' vgaxxiwL/ gw-u.^, 

complete cds 


0.004 


3639 


AB060917 


LVlaCaCa la&ClCUlalxb Uialll v»J-/rN.rV uiUUC.\^lx.rv , ±*TVJO*t, xuxx xxxowXL 

sequence ! 


1.1 


3640 


AK008522 


\XTiic mncnilnc aHiilt malp email intfctinp rDNA JJTTTRN filll-lPflPlll 
lVIUo xllUoUlllllo aUUll lUdlC jixiaxi ilHCaLiUG ui/ixn, xvxxvjl-»aw xuxx xirxigixi 

enriched library, clone:2010305C02, full insert sequence 


0.41 


3641 


NM 031340 


Rattus norvegicus timeless (Drosophila) homolog (Timeless), mRNA 


0.18 


3642 


AF334544 


xxOUio Sapiens cnroiuosoiiic i ioiiuiimtk. piu2\iu.iai oiui«iiv/3cmxig x 
translocation boundary distal to AZFc region 


0.19 


3643 


XM 033642 


xTOHIO oaDlClxb lllX^lCUUllUoluxil \XlUvlCVJiOi. LMiVJoiJiWjpivwxxi. xi^.j, 

numatrin) (NPM1), mRNA 


1E-66 


3645 


ACUU 


rieiicoDacier pyion., suain jyy secuon zj 01 1 jz, 01 mc tumpicic 

nan ftttl A 

i genome 


1.3 


3646 


Uol Vol 


Homo sapiens (subclone 6_cl0 from PI H17) DNA sequence, 
complete sequence 


4E-12 


3647 


a TT71 

AJ271J4& 


riomo sapiens partial ±-/ix±-o gene iui iiupaim tic uj xcvGpujx, cauu x 


0.19 


3648 




Simian immunodeficiency virus from African green monkey, tantalus 
species ^oivianj provirai jlmna, complete genome 


0.16 


3649 


NM 014601 


Homo sapiens EH-domain containing 2 (EHD2), mRNA 


0.087 


3650 


NM 032717 


Homo sapiens hypothetical protein MGC11324 (MGC 11324), mRNA 


0.62 


3651 


"VTV/f ClfiAA&O 
AJVl_UU'f*fOZ 


Homo sapiens signal sequence receptor, alpha (translocon-associated 

piuidii oipiiaj ^ooxvi j, iilt\-i.>xy 


0.79 


3652 




Phragmites australis rpl36-rps8 intergenic spacer, chloroplast 
sequence 


0.69 


3653 


Zoo 3 /4 


xi. sapiens v^pvj lsidiiu jl/jna. genomic lvxbci xxagxucxii, uuuc o^uj, 
iorwara reau cpgozoo.ixia 


0.22 


3654 


U10927 


Staphylococcus aureus M type 1 capsular polysaccharide biosynthesis 
(cap/v, caprs, capi^, capiv, cape, capr, capvj, uapxx, wipi, v^apj, t,apjv, 
capL, capM) genes, complete cds 


0.026 


3655 


AE007446 


Streptococcus pneumoniae section 129 of 194 of the complete genome 


0.18 


3656 


BC006035 


Mus musculus, clone 1MAGE:3492938, mRNA, partial cds 


5.3 


3657 


AF352740 


Oncorhynchus mykiss clone OMM1055 microsatellite sequence 


2.1 ! 


3658 


NM 032141 


Homo sapiens hypothetical protein DKFZp434K1421 
(DKFZP434K1421), mRNA 


0 


3659 


AF275231 


Phloeomyzus passerinii 12S small subunit ribosomal RNA gene, 
partial sequence; tRNA-Val gene, complete sequence; and 16S large 
subunit ribosomal RNA gene, partial sequence; mitochondrial genes 
for mitochondrial products 


0.077 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQED 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3660 


AF017646 


Schizosaccharomyces pombe TFIIH subunit p47 (tfh47) gene, 
complete cds 


0.72 


3661 


AF200526 


Zea mays cellulose synthase-2 (CesA-2) mRNA, complete cds 


2.2 


3662 


AJ297654 


Sus scrofa AACT2 gene for alpha-l-antichymotiypsin 2, exons 1-5 


0.74 


3663 


XM 018189 


Homo sapiens hypothetical protein FLJ13949 (FLJ13949), mRNA 


0.74 


3664 


AE004047 


Xylella fastidiosa 9a5c, section 193 of 229 of the complete genome 


2.2 


3665 


XM 003012 


Homo sapiens EphBl (EPHB1), mRNA 


6.9 


3666 


AF083240 


Shewanella putrefaciens multi -cytochrome gene cluster, complete 
sequence; ferrous iron transporter (feoB), deca-heme c-type 
cytochrome (mtrC), c-type cytochrome precursor (mtrA), and outer 
membrane protein precursor (mtrB) genes, complete cds 


0.085 


3667 


AF100304 j 


Caenorhabditis elegans cosmid W07B3, complete sequence 


0.74 


3668 


AP001348 


Homo sapiens genomic DNA, chromosome 21q21.1-q2L2 
clone:B9G8, LL56-APP region, complete sequence 


2E-18 


3669 


ABO05O39 


Alternaria alternata MAT2 gene, complete cds 


6.4 


3670 


NM 009181 


Mus musculus sialyltransferase 8 (alpha-2, 8-sialytransferase) B 
(Siat8b), mRNA 


0.22 


3671 


AB045894 


Nepenthes alata NaAP4 mRNA for aspartic proteinase 4, complete cds 


7 


3672 


M74944 


Simian immunodeficiency virus (32H Re-isolate of SIVmac 251, clone 
W9) env gene, complete cds 


0.25 


3673 


AF288211 


Danio rerio Nkx5-1 mRNA, complete cds 


0.072 


3674 


AC006753 


Caenorhabditis elegans cosmid Y40A1 A, complete sequence 


0.02 


3675 


S53497 


immunoglobulin epsilon chain constant region=secreted form {3' 
region} [human, B cell myeloma U-266, Genomic, 3198 nt] 


6.8 


3676 


AL5 12305 


Human DNA sequence from clone RP1 1-85E24 on chromosome 6, 
complete sequence [Homo sapiens] 


0.009 


3677 


AF207862 


Bos taurus bradykinin receptor B2 (BDKRB2) gene, promoter 


5.8 


3678 


AF269417 


Staphylococcus epidermidis strain SRI clone step.l003e03 genomic 
sequence 


5.1 


3679 


X53232 


Rat mRNA for preoptic regulatory factor-2 (PORF-2) 


2 


3680 


AB005039 


Alternaria alternata MAT2 gene, complete cds 


6.5 


3681 


AE003252 


Drosophila melanogaster genomic scaffold 142000013385369, 
complete sequence 


1.9 


3682 


U19744 


Caenorhabditis elegans integrin beta pat-3 gene, complete cds 


0.083 


3683 


XM 010829 


Homo sapiens potassium channel, subfamily K, member 3 (TASK-1) 
(KCNK3), mRNA 


0.25 


3684 


AF065853 


Homo sapiens OR7E11P pseudogene, partial sequence 


0.077 


3685 


AF325859 


Choristoneura fumiferana antifreeze protein (AFP-Lul) gene, 
complete cds 


2.3 


3686 


XM 017134 


Homo sapiens breast cancer 2, early onset (BRCA2), mRNA 


0.23 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3687 


AE002139 


Ureaplasma urealyticum section 40 of 59 of the complete genome 


0.17 


3688 


J00241 


Human Ig germline kappa-L chain, C region (inv3 allele) 


0.22 


3689 


U23855 


Bos taunts von Willebrand factor gene, 5* flanking region and exon 1 


0.78 


3690 


AKO 15465 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4930455H04, full insert sequence 


6.2 


3691 


XM_042056 


Homo sapiens LOC86123 (LOC86123), mRNA 


2 


3692 


AL137458 


Homo sapiens mRNA; cDNA DKFZp434E2221 (from clone 
DKFZp434E2221) 


0.086 


3693 


Z81079 


Caenorhabditis elegans cosmid F39H11, complete sequence 


2 


3694 


AF142998 


Pteronotus parnellii NADH dehydrogenase subunit 1 (ND1) gene, 
partial cds; mitochondrial gene for mitochondrial product 


0.69 


3695 


AK012282 


Mus musculus 1 1 days embryo cDNA, RIKEN full-length enriched 
library, clone: 2700023124, full insert sequence 


0.75 


3696 


AB032937 


Cucumis sativus CS-ACS1 gene for 1 -aminocyclopropane- 1 - 
carboxylate synthase, complete cds 


1.5 


3697 


M92295 


Gorilla gorilla gamma- 1 and gamma-2 globin genes, complete cds 


0.029 


3698 


AF034085 


Caenorhabditis elegans UNC-45 (unc-45) gene, complete cds 


0.0001 


3699 


AB048996 


Macaca fascicularis brain cDNA, clone:QnpA-14055 


0.25 


3700 


AK024527 


Homo sapiens cDNA: FLJ20874 fis, clone ADKA02818 


0.000001 


3701 


AKO 14489 


Mus musculus 14 days embryo liver cDNA, RIKEN full-length 
enriched library, clone:4432412D04, full insert sequence 


4.3 


3702 


NM 022599 


Rattus norvegicus outer membrane protein (Omp25), mRNA 


1E-14 


3703 


AF177242 


Bodo saltans clone pBME40 mitocondrial minicircle DNA 


0.021 


3704 


AF183329 


Homo sapiens clone NIGMS NA10926D chromosome 10 paralogous 
sequence variant, genomic sequence 


5E-61 


3706 


AF374728 


Saxifragella bicuspidata maturase K (matK) gene, partial cds; 
chloroplast gene for chloroplast product 


0.65 


3707 


AK010113 


Mus musculus adult male tongue cDNA, RIKEN full-length enriched 
library, clone:23 10068 J16, full insert sequence 


5E-29 


3708 


XM_045657 


Homo sapiens McKusick-Kaufman syndrome (MKKS), mRNA 


1.8 


3709 


AJ223690 


H.sapiens D2-l-K2G7np gene for immunoglobulin kappa chain 
variable region 


0.026 


3710 


XM 008389 


Homo sapiens Clq-related factor (CRF), mRNA 


0.026 


3711 


AF200526 


Zea mays cellulose synthase-2 (CesA-2) mRNA, complete cds 


2.3 


3712 


AE000049 


Mycoplasma pneumoniae M129 section 3 1 of 63 of the complete 
genome 


0.68 


3713 


XM_002495 


Homo sapiens cyclic nucleotide gated channel alpha 3 (CNGA3), 
mRNA 


2.1 


3714 


AF322106 


Oncorhynchus mykdss lysozyme.n precursor, gene, complete cds 


0.7 


3715 


AK002007 


Homo sapiens cDNA FLJ11145 fis, clone PLACE1006626, highly 
similar to Homo sapiens mRNA for KIAA0928 protein 


0.029 


3716 


Z73978 


Caenorhabditis elegans cosmid ZC302, complete sequence 


0.23 


3717 


NM 023979 


Rattus norvegicus apoptotic protease activating factor 1 (Apafl), 
mRNA 


0.52 ! 
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Table 3A Nearest Neighbor (BlastN vs. Geribank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3718 


AF335504 


Oryza sativa hemoglobin 1 (hbl), hemoglobin 3 (hb3), and 
hemoglobin 4 (hb4) genes, complete cds 


0.077 


3719 


BC006943 


Mus musculus, Similar to RIKEN cDNA 3230401N03 gene, clone 
MGC:6906 IMAGE:2655807, mRNA, complete cds \ 


2 


3720 


AK007159 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone: 1700110101, full insert sequence 


6 


3721 


AF033002 


Cerebratulus lacteus body wall globin gene, complete cds 


0.026 j 


3722 


X95276 


Rfalciparum complete gene map of plastid-like DNA (IR-B) 


0.2 


3723 


XM 012629 


Homo sapiens DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 8 
(RNA helicase) (DDX8), mRNA 


L4 


3724 


XM 042643 


Homo sapiens hypothetical protein DKFZp434G131 
(DKFZP434G131), mRNA 


0.69 


3725 


AF 145640 


Drosophila melanogaster clone GH07346 BcDNA.GH07346 
(BcDNA.GH07346) mRNA, complete cds 


2.3 


3726 


X58823 


Bovine COX7cPl retroposed pseudogene for cytochrome c oxidase 
subunit VIIc 


0.21 


3727 


XM 052301 


Homo sapiens hypothetical protein MGC3 199 (MGC3199), mRNA 


6E-25 


3728 


U00176 


Dictyostelium mucoroides DMUC2 clone p288m2 Gl-like and G5- 
like ORFs* proteins, complete cds 


0.00009 


3729 


AJ132901 


Drosophila guanche mitochondrial A+T-rich region 


0.071 


3730 


AL583762 


Human chromosome 14 DNA sequence Partial sequence from BAC R- 
1109N18_PCR1 of library RPCI-11 firom chromosome 14 of Homo 
sapiens (Human), complete sequence 


7E-35 


3731 


AK021083 


Mus musculus adult male corpus striatum cDNA, RIKEN full-length 
enriched library, clone:C030015A19, full insert sequence 


1.5 


3732 


XM_001934 


Homo sapiens PRO0529 protein (PRO0529), mRNA 


1.4 


3733 


AJ235836 


Labordia tinifolia chloroplast ndhF gene 


2.2 j 


3734 


M81855 


Rat mdr mRNA sequence 


1.9 


3735 


NMJ)31649 


Rattus norvegicus killer cell lectin-like receptor subfamily G, member 
1 (Klrgl), mRNA 


4.9 


3736 


Z78014 


Caenorhabditis elegans cosmid F42E8, complete sequence 


0.002 


3737 


XM_045320 


Homo sapiens chondroitin sulfete proteoglycan 6 (bamacan) (CSPG6), 
mRNA 


0.0005 


3738 


AF288788 


Staphylococcus aureus rot-like protein Rip (rip) gene, complete cds 


0.4 


3739 


D16417 


Dictyostelium discoideum mRNA 


0.008 


3740 


X15465 


Hamster SPC14 LINE LI repeat DNA 


0.000003 


3741 


! Y14544 


Danio rerio mRNA for Hoxc8 protein 


0.014 


3742 


M13897 


Rattus norvegicus thyrotropin beta subunit (TSH-beta) gene, complete 
cds, clones RP100-14 and RP21 


3.5 


3743 


AE001390 


Plasmodium falciparum chromosome 2, section 27 of 73 of the 
complete sequence 


0.018 1 


3744 


XM 035389 


Homo sapiens ribosomal protein, large, PI (RPLP1), mRNA 


9E-36 


3745 


XM_007409 


Homo sapiens alkylation repair; alkB homolog (ABH), mRNA 


0.057 
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Table 3 A Nearest Neighbor (BlastN vs. Genbank) 


JNVJ 


ACCESSN 


DESCRIP 


P VALUE 


3746 


AE007425 


Streptococcus pneumoniae section 108 of 194 of the complete genome 


0.15 


3747 


Z38113 


S.cerevisiae chromosome DC lambda clone 4554 


4.9 


3749 


X80491 


S.latifolia SLM4 mRNA 


0.02 


3750 


XM_035466 


Homo sapiens hBKLF for basic kruppel like factor (LOC51274), 
mRNA 


2E-35 


3751 


Z75951 


Human DNA sequence from cosmid U203H4, between markers 
DXS366 and DXS87 on chromosome X 


0.000001 


3752 


AE000800 


Methanobacterium thermoautotrophicum from bases 58271 to 68710 
(section 6 of 148) of the complete genome 


0.19 


3754 


AJ409501 


Mus musculus RNA binding site for Dazl protein, clone gflO 


0.73 


3755 


. AJ243829 


Oryza sativa RPA gene for protein phosphatase 2A A subunit, exons 1- 
12 


3E-25 


3756 


AK026692 


Homo sapiens cDNA: FLJ23039 fis, clone LNG02242 


0.022 


3757 


S81000 


{microsatellite regions} [Ciyptococcus neoformans=pathogenic yeast, 
var. gattii, serotype B, UCLA 371-B, human isolate, Genomic, 283 nt] 


0.94 


3758 


AF252420 


Bos taurus clone MNB-5 microsatellite sequence 


0.074 


3759 


AF338734 


Homo sapiens hypothetical PHD zinc finger protein XAP135 
pseudogene, complete sequence 


2.1 


3760 


AJ401038 


Human immunodeficiency virus type 1 proviral gpl60 gene for 
envelope protein, strain 97DC.KFE267 


0.79 


3761 


NCJ)02087 


Piasmid pRL765, complete sequence 


0.71 


3762 


AE004161 


Vibrio cholerae chromosome I, section 69 of 251 of the complete 
chromosome 


1.9 


3763 


Z54349 


Rsapiens MN/CA9 GENE 


0.56 


3764 


AF372396 


HIV-1 isolate CM53658 from Cameroon envelope glycoprotein (env) 
gene, partial cds 


1.8 


3765 


AY021127 


Oiyza sativa microsatellite MRG3452 containing (TA)X19, genomic 
sequence 


1.9 


3766 


AK021485 


Homo sapiens cDNA FLJ11423 fis, clone HEMBA1001024 


3.4 


3767 


AF269468 


Staphylococcus epidermidis strain SRI clone step.l005a02 genomic 
sequence 


0.69 


3768 


AF229080 


Rattus norvegicus glucagon receptor gene, promoter and partial cds 


0.71 


3769 


X04384 


Sea urchin (L.pictus) gene for testis specific histone H2B-2 


0.23 


3770 


Z36061 


S.cerevisiae chromosome II reading frame ORF YBR192w 


0.087 


3771 


AB038488 


Halimeda discoidea chloroplast rbcL gene for ribulose-1,5- 
bisphosphate carboxylase/oxygenase large subunit, partial cds 


0.25 ! 


^772 


XMJ)03759 


Homo sapiens leukocyte cell-derived chemotaxin 2 (LECT2), mRNA 


0.000001 


3773 


AF122979 


Tridacna maxima 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


0.076 


3774 


L44120 


Homo sapiens (clone pHK1.5D) CMT1A gene, primer binding site 


5E-39 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3775 


AC008855 


Homo sapiens chromosome 5 clone CTD-2178M23, complete 
sequence 


0.73 


3776 


AF267225 


Candidates Carsonella ruddii natural-host Tainarys sordida ATP 
synthase alpha subunit (atpA) gene, partial cds; ATP synthase gamma 
subunit (atpG) gene, complete cds; and ATP synthase beta subunit 
(atpD) gene, partial cds 


0.024 


3777 


U67231 


Human clone HS4.74 Alu-Ya5 sequence 


2E-11 


3778 


XM 038985 


Homo sapiens similar to HYPOTHETICAL PROTEIN KIAA0379 (H. 
sapiens) (LOC91526), mRNA 


0.13 


3779 


X71625 


H.sapiens dinucleotide repeat polymorphism cA14 


0.057 


3780 


BC007480 


Mus musculus, clone MGC7570 IMAGE:3493227, mRNA, complete 
cds 


0.59 


3781 


U60153 


Human immunodeficiency virus type 1 isolate ID4-77, clone 12, from 
USA, envelope glycoprotein (env) gene, partial cds 


0.013 


3782 


XM 050963 


Homo sapiens hypothetical protein DKFZp56401664 
(DKFZP56401664), mRNA 1 


0 


3783 


AC024236 


Homo sapiens BAC clone RP11-400I17 from Y, complete sequence 


0.0001 


3784 


M17627 


Chicken alpha-A-ciystallin gene, complete cds and 5* flank 


0.077 


3785 


NM 031557 


Rattus norvegicus Prostaglandin 12 (prostacyclin) synthase (Ptgis), 
mRNA 


5.8 


3786 


Z79647 


B.thayeri mitochondrial 16S rRNA gene 


0.23 


3787 


AF032386 


Nicotiana tabacum aldose- 1 -epimerase-like protein (GP40) mRNA, 
complete cds 


0.077 


3788 


X89568 


P.sativum mRNA for HMGI/Y protein 


5.8 ! 


3789 


i Z71180 


Caenorhabditis elegans cosmid F22E12, complete sequence 


0.24 


3790 


AJ239079 


Tetrahymena pyriformis partial gc4 gene for guanylyl cyclase, isoform 
4 


0.79 


3791 


AF3 10890 


Dictyostelium discoideum PcmA (pcmA) and RacE (racE) genes, 
complete cds; tRNA-Phe gene, complete sequence; and unknown gene 


0.00003 


3792 


L22300 


Chilo iridescent virus type 6 zinc finger protein and non-histone 
chromosomal high mobility group protein homolog genes, complete 
cds 


0.021 


3793 


AF155054 


Rickettsia slovaca protein PS 120 (D) gene, partial cds 


0.47 


3794 


X59799 


M. musculus S-antigen gene promoter region 


0.2 


3795 


AF330221 


Oncorhynchus keta clone Oke4 microsatellite sequence 


0.56 


3796 


XM.043227 


Homo sapiens similar to proteasome (prosome, macropain) 26S 
subunit, non-ATPase, 13 (H. sapiens) (LOC92153), mRNA 


6 


3797 


D17668 


Clostridium septicum gene for alpha-toxin, complete cds 


0.39 


3798 


AF165135 


Mycoplasma mycoides mycoides SC strain Afade hypothetical proline 
rich glycoprotein, hypothetical surface-located membrane protein, 
lipoprotein B precursor (lppB), and hypothetical ABC transporter 
protein genes, complete cds; hypothetical ABC transport> 


1.5 
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Tabic 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3799 


AF379892 


Rhagigaster sp. M222 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


0.068 


3800 


NM 024379 


Rattus norvegicus glutamate receptor delta-2 subunit (Grid2), mRNA 


1.5 


3801 


AF364550 


Drosophila melanogaster transposon cruiser, complete sequence 


5.4 


3803 


AJ315160 


Xenopus tropicalis BMP-2 gene for bone morphogenetic protein 2 


0.24 


3804 


AJ293015 


Pisum sativum loxl:Ps:7 gene for lipoxygenase, exons 1-8 


0.079 


3805 


Z82768 


R.prowazekii genomic DNA fragment (clone A810F) 


1.5 i 


3806 


NM 013780 


Mus musculus neuronal PAS domain protein 3 (Npas3), mRNA 


0.72 


3807 


U56088 


Human periodic tryptophan protein 2 (PWP2) gene, exons 3 to 14 


0.25 


3808 


D88987 


Mus musculus Ampd3 gene, exon 4 


0.18 


3809 


Z72680 


S.cerevisiae chromosome VII reading frame ORF YGL158w 


0.73 


3810 


M17988 


Spiroplasma virus 4 (SpV4) replicative form, complete genome 


5.3 


3811 


XM 039684 


Homo sapiens putative G-protein coupled receptor (SH120), mRNA 


0.17 


3812 


XM 039684 


Homo sapiens putative G-protein coupled receptor (SH120), mRNA 


0.19 


3813 


X60735 


S.murinus Dq52 gene and immunoglobulin heavy chain S region gene 
loci 


1.4 


3814 


XM 045862 


Homo sapiens ring finger protein 10 (RNF10), mRNA 


5E-14 


3815 


AB005521 


Homo sapiens ppar gamma gene for peroxisome proliferator activated- 
receptor gamma, exon 1 


1.6 


3816 


AF271964 


Bos taurus microsatellite MNB-162 sequence 


0.019 


3817 


AF274307 


Zaocys dhumnades creatine kinase mRNA, complete cds 


6.1 


3818 


BC010716 


Mus musculus, clone MGC:6551 IMAGE:2655861, mRNA, complete 
cds 


5.5 


3819 


AE004219 


Vibrio cholerae chromosome I, section 127 of 251 of the complete 
chromosome 


5.7 


3820 


AB060193 


Macaca fascicularis brain cDNA clone:QccE-22277, full insert 
sequence 


1.9 


3821 


AY007106 


Homo sapiens clone TCCCIA00427 mRNA sequence 


3E-45 


3822 


X02801 


Mouse gene for glial fibrillary acidic protein (GFAP) 


0.08 


3824 


AJ234506 


Hordeum vulgare genomic DNA fragment; clone MWG0561.uni 


2.1 


3825 


AK019864 


Mus musculus 11 days pregnant adult female ovary and uterus cDNA, 
RIKEN full-length enriched library, clone:5031405K23, full insert 
sequence 


1 o 

1.8 


3826 


AK023304 


Homo sapiens cDNA FLJ13242 fis, clone OVARC1000578 


6.5 


3827 


AB062056 


Homo sapiens p53DINPl gene for p53DINPla, p53DINPlb, complete 
cds, alternative splicing 


0.055 


3828 


AF379858 


Dendrocerus carpenteri 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


0.07 


3829 


X05621 


Chironomus pallidivittatus BR1 gene for giant secretory protein 


0.077 
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3830 


AF108841 


Homo sapiens human endogenous retrovirus HERV-H10 pol protein 
(pol) gene, partial cds; env pseudogene and 3' LTR, complete 
sequence 


0.009 


3831 


AF309030 


Ixodes scapularis haplotype SSCP NC2_29 16S ribosomal RNA gene, 
partial sequence; mitochondrial gene for mitochondrial product 


0.21 


3832 


AF268055 


Candida tus Carsonella ruddii natural-host Glycaspis brimblecombei 
RNA polymerase beta subunit (rpoB) and RNA polymerase beta-prime 
subunit (rpoC) genes, partial cds 


0.0007 


3833 


AY021307 


Oiyza sativa microsatellite MRG3632 containing (TA)X23, genomic 
sequence 


0.009 


3834 


XM 034551 


Homo sapiens dual-specificity tyrosine-(Y)-phosphorylation regulated 
kinase 4 (DYRK4), mRNA 


0.00003 


3835 


AK001647 


Homo sapiens cDNA FLJ 10785 fis, clone NT2RP4000457, weakly 
similar to UBIQUTTIN CARBOXYL-TERMINAL HYDROLASE 15 
(EC 3.1.2.15) 


0 


3836 


XM 043492 


Homo sapiens 42259 (KIAA1728), mRNA 


5E-19 


3837 


U22345 


Human chromosome 20ql2 locus-specific repeat 


0.003 


3838 


J05535 


Bovine vascular smooth muscle connexin43 mRNA, complete cds 


0.18 


3839 


AF290221 


Cucumis sativus clone B80F mitochondrial genomic sequence 


0.065 


3840 


NM 010059 


Mus musculus disrupted meiotic cDNA 1 homolog (Dmclh), mRNA 


5.3 


3841 


NM 017681 


Homo sapiens hypothetical protein FLJ20130 (FTJ20130), mRNA 


2 


3842 


AE002114 


Ureaplasma urealyticum section 15 of 59 of the complete genome 


1.9 


3843 


AK022372 


Homo sapiens cDNA FU12310 fis, clone MAMMA1001970 


0.019 


3844 


U33633 


Rice ragged stunt virus minor structural protein gene, complete cds 


0.21 i 


3845 


AB062993 


Macaca fascicularis brain cDNA clone:QmoA-10825, full insert 
sequence 


4E-40 


3846 


AJ251485 


Gorilla gorilla partial lmp7 gene for large multifunctional protease 7, 
exon 7 and tap2 gene for antigen peptide transporter 2, exons 1-6 


0.71 ! 


3847 


AP000145 


Homo sapiens genomic DNA, chromosome 21q21.2, LL56-APP 
region, clone B2291C14-R44F3, segment 10/10, complete sequence 


0.68 


3848 


XM 046052 


Homo sapiens similar to ribosomal protein S3 A (H. sapiens) 
(LOC65465), mRNA 


0.017 


3849 


XM 012036 


Homo sapiens potassium inwardly-rectifying channel, subfamily J, 
member 5 (KCNJ5), mRNA 


5 


3850 


AB035883 


Pseudoregma bambucicola mitochondrial genes for small subunit 
rRNA, tRNA-Val, large subunit rRNA, partial and complete 
sequences, country: Taiwan: Sun Moon Lake 


0.077 


3851 


1 AB037688 


Xenopus laevis Xptch-2 mRNA for patched-2, complete cds 


1.8 
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3852 


AB041264 1 


VIus musculus pseudogene for GTP-binding protein betal subunit 


0.53 


3853 


] 

AF288170 


Plasmodium falciparum isolate MM cytoadherence linked asexual 
protein (clag9) mRNA, partial cds 


0.083 


3854 


AF304321 . 


Neospora caninum RNA polymerase CI (rpoCl) gene, partial 
sequence, plastid gene for plastid product 


0.002 


3855 


] 

AF133181 


Peziza varia 18S ribosomal RNA gene, and internal transcribed spacer 
1, partial sequence 


9 1 

J*. J. 


3856 


NM 012694 


Rattus norvegicus Solute carrier family 6 (neurotransmitter 
transporter, dopamine), member 3 (SIc6a3), mRNA 


0.56 


3857 


NM 030261 


Mus musculus hypothetical protem MuC/ioi (Moc/iozj, hikjna 




3858 


AF231164 


Pseudosymblepharis schimperiana tRNA-Leu, partial sequence; trnL- 
trnF intergenic spacer, complete sequence; and tRNA-Phe, partial 
sequence; chloroplast genes for chloroplast products 




3859 


Z36070 


S.cerevisiae chromosome II reading frame ORF YBR201w 


0.064 


3860 


Z24279 


H. sapiens (D1S489) DNA segment contaimng (tA) repeat, clone 
AFM309ve9; single read 


0 2 


3861 


XM 036937 


Homo sapiens KIAA1020 protem (KlAAiuzuj, mKiN/v 


2 1 


3869 


AB033019 


Homo sapiens mRNA for KIAA1193 protein, partial cds 


1.6 


3864 


XM 030075 


Homo sapiens hypothetical protem DKFZp434D0513 
(DKFZp434D0513), mRNA 


2E-74 


3865 


NM 023979 


Rattus norvegicus apoptotic protease activating factor 1 (Apafl), 
mRNA 


0.59 


J o w 


U50715 


Mus musculus alpha-galactosidase A gene, complete cds 


0 008 


3867 


AF078780 


Caenorhabditis elegans cosmia CU4I*2 


1.7 


3869 


L07305 


Histoplasma capsulatum (clone pMo3) ri-AiJrase gene, complete cos 


0.46 


3870 


XM_035796 


Homo sapiens hypothetical gene supported by AF001893; AH005586 
(LOC91063), mRNA 


1.7 


3871 


NM 025788 


Mus musculus RIKEN cDNA 49305 UN 13 gene (493051 IN UKiK), 
mRNA 


5.4 


3879 


AF339787 


Homo sapiens clone IMAGE:205688, mRNA sequence 


2 


3873 


AF028784 


Rattus norvegicus glial fibrillary acidic proteins alpha and delta 
(GFAP) gene, alternatively spliced products, complete cds 


ft ftftS 


JO /*r 


AF328904 


Mus musculus lysyl-tRNA synthetase (Kars) gene, exon 14, complete 
cds; adenosine deaminase tRNA-specific 1 (Adatl) gene, exons EA, 1 
and partial cds 


0.22 


3875 


AF033647 


Rhytidiadelphus triquetrus 23S ribosomal RNA gene, partial 
cA/m<»iir¥»' internal transcribed snacer 2. 4.5S ribosomal RNA gene, 
internal transcribed spacer 3, and 5S ribosomal RNA gene, complete 
sequence, chloroplast genes for chloroplast RNAs 


2 


3876 


BC006035 


Mus musculus, clone IMAGE:3492938, mRNA, partial cds 


5.6 


3877 


AF3 11856 


Homo sapiens spectrin beta IV (SPTBN3) mRNA, complete cds, 
alternatively spliced 


0.5 


3878 


AB041327 


Anguilla japonica GnRH receptor mRNA for gonadotropin-releasing 
hormone receptor, complete cds 


0.24 
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3879 


AF220175 


Homo sapiens acid ceramidase (ASAH) gene, exons 5 through 14, and 
complete cds 


0.0009 


3880 


X89398 


H. sapiens ung gene for uracil DNA-glycosylase 


0.002 


3881 


L14017 


Staphylococcus aureus methicillin-resistance protein (mecR) gene and 
unknown ORF, complete cds 


0.56 


3882 


M97008 


Xenopus laevis/gilli hybrid IgM chain gene, clone LG7Gmu 


0.007 


3883 


Z78414 


Caenorhabditis elegans cosmid W09D12, complete sequence 


0.63 


3884 


AL5 12549 


S.pombe chromosome I BAC pB2B4 


1.9 


3885 


AB030616 


Mus musculus Chetk-alpha gene for choline/ethanolamine kinase- 
alpha, exon 1 and 5-flanking region 


6.4 


3886 


AF069535 


Bradypus tridactylus 16S ribosomal RNA gene, mitochondrial gene 
for mitochondrial RNA, complete sequence 


1.5 


3887 


AK023304 


Homo sapiens cDNA FLJ13242 fis, clone OVARC1000578 


0.056 


3888 


AB039756 


Brassica rapa SP 11-32 mRNA for S locus protein 11-32, partial cds, 
strain: S32(50-l It) 


1.9 


3889 


AL5 12405 


Human DNA sequence from clone RP1 1-370N21 on chromosome 6, 
complete sequence [Homo sapiens] 


1.6 


3890 


AF147813 


Protrama radicis elongation factor 1 alpha gene, partial cds 


0.18 


3891 


XM 046967 


Homo sapiens region containing variable charge protein on X with 
eight repeats; variable charge protein on X with two repeats; variable 
charge, X chromosome (LOC92738), mRNA 


0.21 


3892 


XM 037664 


Homo sapiens hypothetical protein MGC4090 (MGC4090), mRNA 


1.5 


3893 


AJ296142 


Silene foetida partial rpb2 gene for RNA polymerase EL, exons 23 and 
24 


0.16 


3894 


AF360340 


Arabidopsis thaliana putative pectinesterase (K19E1.17/AT5g53370) 
mRNA, complete cds 


2.7 


3895 


AF327975 


Hentzia palmarum 16S ribosomal RNA, partial sequence; 
mitochondrial gene for mitochondrial product 


0.55 


3896 


Z15140 


L.esculentum mRNA for chitinase 


2.1 


3897 


U57326 


Chlamydomonas reinhardtii chloroplast complete EcolO fragment, 
rpoC2 gene, partial cds 


0.001 


3898 


AK024908 


Homo sapiens cDNA: FLJ21255 fis, clone COLO 1321 


0.003 


3899 


NC 002087 


Plasmid pRL765, complete sequence 


0.62 


3900 


AF063220 


Papaya ringspot virus isolate P polyprotein gene, partial cds 


0.25 | 


3901 


AE002173 


Chlamydophila pneumoniae AR39, section 9 of 94 of the complete 
genome 


0.25 


3902 


BC001152 


Homo sapiens, growth arrest-specific 7, clone MGC:1347 
IMAGE:3353809, mRNA, complete cds 


5.9 


3903 


AB011800 


Agrobacterium tumefaciens plasmid pRiA4 gene, virB operon 


5.4 


3904 


BC010389 


Homo sapiens, clone MGC: 13615 IMAGE.4283814, mRNA, 
complete cds 


0.084 


3905 


AF091779 


Oncorhynchus mykiss non-classical MHC class I antigen (Onmy- 
UAA) gene, Onmy-UAA*0101 allele, complete cds 


2.3 


3906 


AF222716 


Plasmodium falciparum clone 3D7 unconventional myosin PfM-B 
gene, complete cds 


6.2 
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3907 


Z48951 


S.cerevisiae chromosome XVI cosmid 9723 


2.5 


3908 


XM_040326 


Homo sapiens absent in melanoma 1 (AIM1), mRNA 


0.056 


3909 


AF271964 


Bos taurus microsatellite MNB-162 sequence 


0.019 


3910 


BC003261 


Mus musculus, serine/threonine kinase 5, clone MGC:5803 
IMAGE:3501444, mRNA, complete cds 


0.056 


3911 


AJ011641 


Arabidopsis thaliana (ecotype Columbia) spl8 gene, exons 1-3 


1.7 


3912 


AK000864 


Homo sapiens cDNA FLJ10002 fis, clone HEMBA1000046 


0.53 


3913 


XM 045747 


Homo sapiens midline 1 (Opitz/BBB syndrome) (MIDI), mRNA 


2E-33 


3914 


AF269581 


Staphylococcus epidermidis strain SRI clone step.l011g07 genomic 
sequence 


0.057 


3915 


XM 042333 


Homo sapiens casein kinase 2, beta polypeptide (CSNK2B), mRNA 


1.9 


3916 


AF269810 


Staphylococcus epidermidis strain SRI clone step.l023h06 genomic 
sequence 


0.022 


3917 


AJ404228 


Saccharomyces douglasii mitochondrial tRNA gene cluster 


0.21 


3918 


XM_027223 


Homo sapiens hypothetical gene supported by AK021954 
(LOC89916), mRNA 


0.61 


3919 


AJ011641 


Arabidopsis thaliana (ecotype Columbia) spl8 gene, exons 1-3 


2 


3920 


AF020725 


Gallus gallus unoccupied integration site for endogenous ALV-type 
retroviral element ev-B2 


2 


3921 


AL157825 


Human DNA sequence from clone RP11-518C21 on chromosome 
Xql3.2-2L1, complete sequence (Homo sapiens] 


1.2 


3922 


AF203676 


Myotis myotis microsatellite H23 sequence 


0.2 


3923 


XM 010734 


Homo sapiens smg GDS-ASSOCIATED PROTEIN (Kifap3), mRNA 


0.66 


3924 


AF056048 


Heterodera glycines beta-l,4-endoglucanase-3 precursor (eng-3) gene, 
promoter and complete cds 


0.026 


3925 


XM 042676 


Homo sapiens hypothetical protein MGC4618 (MGC4618), mRNA 


2.2 


3926 


AL359750 


Human DNA sequence from clone RP1 1-24 A10 on chromosome 13, 
complete sequence [Homo sapiens] 


0.024 


3927 


NMJU9054 


Homo sapiens hypothetical protein MGC5560 (MGC5560), mRNA 


2E-31 


3928 


AL023845 


Caenorhabditis elegans cosmid Y51B9A, complete sequence 


0.25 


3929 


BC001781 


Homo sapiens, ribosomal protein L44, clone MGC:2064 
IMAGE:3353669, mRNA, complete cds 


7E-43 


3930 


AK004553 


Mus musculus adult male lung cDNA, RIKEN full-length enriched 
library, clone: 1200002022, full insert sequence 


0.51 


3932 


X17339 


Dengue-2 virus NS1 gene for nonstructural protein (patient M2) 


0.23 


3933 


AE004278 


Vibrio cholerae chromosome I, section 186 of 251 of the complete 
chromosome 


0.35 1 


3934 


U41064 


Drosophila melanogaster putative extracellular ligand trunk gene, 
complete cds 


0.53 


3935 


AF005900 


Homo sapiens alpha2B-adrenergic receptor (alpha2C2AR) gene, 
complete cds 


5.2 


3936 


| AK024599 


Homo sapiens cDNA: FLJ20946 fis, clone ADSE01819 


5 
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3937 


K00489 


human enkephalin gene, intron c (3* end) and 3' flank 


0.24 


3938 


U92545 


Callus gallus tubby-like protein (TULP1) mRNA, complete cds 


1.8 


3939 


Z69363 


Human DNA sequence from cosmid L60G9B, Huntington's Disease 
Region, chromosome 4pl6.3 contains ESTs 


0.55 


3941 


U82516 


Dictyostelium discoideum random slug cDNA22 protein (rsc22) 
mRNA, complete cds 


1.8 


3942 


NM 026077 


Mus musculus RIKEN cDNA 3 1 10040N1 1 gene (3 1 10040N1 IRik), 
mRNA 


e-107 


3943 


AF241235 


Homo sapiens Na+,K+-ATPase gamma -subunit (FXYD2) gene, 
alternatively spliced isoforms 1 and 2, exons 1 through 6 and 
complete cds 


0.00002 


3944 


M38180 


Human 3-beta-hydroxysteroid dehydrogenase/delta-5-delta-4- 
isomerase (3-beta-HSD) gene, complete cds 


0.64 


3945 


AJ275020 


Homo sapiens partial F AH gene for fumarylacatoacetate hydrolase, 
intron 10 


0.71 


3946 


AB042198 


Mus musculus mac25 gene promoter and exon 1, partial cds 


1.9 


3947 


AF082520 


Cryptosporidium parvum Hsp60 gene, partial cds 


0.68 


3948 


Z48051 


H.sapiens gene for myelin oligodendrocyte glycoprotein (MOG) 


0.71 


3949 


AF269768 


Staphylococcus epidermidis strain SRI clone step.l023b09 genomic 
sequence 


0.59 


3950 


AK022181 


Homo sapiens CDNAFLJ12119 fis, clone MAMMA1000092 


0.62 


3951 


AF186889 


Chlosyne janais 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


0.056 


3952 


AB026544 


Hyoscyamus niger trl gene for tropinone reductase-I, complete cds 


0.17 


3954 


M32666 


Homo sapiens 1TGB3 gene, intron 1, partial sequence 


2 


3955 


Z72888 


S.cerevisiae chromosome VII reading frame ORF YGR103w 


0.009 


3956 


AB038979 


Homo sapiens gene for caspase-10, exon 11 and complete cds 


0.001 


3958 


AF148987 


Legionella fallonii macrophage infectivity potentiator (mip) gene, 
partial cds 


0.075 


3959 


AF159173 


Gallus gallus structural muscle protein titin mRNA, partial cds 


1.2 


3960 


U34931 


Mycoplasma pulmonis FtsZ (ftsZ) gene, complete cds, methionyl- 
tRNA synthetase (metG) gene, partial cds 


0.21 


3961 


AF072439 


Rattus norvegicus zinc-finger protein-37 mRNA, complete cds 


0.58 


3962 


AF079877 


Mus musculus cyclin G2 (Ccng2) gene, complete cds 


0.049 


3963 


X87947 


S.cerevisiae ALG2 gene 


0.024 


3964 


BC004730 


Mus musculus, proteasome (prosome, macropain) subunit, beta type 
10, clone MGC:5837 IMAGE:3583052, mRNA, complete cds 


6.8 


3965 


BC010610 


Homo sapiens, clone IMAGE:4214515, mRNA, partial cds 


2E-11 


3966 


AK012050 


Mus musculus 10 days embryo cDNA, RIKEN full-length enriched 
library, clone:2610319A01, full insert sequence 


e-144 


3967 


BC000240 


Homo sapiens, chromosome 4 open reading frame 1, clone 
IMAGE:3352004, mRNA 


1.9 


3969 


AE007346 


Streptococcus pneumoniae section 29 of 194 of the complete genome 


4.4 
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3970 


AK021969 


Homo sapiens cDNA FLJ11907 fis, clone HEMBB1000059 


| 0.005 


3971 


U05765 


Human annexin V (ANX5) gene, exon 7 


4.9 ! 


3972 


S68117 


rPLP-A=prolacfo-like protein A {5* region, exon 1, intron 1} [rats, 
Genomic, 1187 nt] 


0.11 


3973 


AL358852 


Human DNA sequence from clone RP1-3 17N9 on chromosome 6, 
complete sequence [Homo sapiens] 


0.18 


3974 


M26221 


African green monkey origin of replication (ORS8) region 


0.01 


3975 


M22064 


Mouse MHC class I Lyt-2-a gene encoding lyt-2. 1 T-cell surface 
alloantigen, complete cds 


0.71 


3976 


AY039917 


Arabidopsis thaliana putative wall-associated kinase 1 (F16F4.6) 
mRNA, complete cds 


3.5 


3977 


U67488 


Methanococcus jannaschii section 30 of 150 of the complete genome 


0.021 


3979 


XM 027089 


Homo sapiens N-acetylated alpha-linked acidic dipeptidase 2 
(NAALAD2), mRNA 


1E-09 


3980 


AF062751 


Manduca sexta soluble guanylyl cyclase beta-1 subunit mRNA, 
complete cds 


0.53 


3981 


AP000214 


Homo sapiens genomic DNA, chromosome 21q22.1, D21S226-AML 
region, clone f43Dl 1-1 19B8, segment 12/12, complete sequence 


0.0003 


3982 


AF360278 


Arabidopsis thaliana putative MLH1 protein (AT4g09140) mRNA, 
complete cds 


0.0001 


3983 


AF187968 


2 Homo sapiens concentrative nucleoside transporter (CNT1) gene, 
exon 2 


0.085 


3984 


AY032741 


Drosophila melanogaster H.M.S. Beagle transposon long terminal 
repeat, complete sequence; and heat shock protein Hsp70Ab gene, 
promoter and partial cds 


0.6 


3985 


NM_008697 


Mus musculus ninein (Nin), mRNA 


4 


3986 


AB055369 


Macaca fascicularis brain cDNA, clone:QflA-12661 


0 


3987 


X15999 


Kluyveromyces lactis mitochondrial tRNA-Val, COH and COI 
(partial) 


0.023 


3988 


AC008855 


Homo sapiens chromosome 5 clone CTD-2178M23, complete 
sequence 


0.56 


3989 


X79482 


l.punctatus Immunoglobulin mu heavy chain gene 


0.00007 


3990 


AF051289 


Dalbulus charlesi NADH dehydrogenase subunit 1 gene, 
mitochondrial gene encoding mitochondrial protein, complete cds 


0.072 


3991 


BC004382 


Homo sapiens, clone IMAGE:3640982, mRNA, partial cds 


e-147 


3992 


XM 041432 


Homo sapiens hypothetical gene supported by AL1 17603 
(LOC91908), mRNA 


1.3 


3993 


NM 027142 


Mus musculus RIKEN cDNA 2310012P17 gene (2310012P17Rik), 
mRNA 


0.027 


3994 


AB022673 


Oryza sativa rpl!2-l gene for chloroplast ribosomal protein L12, 
complete cds 


0.69 


3995 


AB009987 


Bombyx mori nuclear polyhedrosis virus gene for putative DNA- 
directed RNA polymerase component lefS, complete cds 


0.2 


3996 


NM 000144 


Homo sapiens Friedreich ataxia (FRDA), mRNA 


0.008 
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3997 


XM 002395 


Homo sapiens LBP protein; likely ortholog of mouse CRTR-1 (LBP- 
9^ mRNA 




3998 


AF056532 


Rra<;<;ica nanii<« ciiltrvar Darmnr AP"} 1 jwrnp AP^l -TVM allplp nnrrinl 

cds 




3999 


Z72888 


S.cerevisiae chromosome VII reading frame ORF YGR103w 


0.008 


4000 


AL5 12660 


milium a-hn-tv owvjuviivCf iiuui uiunc s\± 1 1 ~ j J j ivij' vjii win v^iiiuoV/ii ic i v/ 5 
comnlete sennence fHomo sanien«sl 


5.3 


4001 


AF190131 


dloninf? vector r*VO205 hv£n*omvcin-R-nhosnhotran^ferase fhtyh^ 
gene, complete cds 


1.7 


4002 


AJ132557 


Oiyctolagus cuniculus CYP19 gene, ovarian promoter region 


0.069 


4003 


AB001901 


Homo sapiens PACE4 gene, exon 4-7 


4 


4004 


AE006784 


Sulfolobus solfataricus section 143 of 272 of the complete genome 


0.19 


4005 


XM 049237 


Homo sapiens KIAA0841 protein (KIAA0841), mRNA 


1.7 


4006 


AF107676 


Aedes aegypti clone 416 Feilai family of SINES 


0.54 


4007 


AB053857 


TT virus gene for ORF1, partial cds, clone:Pd-10-3 


0.24 


4008 


AC078798 


Homo <»anien^ 3 BAP PAP-56F1 1 fRoswell Park Chancer Institute 
Human BAC Library) complete sequence 


5 


4009 


NM 026040 


Mus musculus RTKEN cDNA 2810036K01 eene f 28 1 003 6K0 1 Rito 
mRNA 


2E-33 


4011 


XM 027300 


Homo sapiens DKFZP434K114 protein (DKFZP434K114), mRNA 


4E-08 


4012 


AY021973 


Oryza sativa microsatellite MRG4298 containing (TA)X104, closest 
to marker SI 0620 genomic ^eonence ! 


0.0009 


4013 


AB003097 


Fruitfly strain g20 mitochondrial DNA, A+T-rich region, partial 


0 000003 


4014 


AJ000682 


Bos taurus PPARG gene, intron 1, 5' 


0.1 


4015 


AE001504 


TTflifYiHaptpr fwlori of rain TQ0 section fiS of 1 *\*) of* th<* r*otnnI F»tf* 

genome 


0.64 


4016 


NM 031442 


Homo oaoien*; hvnothetical nrotein DTCF5^n761 T17191 
(DKFZP76U17121), mRNA 


0.007 


4017 


U10698 


Rattus norvefficii^ liver microsomal carhoxvle^tera^e mRNA comnlete 
cds 


0.21 


4018 


AF009255 


Homo saniens nutative chloride channel ffene ( CTT .CN6^ exons 14 15 
16 and 17 


1.7 


4019 


NM 010657 


Mus musculus karma B and Rss recognition comnonent (TCrc^ rriRNA 


0.17 


4020 


AY021100 


Orvza sativa microsatellite MRG3425 containing JTA1X19 closest to 
marker L246, genomic sequence 


4.7 


4021 


U43092 


Rattus norvegicus surfactant protein-A (SP-A) gene, complete cds 


0.9 


4022 


XM 050127 


Homo sapiens oligophrenin 1 (OPHN1), mRNA 


0.64 


4023 


AE005576 


Escherichia coli 0157:H7 EDL933 genome, contig 3 of 3, section 195 
of 290 


0.45 


4024 


AF288693 


Mus musculus Ubell (Ubell) gene, partial cds 


0.0008 


4025 


AF203748 


Taphozous sp. Brcal (Brcal) gene, partial cds 


5.6 
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4026 


AF179234 


Homo sapiens short-chain dehydrogenase/reductase 1 (SDR1) gene, 
exon 1 


0.56 


4027 


AF163735 


Ilhcium parviflorum internal transcribed spacer 1, 5.8S ribosomal 
RNA gene, and internal transcribed spacer 2, complete sequence 


0.62 


4028 


INlVL UZOZJJ 


Mus musculus RIKEN cDNA 4933434120 gene (4933434I20Rik), 
mKNA 


6.5 


4C29 


AL031426 


Human DNA sequence from clone CTA-191D12 on chromosome 
22ql3.1 Contains two exons of the APOL2 gene for apolipoprotein L 
2, ESTs and GSSs, complete sequence [Homo sapiens] 


0.000001 


4030 


JSJVl UUol*fO 


Homo sapiens myeloid/lymphoid or mixed-lineage leukemia (trithorax 
(Drosophila) homolog); translocated to, 6 (MLLT6), mRNA 


0.63 


4031 


ArUU /VIM* 


Chaetopterus variopedatus histone H4 and histone H2 A genes, 
complete cds 


0.009 


4032 




Homo sapiens C-type (calcium dependent, carbohydrate-recognition 
domain) lectin, superfamily member 5 (CLECSF5), mRNA 


0.69 


4033 


Uool54 


Homo sapiens proline and glutamic acid rich nuclear protein isoform 
mRNA, partial cds 


0.074 


4034 


AC023838 


Arabidopsis thaliana chromosome HI PI MJM20 genomic sequence, 
complete sequence 


0.2 


4035 


X15894 


Sinapsis alba cab-1 gene for chlorophyll a/b-binding polypeptide \ 


0.1 


4036 


X54742 


Nicotiana plumbaginifolia beta-(l,3)-glucanase gene for a vacuolar 
isoform 


1.8 


4037 


NM 017590 


Homo sapiens hypothetical protein DKFZp434K0920 
(DKFZp434K0920), mRNA 


0.13 


4038 


U92482 


Equus caballus interleukin-1 receptor antagonist (EqlL-IRA) mRNA, 
complete cds 


0.38 


4039 


U47540 


Aspergillus nidulans nitrogen regulatory protein (tamA) gene, 
complete cds 


0.18 


4040 


AK001076 


Homo sapiens cDNA FLJ10214 fis, clone HEMBA1006530 


0.0000009 


4041 


AC002049 


Homo sapiens Chromosome 22qll.2 Cosmid Clone 107d7 In BCRL2- 
GGT Region, complete sequence 


1.3 


4042 


U55243 


Dictyostelium discoideum glyceraldehyde-3-phosphate dehydrogenase 
(GAPDH) gene, partial cds 


0.23 


4043 




H. sapiens (D1S218) DNA segment containing (CA) repeat; clone 
ArMD /xev; smgle read 


0.002 


4044 


AB012733 


maturase, complete cds 


0.024 


4045 


AF034950 


Bos taurus gonadotropin hormone receptor (GnRH) gene, partial cds 


0.22 


4046 


AJ307017 


Mus musculus mRNA for putative ubiquitin-specific protease (Usp9y 
gene) 


1.7 


4047 


AE001159 


Borrelia burgdorferi (section 45 of 70) of the complete genome 


0.89 


4048 


AF239215 


Forficula auricularia microsatellite FA2 sequence 


0.0002 
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4049 


AP000249 


Homo sapiens genomic DNA, chromosome 21q22.1, D21S226-AML 
region, clone:B762015, complete sequence 


1E-18 


4050 


AP000249 


Homo sapiens genomic DNA, chromosome 21q22.1, D21S226-AML 
region, clone:B762015, complete sequence 


2E-18 


4051 


AF355470 


Mus musculus cysteine dioxygenase gene, exon 2 


0.61 


4052 


BCO 10739 


Homo sapiens, hypothetical protein FLJ12612 similar to COP9 
(constitutive photomorphogenic), subunit 7b (Arabidopsis), clone 
MGC: 16836 IMAGE:3893022, mRNA, complete cds 


4.5 


4053 


AK016525 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4932415G12, full insert sequence 


1.5 | 


4054 


AJ237609 


Borrelia recurrentis vmpAl gene, silent copy 


i 1.8 


4055 


U00038 


Caenorhabditis elegans cosmid T21D11, complete sequence 


4.3 


- 

4056 


AF268053 


Candidates Carsonella ruddii natural-host Tainarys sordida RNA 
polymerase beta subunit (rpoB) and RNA polymerase beta-prime 
subunit (rpoC) genes, partial cds 


0.067 


4057 


AC016694 


Homo sapiens BAC clone RP11-123G1 from Y, complete sequence 


| 0.2 


4058 


AKO 19557 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4930402H05, full insert sequence 


0.15 


4059 


AF233898 


Escherichia coli strain KI1218 BfpA (btpA) gene, partial cds 


0.063 


4060 


AK026269 


Homo sapiens cDNA: FU22616 fis, clone HSI05164 


5.7 


4061 


Z80362 


H.sapiens HLA-DRB pseudogene, exon 1; 


1.7 


4062 


XM 046282 


Homo sapiens hypothetical gene supported by AK026722 
(LOC92635), mRNA 


0.007 


4063 


U83303 


Human line-1 reverse transcriptase gene, partial cds, and granulocyte 
chemotactic protein-2 (GCP-2) gene, complete cds 


5E-12 


4064 


AF276169 


Scaphirhynchus platorynchus microsatellite Spl-100 sequence 


0.074 


4065 


AF156797 


Mus musculus ATP synthase coupling factor 6 gene, 5' UTR; and 
GABP-alpha subunit gene, promoter and 5' UTR, partial sequence 


1.2 


4066 


AF1 10420 


Mus musculus hoTgN37INRA locus sequence 


4E-10 


4067 


AK021659 


Homo sapiens cDNA FLJ1 1597 fis, clone HEMBA1003856 j 


6E-10 


4068 


AL5 12724 


Homo sapiens mRNA; cDNA DKFZp547M202 (from clone i 
DKFZp547M202) 


0.2 


4069 


S54531 


L-plastin {exon 1, promoter} [human, Genomic, 3231 nt] j 


0.023 


4070 


L02110 


Mus musculus proviral retroviral insertion in the cGMP- 
phosphodiesterase (rd beta PDE) gene, intron 1, with the proviral 
insert encompassing the env pseudogene (3' end) and 3 1 LTR 


2.1 


4071 


XM 015244 


Homo sapiens ring finger protein 21, interferon-responsive (RNF21), 
mRNA 


1.5 


4072 


AK015420 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4930448I18, full insert sequence 


0.003 


4073 


U67583 


Methanococcus jannaschii section 125 of 150 of the complete genome 


0.021 


4074 


X05640 


Mouse NF-M gene for middle-molecular-mass neurofilament protein 


0.072 
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4075 


D01021 


Anthocidaris crassispina niRNA for dynein beta-heavy chain, 


0.61 


4076 


XM 029072 


Homo sapiens LOC89256 (LOC89256), mRNA 


0.006 


4077 


NM_0 17757 


Homo sapiens hypothetical protein FLJ20307 (FLJ20307), mRNA 


0.26 


4078 


AF145592 


Nicotiana tabacum calcium/calmodulin-dependent protein kinase 
(CCaMK) gene, exons 


0.002 


4079 


ni JOt7 f\j 


Arabidopsis thaliana somatic embryogenesis receptor-like kinase 3 

^OXVtNJVJ ) IXUvLN/V, COIUpieiC COS 


5.3 


4080 


AF238313 


uicxyosieiium aiscoiaeum aeveiopmentai protein DG1037 (DG1037) 
gene, partial cds 


0.2 


4081 


NM_012519 


Rattus norvegicus Ca-H-/calmodulin-dependent protein kinase n, delta 
subunit (Camk2d), mRNA 


0.071 


4082 


AT ^OOIRJ. 


Human DNA sequence from clone RP11-349A16 on chromosome 
j\s\jlj*. j-z^f, complete sequence [xiomo sapiens J 


Alt 

0.11 


4083 




xiomo sapiens up id. 3 cione lAJrii 1 A, partial sequence 


0.17 


4086 


XM 034197 


Homo sapiens KIAA1505 protein (KIAA1505), mRNA 


e-153 


4087 




Sinorhizobium meliloti plasmid pSymA section 61 of 121 of the 
complete plasmid sequence 


5.4 


4088 




Streptococcus pyogenes Ml GAS strain SF370, section 115 of 167 of 
uie compiece genome 


1.6 


4089 


AF391284 


Homo sapiens llpl5.5 clone LOH11A, partial sequence 


0.14 


4090 




caenoriiauaiiis eiegans cosmia rzzr l 


0.065 


4091 




Pseudosymblepharis schimpeiiana tRNA-Leu, partial sequence; tmL- 
trnF intergenic spacer, complete sequence; and tRNA-Phe, partial 
sequence, cmoropiasi genes ior cnioropiast products 


O.Uoo 


4092 




xiomo sapiens enromaun accessibility complex i (CtiKACl), mRNA 


A OO 

0.39 


4093 


IN 1V1_U 4DzfZ.y 


Mus musculus RIKEN cDNA 2010109103 gene (2010109I03Rik), 

mixXNA 


5.1 


4094 


AOZ / ©<#■ 


j-/. aiscoiaeum iiikina ior calcium Dinning protein 


A /T>AAA1 

0.00001 


4095 


TTfid.^'M 

UvtJZ't 


Caedibacter taeniospiralis 47 R body synthesis and assembly (rebA 
reojo, reou,, reouj genes, complete cos 


A CO 

0.55 


4096 


XM 050063 


Homo sapiens B-cell CLL/lymphoma 9 (BCL9), mRNA 


0.075 


4097 


-AJ.V1 \JJ 1 OJU 


xiomo sapiens synaecan 1 joul^i,/, idkna 


A 1 

2 


4098 




Homo sapiens retinitis pigmentosa GTPase regulator (RPGR) gene, 
exon wivr ij pamai cos 


A A*7iC 
0.0/O 


4099 


L05466 


Pneumocystis carinii beta-tubulin gene, complete cds 


0.009 


4100 


Z69723 


Human DNA seauence from cosmid hpfw«*n mark^rc 
DXS6791 and DXS8038 on chromosome X 


0.18 


4101 


D83657 


Human DNA for CAAF1 (calcium-binding protein in amniotic fluid 
1), complete cds 


5E-30 


4102 


AL1 17050 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen 
deprivation 


2.1 


4103 


AE004168 


Vibrio cholerae chromosome I, section 76 of 251 of the complete 
chromosome 


2.3 
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4104 


BC007114 


Homo sapiens, clone IMAGE:4295422, mRNA 


0.22 


4105 


D13859 


Penicillium chry sogenum mitochondrion genes for rRNA, tRNA and 
S5 ribosomal protein, complete cds 


0.75 


4106 


AF137068 


Canis familiaris cubilin (CUBN) mRNA, complete cds 


4.3 


4107 


AKO 12387 


Mus musculus 11 days embryo cDNA, RIKEN full-length enriched 
library, clone:2700047H13, full insert sequence 


0.38 


4108 


NC 001942 


Mycoplasma arthritidis bacteriophage MAV1, complete genome 


0.12 


4109 


AF333431 


Homo sapiens pseudo attP site psiD, partial sequence 


0.45 


4110 


AF305628 


Cubitermes subarquatus microsatellite P19 sequence 


0.68 


4111 


D 10997 


C.ellipsoidea ibcL, rpsl4, trnM, trnG, trnD, trnS, rps4, atpE, atpB 
genes, complete cds 


1.8 


4112 


Z99704 


Human DNA sequence from clone LL22NC01-75B8 on chromosome 
22 Contains GSSs, complete sequence [Homo sapiens] 


0.005 


4113 


XM 016579 


Homo sapiens androgen-induced prostate proliferative shutoff 
associated protein (KIAA0979), mRNA 


1.6 


4114 


NM_008149 


Mus musculus glycerol-3-phosphate acyltransferase, mitochondrial 
(Gpam), mRNA 


5.5 


4115 


XM 036116 


Homo sapiens KIAA0594 protein (KIAA0594), mRNA 


e-144 


4116 


XM 030510 


Homo sapiens microfibrillar-associated protein 2 (MFAP2), mRNA 


2.4 


4117 


ABO 10203 


Leptospira interrogans gene, 19kb region containing 5S rRNA gene 


1.4 


4118 


XM 007094 


Homo sapiens collagen, type IV, alpha 1 (COL4A1), mRNA 


1.9 


4120 


AF124049 


Fugu rubripes double stranded RNA adenosine deaminase RED IB 
gene, complete cds 


0.066 


4121 


AY007236 


Pneumocystis carinii pheromone receptor a mRNA, complete cds 


0.53 


4122 


AF307966 


Aptostichus simus clone PD1 16S ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


2 


4123 


AJ001022 


Marmata monax mRNA for tumor suppressor, p53 


1.3 


4124 


AF239469 


Polycycnis gratiosa maturase (matK) gene, partial cds; chloroplast 
gene for chloroplast product 


0.65 


4125 


X75675 


C.parapsilosis mitochondrial ND6 and ND1 genes 


0.00001 


4126 


AF335589 


Glycine max chitinase class I (Chial) and hypothetical protein genes, 
complete cds 


0.0003 


4127 


U33175 


Oryza sativa sucrose phosphate synthase gene, complete cds 


6.4 


4128 


AB028198 


Arabidopsis thaliana ZF14 mRNA, complete cds 


0.008 


4129 


AF372653 


Heteropneustes fossilis prolactin hormone (PRL) mRNA, complete cds 


1.3 


4130 


M63955 


S.oedipus MHC class I SO-NI gene 


0.6 


4131 


BC003238 


Mus musculus, protein kinase, cAMP dependent, catalytic, alpha, 
clone MGC:6169 IMAGE:3497908, mRNA, complete cds 


0.22 


4132 


AJ001688 


Homo sapiens NKG2D gene, exons 6-9 


0.67 


4133 


D 10872 


Humam h NAT allele 3-2 gene for arylamine N-acetyltransferase 


0.016 
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4134 


AF049877 


Cotesia congregata EP1 locus with complete integrated Cotesia 
congregata virus EP1 circle and partial integrated A circle 


0.19 


4135 


Y15783 


Rickettsia prowazekii groESL gene 


0.3 


4136 


Y00354 


Xenopus laevis gene encoding vitellogenin A2 


0.077 


4137 


NC 001950 


Scyliorhinus canicula mitochondrion, complete genome 


0.019 


4138 


AF266048 


Macromia splendens small subunit ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


0.42 


4139 


AL035426 


Human DNA sequence from clone 370N13 on chromosome Xq25- 
26.3. Contains an exon of the GRIA3 gene for glutamate receptor, 
ionotrophic, AMPA 3, Contains ESTs, complete sequence [Homo 
sapiens] 


0.0006 


4140. 


ArUlo434 


Mus musculus long-chain acyl-CoA dehydrogenase (Acadl) gene, 
intron 1 


6.4 


4141 


AK509654 


Mus musculus U2AF small subunit-related protein (U2afl-rsl) gene, 
complete cds 


0.64 


4142 


AJ009913 


Bos taurus pip gene 


2.1 


4143 


XM 028213 


Homo sapiens up-regulated by BCG-CWS (LOC64116), mRNA 


0.078 


4144 


U67550 


Methanococcus jannaschii section 92 of 150 of the complete genome 


0.081 


4145 


AB037771 


Homo sapiens mRNA for KIAA1350 protein, partial cds I 


0 


4146 


AP001438 


Homo sapiens genomic DNA, chromosome 21q22.2, clone:T695, 
LB7T-ERG region, complete sequence 


0.009 


4147 


U88995 


Callithrix geoffroyi clone CGE85 mitochondrial D-loop, partial 
sequence 


0.57 


4148 


AE007343 


Streptococcus pneumoniae section 26 of 194 of the complete genome 


1.9 


4149 


Z83832 


Avena sativa mRNA for UDP-glucose:sterol glucosyltransferase 


0.072 


4150 


BC007664 


Homo sapiens, hepatocellular carcinoma-associated antigen 59, clone 
MGC:866 IMAGE: 3 140172, mRNA, complete cds 


0 


4151 


NM 002577 


Homo sapiens p21 (CDKNlA)-activated kinase 2 (PAK2), mRNA 


5E-39 


4152 


XM 040120 


Homo sapiens ADP-ribosylation factor GTPase activating protein 1 
(ARFGAP1), mRNA 


2.1 


4153 


NM 031511 


Rattus norvegicus Insulin-like growth factor II (somatomedin A) 
(Ig£2), mRNA 


0.024 


4154 


XM 049817 


Homo sapiens Sec23-interacting protein pl25 (P125), mRNA 


1.8 


4155 




Homo sapiens, clone MGC: 13350 IMAGE:4333615, mRNA, 

complete vUS 


\ 

U.UUUUU^ 


4156 


AJ222787 


Hordeum vulgare mRNA for alpha-keto acid dehydrogenase-like 
protein, clone RG136 


0.075 


4157 


AL021306 


Human DNA sequence from clone CTB-1 109B5 on chromosome 22 
Contains a GSS, complete sequence [Homo sapiens] 


0.000001 


4158 


AF3 15813 


Norwalk virus (Hu/NLV/OC96065/1996/JP) polymerase gene, partial 
cds 


0.71 
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4159 


AE001407 


Plasmodium falciparum chromosome 2, section 44 of 73 of the 
complete sequence 


0.0008 


4160 


U32706 


Haemophilus influenzae Rd section 21 of 163 of the complete genome 


0.6 


4161 


AF293663 


Anopheles culicifacies 12S ribosomal RNA gene, partial sequence; 
mitochondrial control region, complete sequence; and tRNA-Ile gene, 
partial sequence; mitochondrial genes for mitochondrial products 


0.066 


4162 


XM_037617 


Homo sapiens acetyl-Coenzyme A carboxylase alpha (ACACA), 
mRNA 


3E-35 


4163 


AF305712 


Bos taurus insulin-like growth factor binding protein-3 (IGFBP3) 
gene, complete cds 


0.23 


4164 


AKO 18618 


Mus musculus adult male cecum cDNA, RIKEN full-length enriched 
library, clone:9130014E20, full insert sequence 


0.68 


4165 


Y00354 


Xenopus laevis gene encoding vitellogenin A2 


0.079 


4166 


M61827 


Human leukosialin (CD43) gene, complete cds 


2.2 


4167 


M64861 


D.discoideum TJfilD mRNA, complete cds 


0.066 


4169 


AB028894 


Mus musculus Rpsl 1, U35 genes for ribosomal protein Sll and U35 
snoRNA, complete cds and sequence j 


0.6 


4170 


XM__003974 


Homo sapiens hypothetical protein FIJI 1 159 (FLJ1 1 159), mRNA 


0.68 


4171 


AF100658 


Caenorhabditis elegans cosmid H08G01 


0.75 


4172 


AF1 15645 


Sedum multiceps maturase K (matK) gene, partial cds; chloroplast 
gene for chloroplast product 


0.0001 


4173 


AF064029 


Hehanthus tuberosus lectin 1 mRNA complete cds 


0.11 


4175 


AF093754 


Arabidopsis thaliana trans-membrane nitrate transporter protein 
AtNRT2:l mRNA, complete cds 


0.22 


4176 


M14292 


Human LIHeg repetitive element from the intergenic region of the 
epsilon and G-gamma globin genes 


2E-08 


4177 


NM 013564 


Mus musculus insulin-like 3 (Insl3), mRNA 


0.067 


4178 


AF218815 


Coturnix japonica early response to neural induction ERNI (ERNI) 
mRNA, partial cds 


0.21 


4179 


U67616 


Human 28S ribosomal RNA pseudogenes and alu repeat region 
sequence 


6E-20 


4180 


AJ297945 


Frankia *n ArTC ervnrie nla^mid nF031 ORF AF korSAF &ene ORF 
CrF, ORF DrF, repF gene, ORF FrF, ORF GF, ORF HF, ORF JF, 
ORF KrF, ORF LrF, ORF MF, ORF NrF, ORF OF, ORF PrF, parAF 
gene and ORF SrF 


0.68 


4181 


AB035429 


Homo sapiens AGL gene for glycogen-debranching enzyme, exon 14 


0.62 


4182 


AE006774 


Sulfolobus solfataricus section 133 of 272 of the complete genome 


0.063 


4183 


U38803 


Dictyostelium discoideum vacuolar proton ATPase 100-kDa subunit 
(vatM) mRNA, complete cds 


0.025 


4184 


U78096 


Human macrophage colony stimulating factor receptor (c-fms) gene, 
exon 1A, 2 and partial cds 


0.0001 
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A t Of 

4185 


X57037 


T.thennophila G8-scRNA DNA 


5.7 


4186 


AJ001005 


Picea abies chloroplast DNA for rbcL pseudogene 


0.67 


4187 


XM 048652 


Homo sapiens Apobec-1 complementation factor; APOBEC-1 
stimulating protein (ACF), mRNA 


4E-12 


4188 


Y07654 


P.crispum pall gene 


0.24 


4189 


Z78708 


Rsapiens flow-sorted chromosome 6 Hindm fragment, SC6pA14H12 


I 1E-28 


4190 


AE002143 


Ureaplasma urealyticum section 44 of 59 of the complete genome 


2.1 


4191 


AF247729 


Oncorhynchus mykiss glucose transporter IB mRNA, partial cds 


1.8 


4192 


Z72929 


S.cerevisiae chromosome VII reading frame ORF YGR144w 


0.003 


4193 


M87754 


Human simple repeat polymorphism 


0.081 


4194 


AL021809 


S.pombe chromosome I cosmid c25A8 


0.026 


4195" 


D12820 


Rat mRNA for GUST27 protein, complete cds 


3.7 


4196 


M60352 


Mouse testosterone 16a-hydroxylase type b (16aoh-b), exon 1 


4.7 


4197 


XM_007360 


Homo sapiens hepatocyte nuclear factor 3, alpha (HNF3A), mRNA 


0.95 


4198 


Z36064 


S.cerevisiae chromosome II reading frame ORF YBR195c 


0.52 


4199 


AK023959 


Homo sapiens cDNA FLJ13897 fis, clone THYRO1001706 


0.49 


4200 


U48298 


Dictyostelium discoideum rtoA gene, complete cds 


0.68 


4201 


L47461 


Heterodontus francisci tcr beta gene 


1.6 


4202 


AB041023 


Humulus lupulus DNA, 18S-26S intergenic spacer, complete sequence 


0.0008 


4203 


AF1 18226 


Hordeum vulgare high pi alpha-glucosidase (AGL97) gene, complete 
cds 


1.8 


4204 


AF158049 


Platybrachys decemmacula 16S ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


0.077 


4205 


U63855 


Drosophila virilis decapentaplegic protein (dpp) gene, complete cds, 
haploinsufficient region 


0.2 


4206 


AF248814 


Maoricicada cassiope isolate TB-MI-004 tRNA-Asp gene, complete 
sequence; ATPase subunit 8 gene, complete cds; and ATPase subunit 
6 gene, partial cds; mitochondrial genes for mitochondrial products 


0.023 


4207 


XM 037529 


Homo sapiens KIAA0410 gene product (KIAA0410), mRNA 


0.78 


4208 


AF330636 


Plant DNA excision vector pX6-GFP, complete sequence 


0.76 


4209 


AL080235 


Homo sapiens mRNA; cDNA DKFZp586E1621 (from clone 
DKFZp586E1621) 


0.007 


4210 


NM 021262 


Rattus norvegicus Acid phosphatase 1, soluble (Acpl), mRNA 


0.16 


4211 


Ml 1449 


Yeast mitochondrial ori2-ori7 region DNA with putative peptide 


0.25 


4212 


AF211134 


Carsonella ruddii natural-host Qenaiytaina longicauda 16S ribosomal 
RNA gene, partial sequence; 23S ribosomal RNA and 5S ribosomal 
RNA genes, complete sequence; transaldolase (tal) and alkyl 
hydroperoxide reductase small subunit (ahpC) genes, complete> 


0.074 


4213 


AF157293 


Sporodiniella umbellata translation elongation factor 1-alpha (EF- 
lalpha) gene, partial cds 


0.77 



369 



3NSDOCID: <WO 0214500A2_I_> 



WO 02/14500 



PCT/US01/25840 





Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


4214 


AL390767 


Human DNA sequence from clone RP1-68P15 on chromosome llpl3- 
14.2 Contains GSSs and ESTs. Contains part of a novel gene, 
complete sequence [Homo sapiens] 


0.004 


4215 


XM 004579 


Homo sapiens HGC6.1.1 protein (HGC6.1.1), mRNA 


4E-08 


4216 


XM 043964 


Homo sapiens similar to KIAA0220 protein (H. sapiens) (LOC92263), 
mRNA 


5E-40 


4217 


AL390767 


Human DNA sequence from clone RP1-68P15 on chromosome llpl3- 
14.2 Contains GSSs and ESTs. Contains part of a novel gene, 
complete sequence [Homo sapiens] 


0.004 


4218 


AF147082 


Homo sapiens gamma-glutamyl hydrolase gene, exons 3 through 7 


0.088 


4219 


AY023560 


Oryza sativa microsatellite MRG5885 containing (TTA)X8, closest to 
marker R2561, genomic sequence 


0.026 


4220 


XM_050088 


Homo sapiens hepatocellular carcinoma-associated antigen 127 
(HCA127), mRNA 


0.007 


4221 


AF131999 


Plasmodium falciparum putative erythrocyte binding protein EBL-1 
(ebl-1) gene, partial cds 


0.68 


4222 


AF008434 


Solanum fendleri farnesyl-protein transferase beta subunit gene, 
intron 10 


0.21 


4223 


AF367267 


Arabidopsis thaliana AT3g26020/MPEl 1_17 mRNA, complete cds 


0.53 


4224 


XM 006706 


Homo sapiens primase, polypeptide 1 (49kD) (PRJM1), mRNA 


7E-12 


4225 


AK004675 


Mus musculus adult male lung cDNA, RIKEN full-length enriched 
library, clone: 1200009120, full insert sequence 


0.017 


4226 


AB026660 


ArabidoDsis thaliana genomic DNA chromosome 5. BAC 
clone:T20O7 


0.23 


4227 


AF272387 


Homo sapiens leucine zipper nuclear factor (BLZF1) gene, exon 6 


5E-09 


4228 


L08897 


Mycoplasma gallisepticum (strain A5969) 16S-, 23 S-, 5S ribosomal 
RNA (rrs A, rrlA, rrfA) genes 


0.073 


4229 


Z26314 


P.falciparum gene for STARP antigen 


0.008 


4230 


NC 001942 


Mycoplasma arthritidis bacteriophage MAV1, complete genome 


0.18 


4231 


Y18277 


Gallus gallus mRNA for neurobeachin, partial 


3 


4232 


XM 047009 


Homo sapiens pleiomorphic adenoma gene-like 2 (PLAGL2), mRNA 


1.8 


4233 


X84729 


S.cereale repeated DNA sequence 


0.21 


4234 


AL513132 


Human DNA sequence from clone RP1 1-460C6 on chromosome 9, 
complete sequence [Homo sapiens] 


0 


4235 


Z71694 


E.herklotsi mitochondrial genes for NADH Debydroenase subunit 5 
and 6 


0.6 


4236 


Z33215 


M.capricolum DNA for CONTIG MC294 


0.54 


4237 


X86012 


Human DNA sequence from intron 22 of the factor VIE gene, Xq28. 
Contains the end of a 9.5kb repeated region, int22h-l, involved in 
many cases of haemophilia 


0.0000005 


4238 


AF174355 


Praon barbatum 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


0.026 
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4239 


AY039607 


Arabidopsis thaliana AT3g52380/F22O6 240 mRNA, complete cds 


6.1 


4240 


AJ223385 


Fowlpox virus strain HP-440 DNA, isolate FP9, 14.6 kb fragment 


0.000004 


4241 


ArZ / i LJSJL 


Daphnia cristata country Lithuania 12S ribosomal RNA gene, partial 
sequence 


1.9 


4242 


1NIV1 UZJojU 


Mus musculus carbohydrate (keratan sulfate Gal-6) sulfotransferase 1 

/Y* , l»«!*'^ mD\T A 


0.59 


4243 




Lepilemur septentrionalis genomic fragment, RAPD with primer 
operon Kjrn/t, clone ai 


0.2 


4244 


YM OS 174.7 


Homo sapiens eukaiyotic translation initiation factor 4 gamma, 2 

CFTFACiO^ mPIMA 
\l2,lSr i *\j4.) 9 lIliviNA 


3E-58 


4245 


lVlOJ J /o 


riuman ii\±*ivi-z protein gene, exons l,z and 3 


0.7 


4246 


ATjO /ZO / 


/vraoiaopsis tnauana Ai JgzouzU/MPEll 17 mRNA, complete cds 


0.59 


4247 


AJ294714 


Pichia anomala leu2 gene for beta-isopropylmalate dehydrogenase 


0.17 


4248 




Mus musculus amnionless precursor protein (Aran) gene, partial cds 


0.57 


4249 


AL049609 


S.pombe chromosome IE cosmid c297 


1.8 


4250 




Homo sapiens genomic hybrid Rhesus box 


3.8 


4251 


XM 034693 


Homo sapiens ectodermal dysplasia 1, anhidrotic (EDI), mRNA 


0.2 


4252 




Caprine arthritis-encephalitis virus, complete genome 


1.3 i 


4253 




Homo sapiens 12q BAC RP11-946G22 (Roswell Park Cancer Institute 
Human BAC Library) complete sequence 


5.9 


4254 


AF003519 


Paraligoneunis sp. 16S ribosomal RNA gene, partial sequence 


0.024 


4255 


AL049434 ! 


Homo sapiens mRNA; cDNA DKF2p586M151 (from clone 
DKFZp586M151) 


0.0008 


4256 


TTW71 


Triticumaestivum acetyl-CoA carboxylase gene, exons 1-30, complete 
cds 


0.22 


4257 




Oiyza sativa V-ATPase B gene for vacuolar ATPase B subunit, 
complete cds 


1.7 


4258 




Rattus norvegicus small zinc finger-like protein (TTMIO) mRNA, 
complete cds 


0.24 


4259 


AFOA7^77 


Homo sapiens dystrophin (DMD) gene, deletion breakpoints 1-3 in 
lnuon d 


0.025 


4260 


Z81504 


Caenorhabditis elegans cosmid F15H9, complete sequence 


0.24 


4261 


AF226147 


HTV-1 isolate 630469c from USA protease (pol) gene, partial cds 


2 


4262 


AF270072 


Staphylococcus epidermidis strain SRI clone step.l047b05 genomic 
sequence 


0.68 


4263 


U56088 


Human periodic tryptophan protein 2 (PWP2) gene, exons 3 to 14 


0.076 


4264 


X83692 


Brassica napus (tournefortii) mitochondrial genes for atp6, and 
ORF263 


2.1 
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4265 




"PpnifM Ilium fwii 1 i n l m vaniftlar cprinp rvmtpacp TnU'WA fnmTVIptP pHq 
rcillvi 1 1 1 U 1 11 UAiUivuiii v a\*ixsjiai avXXX&C tyiuiwctow xiix\j.^^i 9 ^uiii^xcLC/ wua 


0.63 


4266 


1N1Y1 UZHjoI 


Rattus norvegicus ATP -stimulated glucocorticoid-receptor 

UcUloXUUiitiJll pXUiXllHCX \\jyJXJ 9 XLLXviN/Y 


0.68 


4267 


AF098919 | 


Gallus gallus alpha-globin gene domain 5' region 


2.3 


4268 




xiypUL/ICil JCtAJllXul X1HJJ gCUC, CAUllb 1 


0.23 


4269 




^^piTuiTinr'.nrvnc innnacphii cppf inn 70 rrf" 1 ^0 rvFtliP. crvmnlpte ffenoine 

LVXCI 1 HflHUv*\JW , U3 i In 1 1 1 'fliTV In 1 uwllvll / \J VJ1 X ~J\J \JX Ulw wUl i 1. Iv ^WlvlUO 


0.029 


4270 


AF104920 


Arabidopsis thaliana BAC T9E19 


0.026 


4271 


AF137394 


Euperipatoides rowelli elongation factor-1 alpha mRNA, partial cds 


0.23 


4272 


AF243087 


Homo sapiens intrinsic factor-vitamin B12 receptor (CUBN) gene, 
exons 11, 12, and 13 


0 


4273 


AF264703 


Ach;onov mpvir , onn C f*f=»ttTIQl Vl rtTTI PV^V»r*"V nivitpiTi /T? v 1 ^ TTlT? TsJ A 
.rYo L V dX ldA. XllCAlWlllUo XCIIUcll livlliiCUUUA. \JX VJ Iwlll ^IVAi ) 1LLC\J.X.T\, 

complete cds 


2.2 


4274 


AJ3 11054 


Phirnnnmnc nalliriit/ittsitiic AT 0121? fr\T* mitatTVP^ TPfYYmhinJJQP 
^lUXUIlUIXluo Udll ILUVllulLLlo *+J-» WIvT XUX iJUUlLlVE/ IPwUmUHInOV 

subtelocentric clone, Cp5.5 


0.025 


4275 


AF267209 


Candiaatus (jarsonena maun natural -no si caiopnya scnuu Air 
synthase alpha subunit (atpA) gene, partial cds; ATP synthase gamma 

cntVmnit (<ztT\Or\ optip rnmnl ptp rHc* arts! A I P ovnthjiQP hptn oiihnnit 

(atpD) gene, partial cds 


0.0008 


4276 


*7T71 AR 


XT. SapiCllS ^XVlXo50-Jj L/INA bCglUCXll (JUXlutlXJXXXg lCLJCaL, dUllfc 

ATTVAO ^A-tKQ" cinolp TP2*/i 
ATIVX^ J*rZW", SillXgiG ICdU 


0.008 


4277 




Gallus gallus T cell receptor delta chain (TCRD) pseudogene, partial 


2.3 


4278 




Mus musculus 10 days embryo cDNA, RIKEN full-length enriched 

XlUXoXy, L1U11C . l w J i/uxi i J, -lull iiiawit awt[ucuw 


0.49 


4279 


ATZ0747U 


oinpnyiococcus cpiacixiuuxb suaxii oxvx ciuxic xuu jiiv^t gcuuiLu.i' 
sequence 


0.087 


4280 


XM_050118 


Homo sapiens hephaestin (HEPH), mRNA 


2E-75 


4281 


/■\_r x ozo 


A /fiic mnc/*iiliic vpnfral TiPnrnTi- cnfV*i"Fir* nrAtPin 1 ~Mf"^\/A 1 fMrtvn 1 ^ 
iVXUS XTLUawUXUo VClXIXciX X1CIXXU11— opCv«XXXW LJlWlClll X V-TVX ^INvlVdX/ 

IIUUNA, \J<Xk Lldl WVIO 


0.008 


4282 


XM 029168 


riomo sapiens nypouicu.L»dLi gcxic ouppuncu uy rvrs-uz, o 10^ 
(LOC90120), mRNA 


0.53 


4283 


T **1£A5 


~uw|f orit Hiic annnuc riho<:nTii5)1 nrntpin ^^51 mR 7\T A nnmnlptp pHq 

XlClXoXlLlllXo dXXIXUUo X XUvlSVIlllCU iyiV/LvlXl OJ(l lllXVl^^V, LAJlll^JlWlC wVID 


0.67 


4284 


-AJV1_U1Z / DO 


TJrtmrfc coniAnc fTQtier*in»if~ir»Ti far»trtr A /' 1 "f^FA^ mT? AJ A 
xiOUlU adpiCXXb UaliDwllLyuun xox^lxjx *t ^iw ^ /, XiuvLN^v 


0.22 


4285 


AK021049 


JY1US mUSCUlUS auuiL mine uuxpixb blXXalUlXX cl/xn A, r\ 1 rS. xuxx-xcxigux 

enriched library, clone:C030006F08, full insert sequence 


0.006 


4286 


AB020763 


Ptychodera flava mRNA for Paxl9, complete cds 


0.26 


4287 


XM_047401 


Homo sapiens G protein-coupled receptor 51 (GPR51), mRNA 


0.25 


4288 


AF250952 


Osmia satoi 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


0.24 


4289 


AF267985 


Homo sapiens 15 kDa selenoprotein gene, exon 4 


0.024 


4290 


AJ409503 


Mus musculus RNA binding site for Dazl protein, clone jd8 


0.029 


4291 


AC009304 


Homo sapiens clone NH01 1 1109, complete sequence 


0.026 
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4292 


XM_050269 


xxcuiiu sapxciid auiuic cdixici icuiiiiy x v gnai mgn ajximty fnutamate 
transporter), member 3 (SLC1 A3), mRNA 


0.21 


4293 


AK004286 


xviub mubL.uiub 10 aays cuiDryo cujn/v, ivJJvc/iN mii-iengui enncned 
library, clone: 1110057K12, full insert sequence 


2.1 


4294 


NC 002095 


viuuiuuiuui uuuwifl L/JtaalXIXU JJV^X-* x , ^vlxlllylvLC oCtJUCllUC 


U.Z4 


4295 


AC006678 


Caenorhabditis elegans cosmid R05G9, complete sequence 


2.1 


4296 


AF217235 


ouipuyiuwoucub aureus pdinogcniciiy lsianu oarioov, complete 
sequence 


0.009 i 


4297 


XM 012045 


Homo sapiens hypothetical protein FU13074 (FLJ13074), mRNA 


0.68 


4298 


NMJ)22087 


Homo sapiens hypothetical protein FLJ21634 (FLJ21634), mRNA 


e-158 


4299 


XM 010678 

/ViVl \J XV/ w 1 O 


XiUino sdpxens similar 10 zinc ringer protein zoo ^rx. sapiens j 




4300 


AJ3 11847 


lNiwuuaiia idDduuiu cm oropiasi nirviN/i ior rtsZi-iiKe protein (xtsZi 
gene), clone FtsZ2-2 


0.55 


4301 




H. sapiens (D18S480) DNA segment containing (CA) repeat; clone 

^ijrivx^zuycy, single tghq. 


oJti-12 


4302 




Mus musculus 18 days embryo cDNA, RIKEN full-length enriched 

1 i Vlltl f*7 r»1 f\Ttf*' 1 1 1 f\C\ 1 QT5 1 0 -fill 1 ineorf ra/nion/->o 

iiuidiy, uuue. 1 1 1 uu i iuii insert sequence 


f\ Ann 


4303 


AB047606 


Macaca fascicularis brain cDNA, clone:QnpA-10881 


0 


4304 




xiumu sdpicxib rLjjzAJtroo lis, cionc AlAIUouji/ 




4305 


AF375597 


Mus musculus medium and short chain L-3-hydroxyacyl-Coenzyme A 
aenyarogenase (ivLscnaci/ gene, exons 2 tnrougn o, ana complete cas; 
nuclear gene for mitochondrial product 


0.75 


4306 


AP000101 

ruruuUJv x 


riomo sapiens genomic uin a, ctiromosome z iqzZ. l, JJ2ib22o-AML# 

TPOinn f*1tf\fiA*l"l 1 f\ 1— C 52 f Amnlptp ctvniAn^A 
ACgHJlL, ^lVJIlt .XV 1 UJJO, CVJllipiCLC SCUUCI1CC 


U.Uol 


4307 


AK01963? 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
xiuioiy, wuuct^ju^jjiviuj, iuii insert sequence 


U.vDo 


4308 


X94S% 


xv.v/u.igmaui jl'injtv iui iiiiciiicu uaixswriueci spacer 1 1 oz ox ruiN/v unii 


u.u /y 


4310 


AV077174 

A X V/X. / X / *T 


Arabidopsis thaliana sequence flanking 3 1 end of Ds-GeneTrap 

xiioci Liuii nun i line vjizoju 


n o 
u.z 


4311 


AC092063 


Homo sapiens clone RP1 1-745H7, complete sequence 


0.14 


4312 




riomo sapiens oi-/o- , siauie vunenrin-DOuna jjjna iragment 

XJTRT71 QVTM"1 R 
r±JLi,r 17Y nvti o 


e-ii / 


4313 


Z82088 


Caenorhabditis elegans cosmid ZK256, complete sequence 


0.015 


4314 


AC006709 


Caenorhabditis elegans cosmid Y119C1A, complete sequence 


0.17 


4315 


AF016620 


Mus musculus Rb-8 neural cell adhesion molecule long form 
(RNCAM) mRNA, partial cds 


6.2 


4316 


AK013121 


Mus musculus 10, 11 days embryo cDNA, RIKEN full-length 
enriched library, clone:2810421E14, full insert sequence 


1.9 


4317 


XM 033379 


Homo sapiens KIAA1607 protein (KIAA1607), mRNA 


4E-09 


4318 


AL355864 


Human DNA sequence from clone RP4-803A2 on chromosome 1, 
complete sequence [Homo sapiens] 


1.4 
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4319 


AHO 01422 


Plasmodium falciparum chromosome 2, section 59 of 73 of the 
complete sequence 




4320 


ATI CI A 1*1 

AL 15743 7 


Homo sapiens mRNA; cDNA DKFZp434E1512 (from clone 
DKrZp4i4rsi!5l2), partial cds 




4321 


T TO 1 A1 O 

U31929 


Human orphan nuclear receptor (DAX1) gene, complete cds 


DA 


4322 


Ar 3 3 8875 


Hylobates moloch NAN03 cytochrome b gene, partial cds; and tRNA- 
Thr, tRNA-Pro, and tRNA-Phe genes, complete sequence; 
mitochondrial genes for mitochondrial products 


A 

U.O 


4323 


VA/1 CAO 


Human oLJr i gene ior secretory leujcocyte protease lruuDiior 


9 
z 


4324 


TTA*7A1 Q 


riuman lmrouno oeiiciency virus type i otiouj ^env; gene, partial cas 




4325 




xiomo sapiens enromosome -«\pzz yjo-xj> 


ft ft91 


4326 


U17081 


Human fatty acid binding protein (FABP3) gene, complete cds 


6 


4327 


AJ24V36D 


Plasmodium falciparum partial mRNA for delta-aminolevulinic acid 
dehydratase 




4328 


AF090936 


Homo sapiens clone HQ0582 


0.006 


4329 


X97970 


Athaliana mRNA for RNA nelicase 


A ATQ 

U.U /y 


4330 


XM 047794 


Homo sapiens KIAA1641 protein (KIAA1641), mRNA 


e-156 




AF334172 


Zea mays clone B7 chromosome 4 centromeric region 


3.9 


4332 


AJ401041 


Human immunodeficiency virus type 1 proviral partial gpl60 gene for 
envelope protein, strain J_97DC.KTB147 


0.64 


4333 


XM038175 


Homo sapiens K1AA0872 protein (K1AAUS 72), mRNA 


A 


4334 


Y16473 


Chlamydomonas reinhardtii chloroplast DNA for rpsl8 gene, and 
011570 


U.Uzo 


4335 


AF1 90381 


Eucalyptus tetragona photosystem Q(B) protein (psbA) pseudogene 
and tRNA-His (trnH) gene, partial sequence; chloroplast genes for 
chloroplast products 


a AAC 


4336 


AE006276 


Lactococcus lactis subsp. lactis IL1403 section 38 of 218 of the 
complete genome 


ft 90 

u.zz 


4337 


AK020646 


Mus musculus adult male urinary bladder cDNA, RJJsJiN iuii-iengtn 
enriched Ubrary, clone:9530077 A04, full insert sequence 


2.3 


4338 


Z69666 


Human DNA sequence from cosmid 24F8 from a contig from the tip 
of the snort arm or chromosome 16, spanning 2Mb oi lop 13. 3. 
Contains ESTs, repeat polymorphism and CpG island 


0.018 


4339 


AF0 17254 


West Nile virus nonstructural protein NS5 (NS5) gene, partial cds 


0.73 


4341 


NM 008645 


Mus musculus murinoglobulin 1 (Mugl), mRNA 


ft ftftftftO 


4342 


AF1 14927 


Qjirrhnrfvmv^fK: nsicf-Ari gnuc ORSl ^mall QiiViiiTlit Tlhosomal RNA 

gene, mitochondrial gene for mitochondrial RNA, complete sequence 


0.25 


4343 


AL451057 


Human DNA sequence from clone CTD-2010K20 on chromosome 10, 
complete sequence [Homo sapiens] 


0.17 


4344 


AK022875 


Homo sapiens cDNA FLJ12813 fis, clone NT2RP2002503, weakly 
similar to ZINC FINGER PROTEIN 45 


1.7 


4345 


D10910 


Arabidopsis thaliana Atpk7 gene for serine/threonine protein kinase, 
complete cds 


0.61 



374 



INSDOCID: < WO 021 4500A2_I_> 



WO 02/14500 



PCT/US01/25840 





Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 




T> T 7 A T TTT7 
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4346 


X53096 


o.duicub gencb cncouiug oauyoi jljina memyrtransterase anct oau"6i 

rp ctri r*<"i An pnHnnnrlpQcp 

IbdlilWliUll CUUvlUUvtlWCtdV 


0.2 


4347 


M57855 


Human protein S-beta pseudogene, exons 5 and 6 


0.2 


4348 




•j.ucicvioiac wu oiiiuauiue vi reading rrame ijrvr y jtLilo /c 


0.065 


4349 




ivicutciiiuL/Uwcub jdiuidscnii secuon 01 idu or trie complete genome 


0.54 


4350 


Y09047 


Mmusculus pxF gene, strain 129VJ 


0.002 


4351 


XM 027332 


TTomfi ^fnripn^ rihr»Qnm?i1 ■nrntpin T CD T>T *\ft\ mT?"M A 


U.UUU1 


4352 


AF106029 


Thomomys mazama clone 12-8 microsatellite sequence 


0.25 


4353 


AL137428 


Hnmn caniptic mPMA* rfiWA TYK"E , 7 r »V7/£ 1 XT 1 *2 1 'J ^{Vnm s*l/vna 
riuiiiu SapiCIEs llUVrN/-\, CJ-TN A 1/FkTZfP /OXINxOZj ^ITOni CiOIlC 

DKFZp761N1323) 


0.028 


4354 


L81915 


no i no bdpicnb Vbuucione A go JTOm Jtl /O) JL/JN A. sequence, 
cnrnnlptp QPniipnp.p 


\J.\iD 


4355 


JO 1291 


oUyUGall /a aCCU alVJIdgC piUlCUl dipiui SuDlUlll TTirTl3 \>D^, IT01I1 gHlC~ 
a1nhfl*-a1r>ha 916 


j. i 


4356 


NM 006649 


l-T/^tTiri Wntpnc Cf*m1f\oi/val1v rf ffifi^w? r»nf*vn /vHt/^ar n-nii' nn« 1 

xx\jiiiu odjjidio sci uiu^iwuiY uciiiicu vAiioii cancer aiiiitien xo 


u.ov 


4357 


AL137635 


Homo sapiens mRNA; cDNADKFZp434E175 (from clone 
DKFZM14F 1 7M 


0. 1 


4358 


Y14422 


Mmusculus DNA for retinal protein 


0.23 


4359 




rxuiiiaxL geuc xur z^xogiuuiraie uenyurogenase, exon l sequence 




4360 


D14337 


Yeast pckl+ gene for protein kinase, complete cds 


0.085 


4361 




ivxuuoc udiiu j oiuv/u cAcnange protein gene, complete cos 


1 Q ! 

i 


4362 


XM 048747 


Homo sapiens KIAA1223 protein (KIAA1223), mRNA 


o.69 ; 


4363 


M97514 


Saccharomyces douglasii mitochondrial cytochrome c oxidase subunit 
I (COXI) gene, complete cds 


0.19 


4364 


D10910 


rvraDiuopsis ma nana /\ipK / gene ior senne/tnreontne protein i&nase, 
complete cds 


0.53 


4365 


AK015420 


ivius museums aauit male testis cjjjna, kuviin iull-iengui enncned 
library, clone:4930448I18, full insert sequence 


0.003 


4366 


U82487 


■Drancmosionia nonuae engraueo protein Ampni nn^ mKJN A, 
complete cds | 


0.22 


4367 


AB052798 


renne ioamy virus env gene ior envelope protein, complete cds, 
clonerHFFV 


0.08 


4368 


AY027174 


/vrauiaopsis inanana sequence nan King j end ox us-vjene 1 rap 
insertion from line GT2850 


0.25 


4369 


AJ298684 


jrnyuoxeia bp. ivjlpj avlz nuiocnonanai Airo gene ior Air syntnase a 

Liioiii oUUUUil \j 


u.o / 


4370 


NM 013696 


Mus musculus thyrotropin releasing hormone receptor (Trhr), mRNA 


0.000003 


4371 


AF176830 


Dictyostelium discoideum polyphosphate kinase (PPK) gene, complete 
cds 


0.83 


4372 


AB036665 


Bacteriophage WO genes, ISW1 inserted sequence, partial and 
complete cds 


0.17 


4373 


AJ3 10187 


Prevotella albensis dpp-IV gene for dipeptidyl peptidase IV 


1.6 


4374 


L77040 


Homo sapiens (subclone 8_cll from PI H22) DNA sequence 


0.025 
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P VALUE 


4375 


AJ277162 


Beta vulgaris mRNA for putative Cdc2-related protein kinase CRK2 


0.009 


4376 


K02781 


Mouse PND gene encoding atrial natriuretic factor, complete cds 


0.24 


4377 


AL110166 


DKFZp586M1424) ! 


0.69 


4378 


AE006400 


T .ariricnmi^ lactic: qii1iqt> lactic TT„140'3 section 162 of 2 18 of the 
comolete genome 


6.4 


4379 


AF3 10887 


Dictyostelium discoideum RacB (racB) gene, partial cds; and 
unknown gene 


0.23 


4380 


A X V/ZrJJJ / 


Orv-ra cofivn micro^atellite MRGS6R2 containinff rTAAY5C20 

\j\.y ca oauva iiixisivjoaxwixixw ivjxvvjjvio^> tsuixicuxuiig ^ i niny^viiv, 

genomic refine rice 


2.3 


4381 


AL133038 


Homo sapiens mRNA; cDNA DKFZp434D1835 (from clone 
DKFZd434D1835, 


0.00004 


4382 


M37760 


Mouse serine 2 ultra high sulfur protein gene, complete cds 


0.083 


4383 


AF151698 


A crm}mr k tf y Tinm fnmp'fo^i^Tic f*rvntic nlacmtH nAtf , *5J? tr^nsjcrinrtoTial 

iTAtil VJL/clCLd 11X111 lIXIXiwXCtwAClid WljrLJlxfc* LJlCUlXlXU piVLVyJ O IXClXldWXALJLXVJXuXX 

TMiiY^Qrvr GlnTT homology GlnP/GInO homoloff rortative 
oxidoreductase and AeaE homoloe eenes comnlete cds 


0.25 


4384 


X59370 


X.laevis POMC-A gene for proopiomelanocortin-A 


0.007 


4385 


AF244933 


Orvrtnl noiK rnvnicnliifi iirteTferon pamma nreciirRor mRNA comnlete 

cds 


0.66 


4386 




Human DNA sequence from cosmid 91K3, Huntington's Disease 
Reeion chromosome 4nl6 3 contains CoG island 


0.077 


4387 


AK027136 


Homo sapiens cDNA: RJ23483 fis, clone KAIA04052 


1.7 


4388 


AK027136 


Homo sapiens cDNA: FLJ23483 lis, clone KAIA04052 


6.6 


4389 


AL589705 


Wnman DNA G&nnpnce. from clone RP1 1-64A/T7 on chromosome 6 

complete sequence [Homo sapiens] 


0.63 


4390 


AF035192 

ill \J ~J ~J A. <S 


Pommrlnl^o/^fpr rAfMnc QJavpr.DTY fiTWl'fM n /V»cyRi UP tip* fwm Til Pff 1 pHc* 

v^HiiipyioudCLcr rcwtixb o-idj ci ~xv a .^v piuicui ^wad^ gouc, iajuiljiwlv wis, 

tRNA-Arg gene, complete sequence; and putative histidine kinase 
ffene nartial cds 


0.34 


4391 


AK021269 


A>TnQ miicnilriQ 1 2 rtavs emhrvo <*nina1 corri cDNA RUCEN fiiU-lenffth 
enriched library, clone:C530007A02, full insert sequence 


0.69 


4392 


VTv/f 010404 

Al VI 1/JvtUT 


T-Tomo <5anien<; acid oho^nhatase 1 soluble TACPli mRNA 


0.007 


4393 




Mus musculus, Flt3 interacting zinc finger protein 1, clone 
MGC-7201 1MAGK-3482191 mRNA comolete cds 


2 


4394 


/UVw/JUt 


Mus musculus adult male tongue cDNA, RIKEN full-length enriched 
lihrarv clone*2310032D21 fnll insert seauence 


1.6 


4395 


L04802 


Silkmoth apoptosis preventing protein (p35) gene sequence 


| 0.21 


4396 


AB026599 


Neogerris parvulus mitochondrial gene for 16S ribosomal RNA, 
partial sequence 


0.027 


4397 


BC005581 


Mus musculus, RKEN cDNA 2310022K15 gene, clone MGC: 11736 
IMAGE:3968615, mRNA, complete cds 


0.22 


4398 


XM 033374 


Homo sapiens myosin, heavy polypeptide 7, cardiac muscle, beta 
(MYH7), mRNA 


0.23 


4399 


U31534 


Isla Vista virus MC-SB-1 nucleocapsid protein mRNA, complete cds 


0.54 
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4400 


XM_005318 


Homo sapiens hypothetical protein FLJ11125 (FLJ11125), mRNA 


0.44 


4401 


U76670 


Arabidopsis thaliana recA-like protein (AtDMCl) gene, complete cds 


0.051 


4402 


AK009448 


Mus musculus adult male tongue cDNA, RIKEN full-length enriched 
library, clone:23 1002 1K08, full insert sequence 


1.6 


4403 


AY003917 


Phytophthora infestans isolate 94-52 NADH dehydrogenase subunit 4 
gene, partial cds; mitochondrial gene for mitochondrial product 


0.16 


4404 


AC084160 


Caenorhabditis elegans cosmid Y73B3B, complete sequence 


0.21 


4405 


AF330120 


Homo sapiens voltage-gated sodium channel type in alpha subunit 
(SCN3A) gene, exon 4 


2.4 


4406 


AK022135 


Homo sapiens cDNA FLJ12073 fis, clone HEMBB1002387 


0.49 


4407 


AF332583 


Homo sapiens stratum corneum chymotryptic enzyme (KLK7) gene, 

A «r «f m X X Cf 7 

complete cds, alternative transcripts 


1.7 


4408 


XM 043049 


Homo sapiens hypothetical gene supported by AL080215 
(LOC92138), mRNA 


0.7 


4409 


AK017682 


Mus musculus 8 days embryo cDNA, RIKEN full-length enriched 
library, clone:5730465I09, full insert sequence 


2.1 


4410 


AJ409162 


Solanum ochranthum LINE retrotransposon within partial ure gene 
for urease, exons 6-8 


0.025 


4411 


AF183584 


Papio hamadryas tyrosinase gene, exon 2 


0.24 


4412 


AK010138 


Mus musculus adult male tongue cDNA, RIKEN full-length enriched 
libraiy, clone:2310069P03, foil insert sequence 


0.56 


4413 


AF112141 


Oryzias latipes homeodomain transcription Jactor (eng2) mRNA, 
complete cds • 


0.021 


4414 


AK022458 


Homo sapiens cDNA FU12396 fis, clone MAMMA1002758 


4.9 


4415 


AK024159 


Homo sapiens cDNA FU14097 fis, clone MAMMA1000760 


0.00007 


4416 


AB060834 


Macaca fascicularis brain cDNA clone:QtrA-10686, foil insert 
sequence 


2.1 


4417 


AF063864 


Schizosaccharomyces pombe essential nuclear protein Mcm3p 
(mciu3+) gene, complete cds 


0.21 


4418 


AF233288 


Drosophila melanogaster WDS (wds) and egghead (egh) genes, 
complete cds 


1.3 


4419 


AE003106 


Drosophila melanogaster genomic scaffold 142000013386018, 
complete sequence 


0.007 


4421 


AB001919 


Oryza sativa DNA for phospholipase D, complete cds j 


0.23 


4422 


AF2 16290 


Mus musculus heterochromatin protein 1 alpha mRNA, complete cds 


0.66 


4423 


AF028828 


Homo sapiens Tax interaction protein 43 mRNA, partial cds 


0.18 


4424 


AF011446 


Mus musculus granzyme K gene, complete cds 


1.9 


4425 


AF200375 


Ceratosolen bisulcatus cytochrome oxidase subunit I (COI) gene, 
partial cds; tRNA-Leu gene, complete sequence; and cytochrome 
oxidase subunit n (COII) gene gene, partial cds; mitochondrial genes 
for mitochondrial products 


0.68 
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4426 


AF328031 


Zygoballus rufipes NADH dehydrogenase subunit I (ND1) gene, 
partial cds; mitochondrial gene for mitochondrial product 


1.7 


4427 


AF289983 


Alouatta stramineus cytochrome b gene, partial cds; mitochondrial 
gene for mitochondrial product 


0.72 


4428 


Z73253 


S.cerevisiae chromosome XII reading frame ORF YLR081w 


0.19 


4429 


NM 018706 


Homo sapiens hypothetical protein DKFZp762Ml 15 (KIAA1630), 
mRNA 


1E-48 


4430 


AF298209 


DictyosteUum discoideum non-LTR retrotransposon TRE5-B, 
polyprotein (gag) and group-specific antigen (pol) genes, complete cds 


0.74 


4431 


AJ276991 


Mus musculus partial mRNA for ClpX protein 


0.079 


4432 


AB047240 


Human endogenous retrovirus HERV-K(H) DNA, complete sequence 
and flanking region 


0.13 


4433 


L40389 


Candida glabrata ERG1 1 gene, complete cds 


0.027 


4434 


K02592 


Mouse Ins DNA; cellular component that mediates integration and 
excision of polyoma virus DNA 


0.38 


4435 


D31782 


Snake (green habu) gTgTBP gene for TATA-box binding protein 


L5 


4436 


AK019649 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4930480I16, full insert sequence 


0.25 


4437 


U36777 


Carassius auratus aldolase C mRNA, complete cds 


0.094 


4438 


AF036690 


Caenorhabditis elegans cosmid F09C11 


0.72 


4439 


NM 012750 


Rattus norvegicus Tyrosine kinase receptor ligand 2 (Retl2), mRNA 


0.63 


4440 


M74000 


Brugia malayi myosin heavy chain gene, complete cds 


0.073 


4441 


XM 042656 


Homo sapiens LOC87184 (LOC87184), mRNA 


0.66 


4442 


AF335329 


Kallichroma tethys isopenicillin N synthase (pcbC) and alpha- 
aminoadipyl-cysteinyl-valine synthetase (pcbAB) genes, complete cds 


6.1 


4443 


XM_037269 


Homo sapiens region containing hypothetical protein FLJ10549; 
hypothetical protein FU105 17 (LOC82252), mRNA 


0.26 


4444 


AF130079 


Homo sapiens clone FLC0578 PR02852 mRNA, complete cds 


0.24 | 


4445 


AF086241 


Homo sapiens full length insert cDNA clone ZD29F04 


0.51 


4446 


AF262583 


Schedorhinotermes sarawakensis 16S mitochondrial ribosomal RNA 
gene, partial sequence; mitochondrial gene for mitochondrial product 


4.6 


4447 


AF170550 


Arabidopsis thaliana germin-like protein 7 (GLP7) gene, complete cds 


2 


4448 


AC084500 


Caenorhabditis briggsae cosmid G13G15, complete sequence 


0.66 


4449 


M23360 


Trypanosoma brucei cytochrome C mRNA, partial cds 


2.1 


4450 


AF236387 


Schizdsaccharomyces pombe zinc finger protein Grtl (grtl) gene, 
complete cds 


0.076 


4453 


XM 010610 


Homo sapiens hypothetical protein (FLJ10287), mRNA 


0.027 


4454 


AY021908 


Oryza sativa microsatellite MRG4233 containing (TA)X42, closest to 
marker C563, genomic sequence 


0.025 


4455 


I AK021675 


Homo sapiens cDNA FIJI 1613 fis, clone HEMBA1004012 


0.073 
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4456 


NM 030995 


Rattus norvegicus Microtubule-associated protein la (Mapla), mRNA 


1.8 


4457 


AFO 15298 


Human herpesvirus 6 IE-2 protein (ie2hom) gene, complete cds 


0.058 


4458 


XMJ) 11297 


Homo sapiens hypothetical protein FLB3342 (FLB3342), mRNA 


0 0000009 


4459 


AE006714 


Sulfolobus solfataricus section 73 of 272 of the complete genome 


0.002 | 


4460 


AE001434 


Plasmodium falciparum chromosome 2, section 71 of 73 of the 
complete sequence 


0.004 


4461 


AF068270 


Rachiplusia ou nuclear polyhedrosis virus protein-tyrosine 
phosphatase (ptp), late expression factor-2 (lef-2), occlusion matrix 
protein (polh), capsid-associated protein, and protein kinase (pkl) 
genes, complete cds; and unknown genes 


0.027 


4462 


AK015582 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
libraiy, clone:4930477O15, full insert sequence 


0.08 


4463 


XM 039778 


Homo sapiens HSPC047 protein (HSPC047), mRNA 


0 


4464 


S73580 


putative polymerase... capsid protein [potato aucuba mosaic potexvirus 
PAMV, Genomic RNA Complete, 6 genes, 7059 nt] 


0.071 


4465 


XM 016858 


Homo sapiens LOC87857 (LOC87857), mRNA 


3E-13 


4466 


AE004427 


Vibrio cholerae chromosome n, section 84 of 93 of the complete 
chromosome 


0.69 


4467 


AK025346 


Homo sapiens cDNA: FLJ21693 fis, clone COL09609 j 


4.9 


4468 


U89926 


Drosophila melanogaster cut gene, partial sequence 


0.22 


4469 


X57171 


D.caryophyllus CARSR12 gene 


2 


4470 


L36434 


Mus Musculus basic domain/leucine zipper transcription factor 
mRNA, 3' end of cds 


2 


4471 


AF1 15469 


Arabidopsis thaliana cultivar Landsberg erecta Arac2 (Arac2) gene, 
complete cds 


1.9 


4472 


AB063169 


Petunia x hvbrida tazl gene for tanetum-snecific zinc fineer nrotein 1 
promoter region and partial cds 


0.64 


4473 


L41691 


Bos tauras (clones L6, C15, C12, C8 C9 C17 C19 C12 C18 C5 
C6, C3, C13, C10, C47) mRNA, 3' end of cds 


0.47 


4474 


U61463 


Human myosin Vila (MY07A) gene, 5 1 exon 36 


0.0007 


4475 


NM 008992 


Mus musculus ATP-binding cassette, sub-femily D (ALD), member 4 
(Abcd4), mRNA 


1.9 


4476 


XM 050942 


Homo sapiens ribosomal protein L6 (RPL6), mRNA 


0.00009 


4477 


AK005146 


Mus musculus adult male cerebellum cDNA, RIKEN full-length 
enriched libraiy, clone: 1500004F14, full insert sequence 


0.71 


4478 


AF345089 


Hepatitis C virus isolate SCpreSC2cl4 nonfunctional polyprotein 
gene, partial sequence 


0.71 


4479 


AK008812 


Mus musculus adult male stomach cDNA, RIKEN full-length 
enriched libraiy, clone:2210403E17, full insert sequence 


6.4 


4480 


XM 006983 


Homo sapiens A kinase (PRKA) anchor protein 3 (AKAP3), mRNA 


1.6 


4481 


X06438 


Soybean leghaemoglobin gene lba promoter region 


0.19 | 


4482 


AF020051 


Blackcurrant reversion virus RNA2 polyprotein mRNA, complete cds 


0.71 
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4483 


XM 051560 


Homo sapiens peroxisome proliferative activated receptor, gamma 
(PPARG), mRNA 


0.6 


4484 


AB023481 


CvDrintis carnio mRNA for mitogen activated nrotein (MAP") kinase 
p38, complete cds 


6.3 


4485 


AJ250933 


Homo sapiens partial SBP1 gene for survival interacting protein 1, 
exons 3-4 


2.2 


4486 


XM 039989 


Homo sapiens microfibrillar-associated protein 3 (MFAP3), mRNA 


0.69 


4487 


XM 047554 


Homo sapiens similar to KIAA1473 protein (H. sapiens) (LOC92830), 
mRNA 


6.7 


4488 


AB042411 


Homo sapiens strg gene for striatum-specific G potein-coupled 
receptor, complete cds 


2 


4489 


XM 035221 


Homo sapiens ancient ubiquitous protein 1 (AUP1), mRNA 


0.16 


4490 


D 14061 


Schizosaccharomyces pombe mRNA for ORF 


0.015 


4491 


AE001426 


Plasmodium falciparum chromosome 2, section 63 of 73 of the 
complete sequence 


0.074 


4492 


M18349 


Rat leukocyte common antigen (L-CA) gene, exons 1 through 5 


0.16 


4493 


AF020726 


Tetrahymena thermophila phosphoglucomutase (PGM1) gene, 
comDlete cds 


0.058 


4494 


M80241 


Caenorhabditis elegans unc-6 gene, complete cds 


2.1 


4495 


U67234 


Human clone HS5.29 Alu-Ya5 sequence 


0.083 


4496 


AK000864 


Homo sapiens cDNA FU10002 fis, clone HEMBA1000046 


0.00001 


4498 


AK024393 


Homo sapiens cDNA FLI1433 1 fis, clone PLACE4000320 


0.064 


4499 


AF071754 


Mus musculus DNA cytosine methyltransferase mRNA 


0.21 


4500 


XM 047325 


Homo sapiens hypothetical protein LOC57 187 (LOC57187),- mRNA 


0 


4501 


XM 043434 


Homo sapiens villin-like (VHX), mRNA 


1.8 


4502 


NM 032675 


Homo sapiens hypothetical protein MGC10954 (MGC10954), mRNA 


0.0008 


4503 


AL589203 


Human DNA sequence from clone RP1 1-39 1F23 on chromosome 6, 
complete sequence [Homo sapiens] 


0.0003 


4504 


M86544 


Cow prostaglandin F synthetase II (PGFSH) mRNA, complete cds 


0.66 


4505 


AF339782 


Homo sapiens clone IMAGE: 1871856, mRNA sequence 


0 


4506 


Z74196 


S.cerevisiae chromosome IV reading frame ORF YDL148c 


6.2 


4507 


AF012751 


Danio rerio olfactory receptor protein 2.4 mRNA, complete cds 


6.8 


4508 


AF184796 


Arabidopsis thai i an a inhibitor tagged site ITS29 genomic sequence 


1 


4509 


AE006187 


Pasteurella multocida PM70 section 154 of 204 of the complete 
genome 


1.8 


4510 


AJ272171 


Podospora anserina degenerate gypsy-like retrotransposon Yeti 


0.074 


4511 


AK024656 


Homo sapiens cDNA: FLJ21003 fis, clone CAE03685 


1E-90 


4512 


NC_002629 


Lithobius forficatus mitochondrion, complete genome 


0.066 


4513 


X87205 


M.fascicularis mRNA for metalloprotease-like, disintegrin-like 
protein, IVa ' 


4.9 
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n \ r AT TTT7 

P VALUE 


4514 


XM 011595 


Homo sapiens CDC10 (cell division cycle 10, S. cerevisiae, homolog) 
TCDCIO^ mRNA 


5E-47 


4515 


AK021537 


nuirn? bapzciib cjl/xva xvljii* 1 US, Clone llJiIvlrjA10i/1734, 

uiuuwidivijr axxxxxXoX ivl w/\x^xjLEixvx1N - 1 1 xX\^l^UJtxjjlJi\. 


0.068 


4516 


AB044877 


RflllllQ ftJihiillllQ ONlf A mir , rnciit*»11it<» 


0.076 


4517 


AC079391 


numu oauiciio viuuc i ly-^z^r jiviii , complete sequence 


O.OUOOl 


4518 


AK000932 


Homo sapiens cDNA FLJ10070 fis, clone HEMBA1001581 


0 


4519 


AE001376 


jridbinocuuin icuciparuin cnromosome z, section 13 ol 73 01 the 
complete sequence 


0.24 


4520 


XM 040095 


Homo Q»T>ien<; Hvnnthf»tira1 nrnt*»in (T\\C\*7T>A.'\A-CXC\'\ 1 (W mDXTA 


l.O 


4521 


AF224317 


Ojalliic oalliio prn 7 prl -7 fp'FV«.T\ tnPMA rnmnlAtfi /■vie 
vjcuxuo gcuiua a xx^ixjlgvi ' v 1 ) liLtviN lAHlipiCie COS 


1.3 


4522 


AE001155 


Borrelia burgdorferi (section 41 of 70) of the complete genome 


2 


4523 


AJ313130 




1 


4524 


AF020187 


/uuuwuiiuud dxiicriuciiiuiri ecusyieroia recepror ^AflmwjK a i y mKJN A 
complete cds 


6.3 


4525 


AP000249 


rxuiuu bctpiciib gcuoiuic uisjt\ ? cnromosome ziqzz. i, \jz i oZZo-ajvil 
region, clone:B762015, complete sequence 


2E-19 


4526 


AF2 18380 

ill ^lU<^vv 


A/Til's mncrplllllQ f^RT TsJ^ ffTVIri^ opiip rnmnlAtA />Hr 


u.oy 


4527 


AP000601 


nxaLfiuupaio mriiifHid gcllUxxxxlf x-/IN/\, UillUlQOoOIIlC J, x X CIOITC tVIJI ^ l^r 


V.VZy 


4528 


AE001140 


Borrelia burgdorferi (section 26 of 70) of the complete genome 


0.64 


4529 


AL358852 


1-Tlim^n T~YW^A CP/mon^a Avmti pinna DDI *2 1 TXT A ,,1, .-. . , /T 

nuiLian 1w/ina sequence rrom clone Xrioi /1N9 on cnromosome o, 
wv/tii|ixcLw ocv^uciiwc i^xxvjiiio sapiensj 


2.2 


4530 


NM 024^96 

XilXVX X/A'T.J 7U 


numu bapicnb uypotxieucai proiein tjlj 1Z54 / (^rl^j 1Z<>4 / j, niKJN A 


0.82 


4531 


X12918 ! 


Human Na+,K+ ATPase gene part of intron XVI (alpha HI isoform) ; 


0.0003 


4532 




v^uciiui iiauui ub ciegans uoporoyosin isoiorm 1 v mKJNA, complete cds 


0.57 


4533 


M21590 


M.mycoides capri Gly-tRNA gene 


0.48 


4534 


AF250227 


xxvjiLLvj oatyicxio duciiyiyi cycidbe type vx gene, partial cos 


U.1V 


4535 


Z36061 


S.cerevisiae chromosome 11 reading frame ORF YBR192w 


0.071 | 


4536 


AF006510 


Endogone pisifonnis 18S ribosomal RNA gene, partial sequence, 
internal transcribed spacer 1, 5.8S ribosomal RNA gene and internal 
transcnDea spacer z, complete sequence, ana 2oS nDOSomal RNA 

o/=tip nartial cpniipnrp 
^wxiw, LJuX t leu om|UCUviC 


O.Uo 


4537 


D88262 


Pisum sativum PsCHS5 gene for chalcone synthase, complete cds 


0.075 


4538 


L31504 


Neurospora crassa cytoplasmic dynein heavy chain (ro-1) gene, 
complete cds 


2.3 


4539 


S81605 


S100 beta =neural calcium binding protein beta subunit {5' region} 
rats, Genomic, 2279 nt] 


0.66 


4540 


X95275 


P.felciparum complete gene map of plastid-like DNA (IR-A) 


0.23 


4541 


U18868 


Squalus acanthias multifunctional protein CAD mRNA, complete cds 


0.024 
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4542 


AF184835 


Arabidoosis thaliana inhibitor taffper! sifp 1TS68 ffenomic spoiipucp 


0.71 


4543 


AL513350 


Human DNA sequence from clone RP1 1-509D8 on chromosome 9, 
comnlete seouence ITTomo saniensl 


0.64 


4544 


Z68107 


Caenorhabditis elegans cosmid F48C5, complete sequence 


2 


4545 


AKG02184 


Homo saniens cDNA FLJ11322 fis clone PLACE 10 10329 


0.71 


4546 


U00025 


C^aenorhahditis eleean^i cosmid PAR2 comnlete seanence 


0.46 


4547 


U39548 


Meloidogyne artiellia mtcuticline-1 gene, 5' upstream region 


0.64 


4548 


AK024271 


Homo saniens cDNA FLJ14209 fis clone NT2RP3003346 


0.0003 


4549 


AF248989 


Arabidopsis griffithiana chalcone synthase gene, promoter region and 
partial cds 


0.3 


4550. 


AF081208 


Monoclea gottschei subsp. gottschei tRNA-Thr/tRNA-Leu intergenic 
spacer, partial chloroplast sequence 


0.009 


4551 


XM 002770 


Homo sapiens hypothetical protein (FU10996), mRNA 


5.8 


4552 


AF035672 


Mns musculus MHC class I related nrotein 1 fMRl i sene comnlete 
cds 


1.1 


4553 


AF374466 


lV/facrohrachiuni intermedium 16S ribosomal RNA ffene nartial 
sequence; mitochondrial gene for mitochondrial product 


5.9 


4554 


S80559 


nHm-9==POTT domain <?ene mrosonhila melanoma ster mRNA 2171 

JjU-i.il x W VJ ViVJAiidJ.ll gV/ix^s |x^i\Jovj^fiixxa xlXbluAivj£^£iaLtsA, xixxVL^i^v, ^.x / x 

ntl 


0.009 


4555 


AY029489 


Homo sapiens aminophospholipid-transporting ATPase (ATP 10C) 
gene, exon 3 


1E-89 


4556 


NM 026255 


Mus musculus RUCEN cDNA 4930433D19 eene <4930433D19Riki 

X VALID UUlav UIUJ l\XuXwll Wl/lin. "f/JVTJJl/1/ ^T/JVTJJi/ X ylXlA^) 

mRNA 


2.2 


4557 


AJ276163 


Homo <«aniens nartial CrDH j?ene for uridine dinbo^nbo-fflncose 

dehydrogenase, exons 8-11 


0.0002 


4558 


NC 001566 


Apis mellifera ligustica mitochondrion^ complete genome 


0.19 


4559 


BC003453 


Mus musculus, calreticulin, clone MGC:6209 1MAGE:2655918, 
mRNA comnlete cds 


2 


4560 


XM 048784 


Homo sapiens mature T-cell proliferation 1 (MTCP1), mRNA 


0.62 


4561 


AF1 19845 


Homo sapiens PRO 13 04 mRNA, complete cds 


0.01 


4562 


AF251477 

ill. X » / # 


Plasmodium berffhei glutathione reductase £ene nartial cds 


0.053 


4563 


XM 018443 


Homo saniens ATPase Na+/fC+ transnorrinff beta 3 oolvnentide 
CATPlB3 , i mRNA 


2 


4564 


AF 145282 


Trichomonas vaginalis calcium motive P-tvne ATPase (C* A-2i ffene 

A XX^XlVJXllV/lltlO VOglimUS V<1XV»/11X111 UlUUVv X^**lJr|^W <^vxx oov \v»*^v gvuw, 

nartial cds 


0.11 


4565 


AF280548 


Homo saniens neuronilin-1 fNRPli eene exons 1 2 and nartial cds 

X XVlliw OOUlVlU llw 1X1 UU Mill X y^A 1 JL VX X y gwllVy VAW1W >'l ivi pututu vw 


2.1 


4566 


NC 002253 


Buchnera sp. APS plasmid pLeu, complete sequence 


0.24 


4567 


AJ238613 


Gallus gallus mRNA for teneurin-1 


0.61 


4568 


Y08926 


P.ialciparum mRNA for AARP1 protein, partial 


0.21 


4569 


Z86105 


A.thermophilum celA gene and manA pseudogene 


0.69 


4570 


AL583902 


S.pombe chromosome I pi p32A8 


1.1 


4571 


AL358951 


Human DNA sequence from clone RP3-456L16 on chromosome 6, 
complete sequence [Homo sapiens] 


0.072 


4572 


AJ001164 


Drosophila melanogaster additional sex combs cDNA sequence 


0.1 
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4573 


AF301004 


Schistosoma mansoni thioredoxin peroxidase 3 (TPx3) gene, complete 
cds 


0.19 | 


4574 


AF276995 


XenoDus laevis homeobox nrotein Siv4 9 fSiv4 0\ <*etif» narHai ^Ac 


L 


4575 


BC007465 


Homo saDiens clone 1MAGE'3R2R^4.7 mPNA 


u.uuuuz 


4576 


AL589824 


Human DNA sequence from clone RP1 1-76G1 1 on chromosome X, 
comnlete seauence FHomo ^aniensl 


o in 
U.l / 


4577 


AF151171 


Omithorhvnchim jmatimiQ mnnnncp A-nVincittiatia/i-neiVIin 1«Ita rrrwii7#-"h 
*m i uxfxxij xtvixtxo qii«iiiiuo iiiaiuiusc O UxlUbDlUllG/ xKlSlxllH~xLKe HiO vVXXl 

factor 2 receptor (M6p/Igf2r) gene, exons 2 and 3 and partial cds 


0.67 


4578 


AK000883 


Homo sapiens cDNA FLJ10021 fis, clone HEMBA1000568 


0 


4579 


XM_016630 


Homo saniens KIAA1046 nrotein fiCT A A 1 ndfi ■ mPM A 


i i 


4580 


NM 015906 


Homo sapiens transcriptional intermediary factor 1 gamma 

rriH'l GAMMAS tran^ierint variant altVha ttiRKIA 


z 


4581 


AF274053 


Xenopus laevis maxi-K potassium channel alpha subunit Slo mRNA, 
comnlete cds 


n io 
u.iy 


4582 


AB035874 


jrbcuuoregmd. pencueDuiyi miiocnonuriai genes tor small subunit 
ixu^A, uviN/v vol, idrge suounii rtvJN/v, paruai ana. complete 


o Am 


4583 


AF277081 


Ajellomyces dermatitidis strain 104 blastomyces yeast phase-specific 
orotein 1 flwsl^ trene nntrmlete r.Hc 


o 

U.ZO 


4584 


AF334561 


Homo sapiens chromosome Y landmark: proximal external boundary 
of P7 VCY inverted reneat 


O /CO 


4585 


X55197 


P voelii YEL6 eerie fhr ATPase 




4586 


L32141 


Tvii toch nn H Tin n T.vmjintTia Hicrtnr rfhricomal DMA lo-ro^ cnVknnit 

xvxxcvrv<axuxLuxxvriA ijjriiMiima UIo|yal llUV/sV/XIIcU x\lN/\ UAlfzJC SUDUlUL 


A AAQ 

U.UUo 


4587 


AF272001 


Ebola virus subtype Zaire strain Mayinga complete genome 


1.5 


4588 


AE004405 


v iuiiu ^uuiwiac ^lixiMUUMJlUG 11, bcvlion O^t OI 50 01 UlS COlXipiClG 

chromosome 


1 o 

l.y 


4589 


XMJ)10966 


Homo sapiens debranching enzyme (S. Cerevisiae) homolog 1 | 
(DBR1), mRNA 


2 


4591 


U68246 


Dictyostelium discoideum WacA (wacA) gene, complete cds 


0.002 


4592 


XM 047617 


Homo sapiens KIAA1349 protein (KIAA1349), mRNA 


0.027 


4593 


AK003428 


Mus mus cuius 18 davs embrvo rDNA PTK'PTsI fnll-l^noth pnHrhpH 
library, clone: 1 1 10004 J07 fall insert semienee 




4594 


XM 029072 


Homo sapiens LOC89256 (LOC89256), mRNA 


0.009 


4595 


L43549 


Buchnera aphidicola aspartyl-tRNA synthetase (aspS), thioredoxin 
reductase (trxB), seryl-tRNA synthase fserS^ ohosohoserine 
aminotransferase (serC), 3-enolrjyruvylshildmate-'5-phosphate 
synthetase (aroA), ribosomal protein SI (rpsA), integration host la> 


0.21 


4596 


AK023907 


Homo sapiens cDNA FLJ13845 fis, clone THYRO1000815 


0 


4598 


AB042649 


Watermelon spotted wilt virus nss, np genes for non-structural, 
nucleocapsid protein, complete cds, isolate: WS-O 


0.024 


4599 


D85547 


Bacillus sp. DNA for cytochrome c oxidase subunit I,n and HI, 
complete and partial cds 


0.23 
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4600 


L29252 


Human (clone D13-2) L-iditol-2 dehydrogenase gene, exon 4, exon 5, 
exon 6 and exon 7 


9F-OR 

iSLi WO 


4601 


AE003872 


Xylella fasti diosa 9a5c, section 18 of 229 of the complete genome 


1 S 

A.J 


4602 


U29924 


Human AMP deaminase (AMPD3) gene, exon 12 


0.0003 


4603 


AE000653 


Helicobacter pylori 26695 section 131 of 134 of the complete genome 


0.75 


4605 


AP001235 


Homo sapiens genomic DNA, chromosome 2pl 1.2, clone:cos607/7B8 


0.024 


4606 


AF207725 


Mustela lutreola isolate D23 mitochondrial D-loop, partial sequence 


0.028 


4607 


AE007377 


Streptococcus pneumoniae section 60 of 194 of the complete genome 


0.083 


4608 


XM 018430 


Homo sapiens LOC88330 (LOC88330), mRNA 


4E-27 


4609 


AF273674 


Plasmodium falciparum DNA-dependent RNA polymerase (TRNAP) 
gene, complete cds; nuclear gene for probable mitochondrial product 


0.002 


4610 


AY018919 


Oryza sativa microsatellite MRG1244 containing (AT)X23, genomic 
sequence 


6.7 


4611 


AB025642 


Arabidopsis thaliana genomic DNA, chromosome 5 BAC 
clone:T32G24 


2.1 


4612 


X59275 


D.melanogaster mRNA for posterior sex combs 


0.68 


4613 


AF052959 


Homo sapiens type XV collagen (COL15A1) gene, exon 6 


0.71 


4614 


U32730 


Haemophilus influenzae Rd section 45 of 163 of the complete genome 


0.025 


4615 


AF2 12976 


Bolitoglossa mexicana isolate MEX 6C cytochrome b gene, partial 
cds; mitochondrial gene for mitochondrial product 


0.076 


4616 


AF031493 


Cancer magister molt-inhibiting hormone precursor mRNA, complete 
cds 


0.22 


4617 


AJ245965 


Danio rerio mRNA for pbxy homeodomain protein (pbxy gene) 


0.068 


4618 


XM 031524 


Homo sapiens AF15ql4 protein (AF15Q14), mRNA 


0 


4619 


AL096876 


S.pombe chromosome n cosmid c418 


0.001 


4620 


XM_051190 


Homo sapiens similar to frizzled (Drosophila) homolog 4 (H. sapiens) 
(LOC93401), mRNA j 


6 


4621 


XMJ)05411 


Homo sapiens interferon, omega 1 (IFNW1), mRNA 


0.22 


4622 


Z50156 


D.discoideum gene for 34 kD actin binding protein 


0.027 


4623 


BC006182 


Homo sapiens, Similar to calmodulin 2 (phosphorylase kinase, delta), 
clone MGC:715 IMAGE:3528814, mRNA, complete cds 


2 


4624 


AF106933 


Drosophila melanogaster plexin B (plexB) mRNA, complete cds 


0.66 . 


4625 


U44392 


Human Down Syndrome region of chromosome 21, clone A12H1- 
1G10 


0.077 


4626 


AF145452 


Arabidopsis thaliana branched chain alpha-keto acid dehydrogenase 
El beta subunit (din4) mRNA, complete cds; nuclear gene for 
mitochondrial product 


6.3 


4627 


AE006315 


Lactococcus lactis subsp. lacris IL1403 section 77 of 218 of the 
complete genome 


0.62 
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4628 


X55978 


T. brucei ESAG 8 gene for a leucine-rich repeat family protein 


0.24 


4629 


U41529 


Caenorhabditis elegans cosmid EGAP3 


V.J7 


4630 


XM 039502 


Homo saoiens associated molecule with the domain nf STAAyf 
(AMSH), mRNA 


J.J 


4631 


AF141650 


Nyctalus noctula microsatellite P223 sequence 


1.5 


4633 


AL096887 


Human DNA seouence from clone RP6-1 R nn rhrnmncnm? 

Xq26. 1-27.3, complete sequence [Homo sapiens] 


0 02 


4634 


AL137458 


Homo saoiens mRNA* cDNA DKFZn434E2221 ffrnm clone 
DKFZp434E2221) 


0.074 


4635 


AE002292 


Chlamydia miiridarum, section 24 of 85 of the complete genome 


5.1 


4636 


XM 048200 


Homo sapiens KIAA0276 protein (KIAA0276), mRNA 


5.4 


4637 


AFO 17041 


Dictvostelium discoideum LTR-retrotransnoson Skinner nartial 
genomic sequence, 5' end 


0.16 


4638 


Z98853 


Caenorhabditis elegans cosmid R08A2, complete sequence 


5.7 


4639 


XM_017899 


Homo sapiens LOC89635 (LOC89635), mRNA 


0 


4640 


BC006943 


Mus musculus Similar to RUCEN cDNA 3230401 NO 1 eene clone 
MGC:6906 IMAGE:2655807, mRNA, complete cds 


2 


4641 


AF275345 


Lycopersicon esculentum MADS-box transcription factor jointless 
gene, complete cds 


0.62 


4642 


AK018612 


Mus musculus adult male cecum cDNA RUCEN fhlf-leneth enriched 
library, clone:91300 1 1L1 1, full insert sequence 


1.9 


4643 


AE006568 


Streptococcus nvoeenes Ml GAS strain SF370 section 97 of 167 of 
the comulete eenome 


6 6 


4644 


AF057019 


Dictyostelium discoideum interaptin (abpD) gene, complete cds 


0.67 


4645 


XM_010198 


Homo saniens immune fivcrpfrnljition •nolvenHnf^rinnnathv 

ixuuiv oopibiu uuiuuub ixjr da s>g Uiauuii, ^iwJLjrCJUUUwl. JJlUUdiJ-ijr, 

enteropathy, X-linked (TPEX), mRNA 


5E-29 


4646 


NM 022599 


Rattus norvegicus outer membrane protein (Omp25), mRNA 


1E-14 


4647 


U08455 


HIV-1 isolate 965 clone 26 from Malawi, envelope glycoprotein (env) 
gene, partial cds 


0.72 


4648 


AK023304 


Homo saoiens cDNA FLJ13242 fis clone OVARC1000578 




4649 


U20803 


Gallus gallus fibrinogen alpha and alpha-E subunit gene, exons 5 and 
6, and complete cds for alpha and alpha-E subunits 


2 


4650 


AF189768 


Homo saoiens lenlcocvte immiinncrlohnlin-lilcp rerrentnr S C\ TRS'N ophp 

complete cds 


0.025 


4651 


AK002614 


Mus musculus adult male kidnev rDNA RTTCPN fiill-lenpth enrirheH 

i*xup m iwwuiua wiui 1 1 miv i\m iiw wJ x\ i cv_i, x mi 1 viig ill ciuiwlicu 

library, clone:0610012K15, full insert sequence 


2.1 


4652 


AF010568 


Plasmodium falciparum microsatellite TA80 sequence 


0.05 


4653 


AF042283 


Schizosaccharomyces pombe sulfide dehydrogenase (hmt2) gene, 
complete cds 


0.63 


4654 


Y16262 


Daucus carota mRNA for neutral invertase 


0.22 


4655 


XM_035796 


Homo sapiens hypothetical gene supported by AF001893; AH005586 
(LOC91063), mRNA 


1.5 


4656 


AB008565 


Homo sapiens gene for deoxylribonuclease n, S'-upstream region 


0.73 


4657 


Y17275 


Lycopersicon esculentum p69a gene, complete CDS 


0.51 


4658 


X03100 


Human HLA-SB(DP) alpha gene 


0.23 
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4659 


AL121822 


nuiiidii lyiN-rt. sequence irom cione jwri 1 on curomosome -X.q21.2- 
21.33, complete sequence [Homo sapiens] 


1E-08 


4660 




i iicniiuiugd. ludxiunid section lzd oi ijo oi me complete genome 


0.18 


4661 




Mus musculus adult male lung cDNA, RIKEN full-length enriched 
uvLas y, oiuiic i^uuwzuzz, iuii lnseix sequence 


0.61 


4662 


AJ387942 


ragonia creuca cnioropiast tKfNA-Leu intron ana trnL-trnF intergemc 
spacer 


0.6 


4663 


AB053560 


v^araDub venustus suosp. iiaoningensis nutocnoncnai gene tor n ai jh 
dehydrogenase subunit 5, partial cds, isolate:73-MoVe99LB02 


0.027 


4664 


Z78708 


H.sapiens flow-sorted chromosome 6 Hindm fragment, SC6pA14H12 


2E-63 


4665 


AFO IftSfil 

/VI \J ± UJUJ 


jri<u»iiiutLiuiJLi icuuipaiuiii xiiicrobdiexiiie ia/j sequence 


U.lo 


4666 




Mus musculus Chetk-alpha gene for choline/ethanolamine kinase- 

alrtVtQ Avon A. f\ 7 ft 


A T3 
U. / J 


4667 


T 77fH£ 

i— ' / f\JD\> 


nomo adpieiiis ^suocione uy rrom ri riiyy l/jna sequence 


a r\<\n 
U.UU/ 


4668 


U23037 


\j rycioiagu s cumcuius euKaryonc initiation iactor 2r>-epsiion mRNA, 
complete cds 


2.3 


4669 


AYO^ 1 Q£7 


niV"i isoiaie iNCj^ij-iyyy irorn uoa poi poiyprotem (poij gene, 

yal Hell Uis 


A /CT 
U.O/ 


4670 


AF026069 


riorao sapiens pnospnomevaionate Kinase (HUivirJViKi) gene, partial 
cds 


6.1 


4671 




xiomo sapiens ivecv^ protein-iiKe *t ^injdi^vc.L4 < /, nxKTN A 


o.l 


4672 


M34662 


Human chaperonin (HSP60) non-functional pseudogene 2 


5.3 


4674 


AF009255 


rtomo sapiens putative ciuonae cnannei gene (C/JLC-JNoy, exons 14, 15, 
16, and 17 


2.1 


4675 




numu sapicub xvx v^A~ii Z07CIJ gene, partial cos 


A AAAA/1 
U.UUUU4 


4676 


NCJ)02087 


Plasmid pRL765, complete sequence 


0.7 


4677 


AF270613 


Homo sapiens clone 5qtel_ctg0024_cl03t3c80t7_stelrpt sequence 


0.078 


4678 


AP001467 


Homo sapiens genomic DNA, chromosome 21q22.3, 
clone:21B27A14, telomere region, complete sequence 


0.000001 


4679 


AY026945 


Rattus norvegicus TiG mismatch thymine glycosylase mRNA, 
complete cds 


2.1 


4680 


NM 017783 


Homo sapiens hypothetical protein FLJ20357 (FU20357), mRNA 


0.67 


4681 


Z57523 


TT ^anieir; CnG island DNA ffenntrrie MspI fratrmpnt Hnnf* 1 R^rfO 
reverse read cpgl82d9.rtlb 


3E-55 


4682 


U11761 


Cryptosporidium parvum KSU-1 heat shock 70 kDa protein (hsp70) 
gene, complete cds 


0.076 


4683 


AF142642 


Picea abies isolate D nadl gene, intron b/c; mitochondrial gene for 
mitochondrial product 


0.08 


4684 


XM 018315 


Homo sapiens hypothetical protein ELJ12078 (FLJ12078), mRNA 


0.64 
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4685 


TT609Q7 


vjraiius gauus jL'in/\ puiymera&e gamma hikjn a, nuclear gene 

CllCUlXLUg lmiUCJLLUllUlllU piULCLLL, partial COS 


O M 


4686 


1V11 / Oj / 


D.melanogaster (strain Ja-F) alcohol dehydrogenase gene (allele Adh- 

JT ^, wUllipiCLC ^Uo 


O OR 


4687 


V17965 

Al / &OJ 


Human germline kappa-immunoglobulin variable region (V-kappa-H) 

gCllw Vtt 


0 95 


4688 


XM_030896 


JLJ.OHIO sapiciia aiiiniqi i\j uypuuicucai proiem ivivj\_^z> sapiens^ 
(LOC90321), mRNA 


7E-32 


4689 


AF0R1064 


NAii^ fTmQPii1ii<» "njurillin-lilcf nrfitfifi fWicS^ ap.Tip pnTTinlpfp pffc 


e-103 


4690 


VS794R 

AJ / ZtO 


L.helveticus galK, galT and galM genes for galactokinase, galactose- 1- 

i UllUjrl U LllLOi.V<l dot OilU JLilUKUVIUlo^ ^JtXL LU11 IrViB 


0.003 


4692 


AF07409O 


Mus musculus integral membrane protein 2 A (Itm2a) gene, complete 


0.059 


4693 


TI17^R 
J-/1 /ooo 


/"MnctrirliiiTn CPntimm oy^tiP 1 finr nliVhn —Irwin fwmr*l^tf» rrlc 

V^iUoUXUiUlll svpULrlUll gSsLKs oipJUla. LVIAlli, UVIllipiwLv WD 


0.08 


4694 


AFORQOd.7 


Molothrus badius cytochrome b (cytb) gene, mitochondrial gene 

encuuing Huiucuuuuriai pruicm, partial cus 


0 076 


4695 


AXTOO/1^"* 


Mus musculus adult male lung cDNA, RIKEN full-length enriched 

1iV\-rori7 f1rvn#»* 1 900007077 -frill itic#»r* c(*mtf*nne* 

uurary, cxone. lz\jkj\j\)z\jj«z,, iuii insert sequence 


0 fS\ 


4696 


AI70A7570 


Bacillus thuringiensis insecticidal crystal protein (NRcry V) gene, 
complete cus 




4697 


AF049489 


Cams familiaris factor VIII mRNA, complete cds 


6.3 


4698 


X57599 


S.xylosus xylR, xylA, xylB genes for Xyl repressor, xylose isomerase, 
xylulokinase 


0.024 


4699 


L04873 


jvlus musculus cystic nurosis rransmemDrane regulator yy^r try; gene, 
exon 1, and upstream region 


0.25 


4700 


XM_041483 


Homo sapiens hypothetical protein MGC15827 (MGC15827), mRNA 


2.2 


4701 


U30355 


r^xocanuum caruiagineurn complete internal transcriDeu spacer region 
including ITS 1, 5.8s rDNA and ITS 2 


0.082 


4702 


AF346599 


Ljaiius gauus myosiatin {ivio iin/ gene, exons i,z anti j anu complete 
cds 


1.8 


4703 


YTV/f 0071 151 


Unmn canipnc A IHnsiCf* rPP1<TA^ orrnnnr nrntpin 1 1 ( AT£ API \ \ mPNA 
xlUIiiU odpiwilo *\ mxioDw ^riuvrij aliciiui UiVJLCili x a ^/\ivf\r a x /, jiiuxin^t. 


6.2 


4704 


"VTVyf A-AO^Q^ 


Homo sapiens superkiller virahcidic activity 2 (S. cerevisiae homolog)- 

llKe ^OlVlV^J-*/, UllvLN.rY 




4705 


/ o fyi 


xi.5cipienb xiuw-MJi lcu cm uiiiuounie o numiii jxd^iuciiL, ocu|/-rvxun.u 


2E-08 


4706 


AE006402 


Lactococcus lactis subsp. lactis IL1403 section 164 of 218 of the 
complete genome 


0.7 


4707 


X97876 


H.sapiens EP4 prostaglandin receptor pseudogene C17 


6.6 


4708 


AP001438 


Homo sapiens genomic DNA, chromosome 21q22.2, clone:T695, 
LB7T-ERG region, complete sequence 


0.0001 


4709 


XM 049375 


Homo sapiens WW domain binding protein 4 (forrnin binding protein 
21) (WBP4), mRNA 


3E-16 


4710 


AB016828 


Luehdorfia longicaudata mitochondrial ND5 gene for NADH 
dehydrogenase subunit 5, partial cds 


0.45 
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Table 3 A Nearest Neighbor (BlastN vs. GenbanlO 


SEQ IE 
NO 


ACCESSN 


rye c np tt} 


P VALUE 


4711 


U23439 


Rovine hprneovimc A 1 1_lrH m"DXTA 

JLJUYlllw llbipwsVUUo X » 1 IxU lllrv [y f\ 


0.64 1 


4712 


AP001438 


Homo sapiens genomic DNA, chromosome 21q22.2, clone:T695, 
/ x x^evvj lvgivjn, ^umpieie sequence 


0.00003 


4713 


AP000378 


Arabidopsis thaliana genomic DNA, chromosome 5, BAC 
clone *T4M5 


0.24 


4714 


AC064850 


iiuiuu da^iwa wuue rvx x i*n /rur, complete seQiience 


0.009 


4715 


Z93943 


limuoii xsxvrx dwqucxitc xxuiil UUbllllU U^o JxlO Oil CnTOmOSOme 


0.0U6 


4716 


AJ132901 


Drosophila guanche mitochondrial A+T-rich region 


0.052 


4717 


AE006291 


Lactococcii*; larti*; onHcn 1 antic TT 1 JYY* coMmn A f>io svp+tm 

complete genome 


f\ 1A 


4718 


AF227741 


Rattus norvegicus protein kinase WNK1 (WNK1) mRNA, complete 
cds 


o.l 


4720 


Z73524 


S.cerevisiae chromosome XVI reading frame ORF YPL168w 


0.072 


4721 


AF142484 


Sebastes melanon*; micrn^fltpllitp Cmp 1 ) cpmipnrA 


U. ly 


4722 


X51938 


D. melanogaster DNA for a region 3' to the white gene, corresponding 
to the FB-NOF insertion site iti TF 1 d£> 


L 


4723 


Z80361 


H saniens HLA— RRR rKPriflfwpnf* rmpat iv^o-inn* 




4724 


AF3 12620 


Plasmodium vivax clone C19b thrombospondin-related adhesive 
nmtein ffene rwirrijiT rvfc 


2.1 


4725 


XM 017198 


xiajiiiu ^dpicua iiypuuieuvvdl pruicin JrJuJlzUoD {rLtJ LZ\)oz>) 9 mKJNA 


1E-35 


4726 


AJ408943 


xiomo bapiciib cnromosomai oreaKpoint rragment derivative 11, 
MLL/AF4 fusion, patient UPN051 


0.001 


4727 


XM 050754 


xauiiiu actpicxia JVJLr\r\.l jy*t proicin ^JvaAAlDy*rJ, ITLx\JN/\ 


0.5 


4728 


AE002938 


Drosophila melanogaster genomic scaffold 142000013385520, 

convnletf* ^pniipririP 


0.008 


4729 


AE001390 


Plasmodium falciparum chromosome 2, section 27 of 73 of the 

comolete <*ennencp 


0.21 


4730 


AK000109 


Homo sapiens cDNA FU20102 lis, clone COL04783 


2.1 


4731 


AF163325 


x xx\sxx\j\j\,kxiia. 11x11 zjctiimii liu luuiiQiiujiai piasiTUQ pinri, complete 
plasmid sequence 


0.028 


4732 


AL137277 


xiuiiiu adpiciib iiiiviN/v, cuxsj\ j^jsj*z<p*i'j*fivii 1 14 (rroni clone 
DKFZp434M1114) 


1.9 


4733 


U43145 


Plasmodium chabaudi repeat organellar protein gene, complete cds 


0.64 


4734 


X71875 


Hsapiens gene for protein serine kinase PSK-H1, first coding exon 


1.7 


4735 


AE001479 


nciiuuuauier pyiori, aixaiii jyy secuon 01 Y52, 01 uie complete 

genome 


0.74 


4736 


M29586 


Elops saurus Ig germline H-chain pseudogene V-region, partial cds 


0.66 


4737 


AK020838 


Mus musculus adult retina cDNA, RIKEN full-length enriched 
library, clone: A930009F24, full insert sequence 


0.025 


4738 


AB038343 


Melon yellow spot virus NSs and N genes for nonstructural protein 
and nucleoprotein, complete cds, isolate:Tospo-melo 


0.68 


4739 


AB005130 


Candida maltosa gene for EPD1, complete cds 


0.25 
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SEQIE 
NO 


> 

ACCESSN 




P VALUE 


4740 


AF361581 


/vidoiuopsis mauana Ai4gJ5320/F23E12 120 gene, complete cds 


1.9 


4741 


Z75032 


o.^crevibiae cnromosome A V reading frame ORF YOR124c 


0.025 


4742 


U17681 


Human neurofibromatosis 1 (NF1) gene, exons 16, 17, 18 and 19a 


0.45 


4743 


AK025048 


nuiuo sapiens cljjna. ruzuvo fls, clone COL03557 


0.66 


4744 


AF198444 


Homo sapiens 10q21 mRNA sequence 


0.008 


4745 


AE005556 


nscnencma con uid /.Hi HUL933 genome, contig 3 of 3, section 175 
of 290 

VIA 


0.71 


j 4746 


X74784 


M.musculus mk2e mRNA 


! 0.17 


4747 


AB029480 




0.077 


4749 


AF072932 


Medicago sativa cold acclimation responsive protein CAR1 (CAR1) 

"UUirtj ^A/XlipiClC tub 


0.66 


4750 


AF176056 


Capitonus sp. 16S ribosomal RNA gene, partial sequence; 
•iLULuwuuiuuidi gene ior miiocnonanai product 


0.009 


4751 


X03366 


duvuic bpiccu trypsin lnniDitor u (ol) gene 


0.12 


4752 


L22951 


Human immunodeficiency virus type 1 (UG275) proviral DNA 
eiicuuiug eiiv, lot, vpu, rev, ana net genes 


0.23 


4753 


U95851 


Amoiyomma aarwim izb nbosomal RNA gene, mitochondrial gene 
xi/i xiiiiuwiiuiiuxidi jcvina, paruai sequence 


5.4 


4754 


AE001372 


Plasmodium falciparum chromosome 2, section 9 of 73 of the 
complete sequence 


2.2 


4755 


AF147813 


rroirama raaicis elongation factor 1 alpha gene, partial cds 


0.24 


4756 


U77974 


Tribolium castaneum transcription factor homolog (Tc-eve) gene, 


0.003 


4757 


XM 045423 


Homo sapiens KIAA0701 protein (KIAA0701), mRNA 


6.3 


4758 


J04357 


nea ciover necrotic mosaic virus RNA-1, complete sequence 


0.96 


4759 


XM 010136 


Homo sapiens O-linked N-acetylglucosamine (GlcNAc) transferase 
^uiJt'-jN-acetyigiucosamme^olypeptide-N-acetylglucosammyl 
transferase) (OGT), mRNA 


0.69 


4760 


XM 030855 


Homo sapiens sunilar to KIAA1361 protein (H. sapiens) (LOC90315) 
mRNA 


0.008 


4761 


AY031575 


in v-i isolate NC5000-1999 from USA pol polyprotein (pol) gene, 
partial cds 


6.2 


4762 


XM 007592 


Homo sapiens Homer, neuronal immediate early gene, 2 (HOMER- 
2B), mRNA 


2.2 


4763 


AC016707 


Homo sapiens BAG clone RP1 1-22 1K4 from Y, complete sequence 


0.75 


4764 


AF001848 


Oxytricha nova phosphoglycerate kinase (PGK) gene, complete cds 


5.7 




] 

BC006693 


Mus musculus, clone MGC:7897 IMAGE:3582679, mRNA, complete 
cds 


1.9 


4766 


1 

L81574 


Homo sapiens (subclone 7_gl from PI Hll) DNA sequence, complete 
sequence 


0.00003 


4767 


U67508 


Vfethanococcus jannaschii section 50 of 150 of the complete genome 


0.23 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


4768 


AY021494 


Orvza sativa microsatellite MRG381 9 containing (TA1X27 jypnnmir 
sequence 


0.023 


4769 


L75945 


Borrelia burgdorferi fla pell ar hook nrotei n ( fl ^ flhD fl a crpl 1 »r 
motor apparatus (motAB), fliL, fliM, fliZ, flagellar export apparatus 
(fliPQR, flhB), flhF, flbE genes 


0.76 


4770 


AL590606 


S.pombe chromosome n BAC pB7E8 


0.068 


4771 


AF060728 


Conospermum mitchelii chloroplast atpB-rbcL intergenic spacer 
region, partial sequence 


0.2 


4772 


AB056833 


Macaca fascicularis brain cDNA clone:QflA-14255, full insert 
sequence 


0.068 


4773 


AP001416 


Homo sapiens genomic DNA, chromosome 21q22.2, clone:D34, 
LB7T-ERG region, complete sequence 


0.0003 


4774 


XM 007838 


Homo sapiens nuclear factor of activated T-cells 5, tonicity-responsive 
(NFAT5), mRNA 


0.59 


4775 


AC024777 


Caenorhabditis elegans cosmid Y42H9AR, complete sequence 


0.74 


4776 


AF333186 


Dictyostelium discoideum beta-alanine synthase (pyd3) mRNA, 
complete cds 


0.057 


4778 


DO 1021 


Anthocidaris crassispina mRNA for dynein beta-heavy chain, 
complete cds 


0.69 


4779 


U65750 


Xenopus laevis fork head related (XFD-1 1 ) gene, complete cds 


0.079 


4780 


NM_008176 


Mus musculus GROl oncogene (Grol), mRNA 


1.8 


4781 


M88115 


Hylobates lar Myc gene, complete cds 


0.73 


4782 


AP000388 


Arabidopsis thaliana genomic DNA, chromosome 3, PI clone :MRI 12 


0,022 


4783 


AE007246 


Sinorhizobium meliloti plasmid pSymA section 52 of 121 of the 
complete plasmid sequence 


6.7 


4784 


XM_043069 


Homo sapiens KIAA1464 protein (KIAA1464), mRNA 


0.08 


4785 


U41482 


Human Down Syndrome region of chromosome 21 genomic sequence, 
clone A39D1-1H8 


0.001 


4786 


AL1 10134 


Homo sapiens mRNA; cDNA DKFZp564H072 (from clone 
DKFZp564H072) 


0.058 


4787 


XM_005044 


Homo sapiens KIAA0628 gene product (KIAA0628), mRNA 


0.002 


4788 


XM 052430 


Homo sapiens E3 ubiquitin ligase SMURF1 (SMURF1), mRNA 


2E-30 


4790 


NM 031320 


Rattus norvegicus cadherin EGF LAG seven-pass G-type receptor 3 
(Celsr3), mRNA 


1.9 


4791 


AF184590 


Plasmodium falciparum AD A2-like protein gene, partial cds 


0.009 


4792 


AF061434 


Vicia faba amino acid transporter a (AAPA) mRNA, partial cds 


1.9 


4793 


AL1 17594 


Homo sapiens mRNA; cDNA DKFZp564C1563 (from clone 
DKFZp564C1563) 


0.074 


4794 


AJ133025 


Megaselia abdita zen gene, exons 1-2 


0.02 


4795 


XM 033087 


Homo sapiens hypothetical protein FLJ20359 (FU20359), mRNA 


0.024 


4796 


AE001462 


Helicobacter pylori, strain J99 section 23 of 132 of the complete 
genome 


1.1 


4797 


XM_042814 


Homo sapiens hypothetical protein FU13397 (FLJ13397), mRNA 


0.063 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 




P VALUE 


4798 


AL137325 


Homo sapiens mRNA; cDNA DKFZp434M0835 (from clone 


2E-14 


4799 


AB030620 


mus museums uaetk-alpna gene for cnoline/ethanolamme kinase- 
dipiia, exon % d 7 o, /, o 


0.64 


4800 


AB037417 


v-Fiyza sauva gene ior aspartate carbamoyl transferase, complete cds 


0.026 { 


4801 


AK024522 


Homo sapiens cDNA: FLJ20869 lis, clone ADKA02377 


0.21 


4802 


Y62463 


H. sapiens 3 flanking region for estrogen receptor (placenta) gene 


0.7 


4803 


AE001127 


Borrelia burgdorferi (section 13 of 70) of the complete genome 




4804 


Z70751 


uaenornauams eiegans cosnud r 1 1 A3, complete sequence 


0.22 


4805 


XM 038231 


Homo sapiens KIAA0793 gene product (KIAA0793), mRNA 


6E-53 


4806 


AF223425 


iviu2» mubcuius ivi/\i3zii-iz VMAD211.Z) gene, complete cds 


4.5 


4807 


XM 039048 

./VIVA \7 J 7U*rO 


Homo sapiens hypothetical protein MGC11303 similar to Zink 
uaiisporier z vivavjv^I mKJN A 


2 


4808 


T 175604 

U / JUv*r 


i iiapia mionca type i estrogen receptor mRNA, complete cds 


0.057 


4809 


-/VIVA UIJ^J 


Homo sapiens stimulated trans-acting fector (50 kDa) (STAF50), 

mPXJ A 
mtvXNA 


1.7 


4810 


nTZf /5r /U 


Caenorhabditis elegans cell cycle checkpoint protein Rad9 gene, 
complete cds 


0.008 


4811 


AlZj /** 


Farsiey FcrKl-2 mRNA for pathogenesis-related protein type A 


0.077 


4812 


AF332562 j 


Simian virus 40 strain 777, complete genome 


0.64 


4813 


.AlVA UUJ1I5 


Homo sapiens general transcription factor HE, polypeptide 2 (beta 
SUDumt, (vjlrziiz), mRNA 


1 


4814 


UV^UuU I\J\J 


Mus musculus, Similar to RIKEN cDNA 2310035M22 gene, clone 
ivnj^.ooou iiYiAvrti.3!)Oi2o!>, mRNA, complete cds 


0.21 


4815 




Yaba monkey tumor virus DNA, BamHl restriction fragment E, M 
ana paruai partial and complete cos 


0.18 


4816 


WLVA U*tO*tJ J 


Homo sapiens ATPase, H+ transporting, lysosomal (vacuolar proton 

mitring cttVtu-nif- 1 /ATD/COl\ mDXTA 

pump;, suounii i (Aiirooi), mKJNA 


0.55 


4817 


Y11095 


Rice stripe virus RNA 3 


5.9 


4818 


AT SR0001 


Human DNA sequence from clone RP1 1-241 J5 on chromosome 6, 
complete sequence [tiomo sapiens j 


3E-78 


4819 


XM 043498 


Homo sapiens G protein-coupled receptor 44 (GPR44), mRNA 


0.0002 


4820 


W1VA U*f /*f / / 


riomo sapiens iviAAloJd protein (KIAA1638), mRNA 


0.077 


4821 




Streptococcus thennophilus ORF1, ORF2, ORF3, ORF4, ORF5 and 
UKro i/iN A, strain otu 


0.18 


4822 


XM 016395 


Homo saniens region containing hvnotherical rvrntpin- rrmtT (V* r*eA\\ 
homolog 3 (LOC82389), mRNA 


6E-74 


4823 


XM 043492 


Homo sapiens 42259 (KIAA1728), mRNA 


0.0000006 


4824 


L43603 


Gallus gallus Na+/K+-ATPase alpha- 1 subunit gene, promoter region 


0.41 


4825 


AF257746 


Rattus norvegicus multidrug resistance protein la (Pgyl) mRNA, 
complete cds 


0.5 


4826 


X68794 


D.melanogaster "ten-a" mRNA for secreted protein 


1.2 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


4827 


AF330118 


Homo sapiens voltage-gated sodium channel type HI alpha subunit 
(SCN3A) gene, exon 2 


0.12 


4828 


XM 011230 


Homo sapiens ALL1 fused gene from 5q3 1 (AF5Q3 1), mRNA 


5.8 


4829 


XM 030669 


Homo sapiens hypothetical gene supported by AL13301 1 
(LOC90288), mRNA 


1.9 


4830 


[ U36927 


Plasmodium yoelii rhoptiy protein gene, complete cds 


0.21 


4831 


AF201353 


Columba livia glutamate receptor C gene, partial cds 


5 


4832 


AF331435 


HIV-1 D2 from Australia envelope protein (env) gene, partial cds 


0.07 


4833 


AF362391 


Candida albicans heat shock protein Hspl04 (HSP104) gene, HSP104- 
b allele, complete cds 


1.1 


4834 


XM 027927 


Homo sapiens hypothetical gene supported by Z97017 (LOC89989), 
mRNA 


0.48 


4835 


AE001127 


Borrelia burgdorferi (section 13 of 70) of the complete genome 


0.72 


4836 


AL390126 


S. pombe chromosome IQ cosmid c233 


0.023 


4837 


AF110213 


Rattus norvegicus insulin receptor precursor, exon 14 


0.56 


4838 


M38534 


Paramecium aurelia immobilization antigen (51C) gene, exon 5 and 
complete cds 


0.015 


4839 


AE001430 


Plasmodium falciparum chromosome 2, section 67 of 73 of the 
complete sequence 


0.052 


4840 


AF181475 


Mus musculus synaptonemal complex protein 3 (Syq>3) gene, exon 5 


0.19 


4841 


AF177242 


Bodo saltans clone pBME40 mitocondrial minicircle DNA 


0.024 


4842 


XM 004347 


Homo sapiens elongation of veiy long chain fatty acids (FENl/Elo2, 
SUR4/Elo3, yeast)-like 2 (ELOVL2), mRNA 


0.66 


4843 


AF083494 


Mus musculus phospholipase Dl (PLD1) gene, exons 23 and 24, 
complete sequence 


0.53 


4844 


U89684 


Lycopersicon esculentum protein kinase (LePK7) gene, partial cds 


0.22 


4845 


AB025716 


Staphylococcus aureus gene for FmtB, complete cds 


0.57 


4846 


AF017180 


Schizosaccharomyces pombe hexose transporter (Ght2) mRNA, 
complete cds 


0.87 


4847 


Z26314 


P.falcipannn gene for STARP antigen 


0.007 


4848 


XM 045160 


Homo sapiens KIAA0275 gene product (KIAA0275), mRNA 


1 


4849 


AK016495 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4931431C02, full insert sequence 


0.62 


4850 


BC004496 


Homo sapiens, Similar to hypothetical protein FLJ14058, clone 
IMAGE:3831313, mRNA 


1.7 


4851 


XM 005684 


Homo sapiens tolloid-like 2 (TLL2), mRNA 


1.6 


4852 


AF249077 


Sphaerotheca pluvialis cytochrome b (cytb) gene, partial cds; 
mitochondrial gene for mitochondrial product 


0.047 


4853 


XM 016934 


Homo sapiens hypothetical protein FLJ12891 (FLJ12891), mRNA 


5.4 


4854 


X65747 


Rnorvegivus gnat-3 mRNA for gustducin 


0.2 


4855 


XM 011705 


Homo sapiens paraneoplastic antigen MA2 (PNMA2), mRNA 


0.2 
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Table 3 A Nearest Neighbor (BlastN vs. Genhanlrt 


SEQH 
NO 


ACCESSN 


DESCRIP 


J P VALUE 


I 4856 


J L10692 


AiaDidop^ia Uudiaiia uytosineo metiiyltransferase mRNA, complete 
cds 


! 2.1 


4857 


1 AB052250 


AiauiUoiJ!,i!, Uialiana gene tor MYB transcription fector Atnryb2 
complete cds, strain: Yo-0 


0.36 


4859 


XM_033535 


Homo sapiens region containing tryptase beta 2; tryptase beta 1 
(LOC82314), mRNA 


f 1-3 


4860 


U49846 


Kaiiui uurvcgicus stem ceil lactor gene, 5' flanking region and exon 1 
partial sequence 


>\ 

0.18 


J 4861 


I AJ244021 


Tetrahymena thermophila kin2 gene for kinesin-n homologue, exons 
1-4 


J 0.063 


4862 
4863 


AF201021 
AJ245636 


Stachyarrhena sp. Jansen-Jacobs 4707 ribosomal protein S16 (rpsl6) 
gene, partial intron sequence; chloroplast gene for chloroplast product 
Mus musculus P2 Yl gene for P2Y1 receptor 


j 0.057 
0.2 


4864 

4865 
I 4866 


1 AK023175 

XMJ)34638 
| AB011006 


Homo sapiens cDNA FLJ13 1 13 fis, clone NT2RP3002590 
Homo sapiens hypothetical gene supported by AK024355 
(LOC90867), mRNA 


0.2 
| 1E-08 


4867 
4868 


XM_031128 
AK001442 


Rattus norvegicus gene for leptin receptor, partial cds 

Homo sapiens hypothetical protein DKFZp761I172 (DKFZP761I172) 
mRNA * 


0.22 


4870 


AF307954 


noiuu sapiens cL>NA bL) iu:>80 lis, clone NT2RP2003533 
Saccharonryces servazzn 46. 1 kDa protein, KAR4-like protein, and 
SPBl-like protein genes, complete cds 


2E-12 
0.19 


4871 
4872 


AF055079 
X88849 


Panulirus argus inositol 1,4,5-trisphosphate receptor (IP3R) mRNA, 
complete cos J 


6 


4873 


XM 039699 


C.coli ceuB, ceuC, ceuD, ceuE, OrfA, OrfB genes 

Homo sapiens hypothetical protein FLJ23309 (FLJ23309) mRNA I 


0.0008 
3E-42 


4874 


AF212847 


Lactococcus lactis bacteriophage ul36.2 ORF1 1 lb (ORF1 1 lb) 
ORF78b (ORF78b), ORF61b (ORF61b), ORF245 (ORF245), ORF364 
(UKF3o4), and putative replisome orgamzer (ORF255) genes, 1 
complete cds; and ORF241 fORF241) gene, partial cds I 


1.9 


4875 
4876 


XMJ)52223 ] 
AK024606 ] 


tiomo sapiens ujs^z,F564Cj092 protein (DKFZP564G092), mRNA 
Homo sapiens cDNA: FLJ20953 fis, clone ADSE01979 | 


0.69 


4877 
4878 


( 

U96097 < 
Z48599 ] 


^laviUautcr micluganensis subsp. sepedonicus species-specific DNA 
done Cms85 

-mexicana Imcpc gene cathepsin B-like cysteine proteinase 1 


3E-89 
0.47 

U.UO.5 


4879 
j 4880 1 


NM014160 1 
I 

«r\XVl_U J 1 J oo £ 


lomo sapiens HSPC070 protein (HSPC070), mRNA J 
lomo sapiens NADH dehydrogenase (ubiquinone) 1 alpha i 
nbcomplex, 5 (13kD, B13) (NDUFA5), mRNA | 


5 
0.55 


4881 


I 

AF059531 i 


lomo sapiens protein arginine N-methyltransferase 3 (PRMT3) 
nRNA, partial cds ! 


0.18 


| 4882 1 


( 

AF240700 r 


jymnocrotaphus curvidens cytochrome b (cytb) gene, partial cds; 
aitochondrial gene for mitochondrial product 


0.65 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQK 
NO 


) 

ACCESSN 




P VALUE 


4883 


AE005988 


^auiuua^ici urawjuiub secuon j 14 oi 3->y oi tne complete genome 


2 


4884 


AF332785 


Neocalanus plumchrus cytochrome oxidase subunit I (COI) gene, 
partial cds; mitochondrial gene for mitochondrial product 


0.65 


4885 


AJ278986 


Pichia etchellsii piasmid pPElB 


0.051 


4886 


AL5 12660 


xt mi liiii j^ina sequence irom cione Kriio3JM9 on chromosome 10, 

COTTI nipt P IPnilPnPP rWnror* conionrl 


6.3 


4887 


AF147082 


Homo sapiens gamma-glutamyl hydrolase gene, exons 3 through 7 


2 


J 

1 4888 


AL389874 


rauiiidu i>iin/\ sequence rrom clone Krl 1-563A22 on chromosome 20 
^uiiLdni^ uic j ena oi a novel gene weaKiy similar to Xenopus laevis 
puiauvk/ £uis*j uuiuiug proicui, oompieie sequence |riomo sapiens I 


0.008 


4889 


AC084157 


Caenorhabditis elegans cosmid Y46E12BR, complete sequence 


0.084 


4891 


AE001183 


uyjiwia vtuguuixdi i&cuiiuii ay or /uj oi lqc complete genome 


0.66 


4892 


AE002137 


LJicajpid&iiid. uicdiyucum section Jo oi oi tne complete genome 


0.52 


4893 


XM 028606 


rxtuuu a<ipiciib opi uanscnpiion iacior (b"lj, mRNA 


5 


4894 


NC 001453 


Strongylocentrotus purpuratus mitochondrion, complete genome 


6 


4895 


AK023104 


xaumu aapicub wl/in/\ ri^j IIS, Clone l-ZKlr3UU131o 


0.064 


4896 


XM 015243 


Homo sapiens stimulated trans-acting factor (50 kDa) (STAF50), 


1.7 


4897 


XM 043823 


nuiiMj aapiciib iiypuuieucai protein lu-JZUiOo (rLJ2U93o), mRNA 


0.000004 


4898 


AB049900 


Macaca fascicularis brain cDNA, clone:QnpA-19713 


6.1 


4899 


AE001390 


x-idi»iiiuuium idicipaium enromosome z, section 27 of 73 of the 
complete sequence 


0.23 


4900 


AE007346 


ou*;pLvjtAJw**ii& pneumoniae secuon oi i>4 oi tne complete genome 


0.73 


4901 


X17339 


Dengue-2 virus NS1 gene for nonstructural protein (patient M2) 


0.24 


4902 


AK000864 


Homo sanien«; cDNA F"! 110009 fic H**™* I-TRA/to a mnnn/nc 

ixvixivr oa^igiu v*L/r>i.rv rLJiUUUZ 115, Clone JtllllYLDAlUUUU^K) 


0.00001 


4903 


AF1 80076 


Panorpa helena cytochrome oxidase I (COI) gene, partial cds; 

uuLuwiiv/uui iai gciic 1UI liULUUllvJIlViricu pFOQUCl 


0.6 


4904 


AC008855 


Homo sapiens chromosome 5 clone CTD-2178M23, complete 
sequence 


3E-12 


4905 


K02990 


Human hepatitis A virus, complete genome 


0.23 


4906 


AE005310 k 


crbLriicncuia con uio / .n/ iiui^yj j genome, contig 1 of 3, section 134 
of 155 


0.078 


4907 


NM 032717 ] 


Homo sapiens hypothetical protein MGC1 1324 (MGC11324), mRNA 


0.68 


I 4908 


] 

BC004118 < 


Homo sapiens, clone MGC: 11 170 IMAGE:3843148, mRNA, 
complete cds 


1.5 


4909 


] 

L29252 < 


Human (clone D13-2) L-iditol-2 dehydrogenase gene, exon 4, exon 5, 
jxon 6 and exon 7 


0.0000001 


1 4910 


i 

S68117 < 


-PLP-A=prolactin-like protein A {5 f region, exon I, intron 1} [rats, 
3enomic, 1187nt] 


0.11 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


4911 


NM 016701 


IMllS TTlll^niliiQ npcfifi /"W^c\ ■m'R M A 


0.21 


4912 


AE006418 


i^duiuixjccus lacus suDSp. iacus UL1403 section 180 of 218 of the 

pomnlptr* opnnmp 


0.009 


4913 


XM 041162 


Homo sapiens hypothetical protein KIAA1165 (KIAA1165), mRNA 


4E-50 


4914 


AF147358 


numu ^dpicub mil icngin insert gl/jna. clone Yi3olxiu3 


0.00003 


4915 


AE000642 


1 iwiiL/vum, lci pyiun AOD7J occiion izu oi i j4 oi tne complete genome 


1.9 


4916 


XM 040436 


Homo sanien*; nphtilin t NIKR* mPXTA 


2 


4917 


AB026296 


riotuu o<x lav lluj. iiixvin/tl lux iihizxioj nomoiog, complete cos 


0.024 


4918 


AF159148 


Danio rerio Noggin 2 mRNA, complete cds 


0.23 


4919 


AF234184 


i^uv/iiici uupniid hCA-icLiiai proicin oaj-iZ nuviN a, complete cds, 
alternatively spliced 


0.026 


4920 


AK007229 


iviub iiiubouiiib duuii maie lesus clwn a, KiJviiJN lull-lengtli enriched 
library, clone: 1700121L03, foil insert sequence 


0.21 


4921 


AJ404228 


o<iv^iuuuiuycea uougiasii nniocnononai lkjn a gene cluster 


0.021 


4922 


X71604 


H.sapiens son-pseudogene 


0.53 


4923 


AF169122 


nemo sapiens ujna (cytosmeo) metnyltransterase gene, nitron 4, 

PcUllcU otLJ[LlCllL>C 


0.059 


4924 


XM 040202 


Homo sapiens protein phosphatase, EF hand calcium-binding domain 


6E-09 


4925 


AY027893 


Homo sapiens voltage-dependent calcium channel beta 2 subunit 
\^r\x^iriDA) gene, exon i 


3.3 


4926 


AF136008 


l^esuixoDacienum sp. t>ov4i lob nbosomal RNA gene, partial 

aCtJUCiiec 


5.6 


4927 


NM_007439 


Mus musculus anaplastic lymphoma kinase (Alk), mRNA 


0.071 


4928 


XM_006758 


xiuuiu bdpiens inzziea (L/rosopniiaj nomoiog 10 (tZjj 10), mRNA 


2 


4929 


AL139329 


numan ujna sequence irom clone KPl 1-228P1 on chromosome 6, 
i^uuipicic sctjueiiGG ixiomo sapiens j 


4E-38 


4930 


L03188 


odiuiuiroiiiyces cerevisiae lniegnn analogue gene, complete cds 


0.23 


4931 


AB039881 


*-^mib Admmaris mtuN/v ior cyiocnrome r p 43Uczi, complete cds 


0.55 


4932 


AK004783 


Mus musculus adult male lung cDNA, RIKEN full-length enriched 

libra TV r*1nTIP* 1 9000 1 1 A fiill incprt cpnnpnro 

j-LL/icLi y , Viiijiic. jd*r, luii hi ben sequence 


6.2 


4933 


U41343 


Human nrnbrain /pRPT rr^n<a a V An 0 
iiLuiuui piuidigiu vxTVErj-frj gene, exon 2. 


0.0000004 


4934 


AF157369 


xivuiiu actpiciib iNicuuinn-x^icK \~ri proiein ^jnx^l^i ) gene, exon j 


0.58 


4935 


XM_0 16505 


Homo sapiens KIAA0999 protein (KIAA0999), mRNA 


9E-26 


4936 


U07807 


Human metallothionein IV (MTTV) gene, complete cds 


0.008 


4937 


NM_018818 


Mus musculus choroidermia (Chm), mRNA 


1.7 


4938 


BC009075 


Mus musculus, bem-l,3-N-acetylglucosaminyltransferase 1, clone 
MGC:6892 IMAGE:2654354, mRNA, complete cds 


1.2 


4939 


AF304354 


Homo sapiens proteoglycan 3 (PRG3) gene, complete cds 


0.24 


4940 


AK000864 


Homo sapiens cDNA FLJ10002 fis, clone HEMBA1000046 


0.00001 


4941 


AF076601 


Dictyostelium discoideum developmental protein (DG1105) gene, 
partial cds 


0.95 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VAT T TP 


4942 


AF025542 


BombvX mori insulin recentor-l i Icp nmfpiii nrp_/*nrcnr /TOiT?^ inPMA 

complete cds 




4943 


XM 038714 


Homo saniens hvnothetical nrotein fLTC7F16 S fn"n7Ri£ ^ m UMA 


O.J 


4944 


ACO 16707 


HomO SaDienS BAC Clone RP1 1 frrnn V rnmnl pff=> cpmipnrp 




4945 


XM 005684 


Homo sapiens tolloid-like 2 (TLL2), mRNA 


1.1 


4946 


AJ276170 


Rattus norvegicus partial Vbs541 nrocessed mendopene T pnH 


0 04.1 


4947 


AJ132986 


Vibrio sp. partial 16S ribosomal RNA, isolate A515 


5.9 


4948 


XM 009513 


Homo sapiens hvnothetical nrotein A3SPC207 1 mRNA 


0 71 


4949 


AF181668 


Homo sapiens clone 173pl7 polymorphic microsatellite sequence 


0.17 


4950 


AF045453 


Dictyostelium discoideum protein kinase YakA (yakA) mRNA, 
complete cds 


0 007 


4951 


AF279135 


Dictvostelium discoideum snore cost ^tnictiiral nrntHn SP/>S /rrrtP i 
gene, complete cds 


0.003 


4952 


AK012631 


Mus musculus 10 11 davs embrvo cDNA RT1CFN fii11-1f»n ath 
enriched library, clone:2810001G20, full insert sequence 


0.071 


4953 


AB047630 


Macaca fascicularis brain cDNA, clone:QnpA-13555 


0.69 


4954 


AJ011848 


Hordeum vulffare chloronla**t rn^l 5 frtartial i nrlhTT nrlhT nHliCr 

iiviuwuiii y m^ymv viuwivpiuul A V£ *** 1 S 1*1 ^ ^ XXCIXXXX, XUXX1X, I f\t 1 LVJ, 

ndhE, psaC, ndhD and ndhA genes 


0.054 


4955 


U34363 


Plasmodium falciparum CTRP gene, complete cds 


0 18 

V. xo 


4956 


AF220175 


Homo saniens acid ceramida^e fASATTl apnp Avnnc S thrmio-Vi l A. anH 
complete cds 


0.0009 


4958 


D26442 


Yeast gene for PES4 PAB-like nrotein. comnlete cd<s 


0 003 


4959 


NC_002184 


Penaeus monodon mitochondrion, complete genome 


L9 


4960 


XM 039251 


Homo saniens 37381 fKIAA1660i mRNA 


s 

j 


4961 


XM 004462 


Homo sapiens signal sequence receptor, alpha (translocon-associated 
jrotein alpha) (SSR1), mRNA 


0 72 


4962 


AF252420 i 


Bos taurus clone MNB-5 microsatellite sequence 


0 OS 

v/.vo 


4963 


XM_035186 


Homo saniens f S» f^P.rpvi a hnmnlna nrpnvl nmt^tn nrnfpacp 

(RCE1), mRNA 


2 


4964 


AL1 10134 


Homo saniens mRNA* cDNA DTCP7n^64H079 /'from Hnnp 
DKFZp564H072) 


0.058 


4965 


AL049452 


Homo saniens mRNA* cDNA DKF7n*?R6Pl 199 /'frnm r1rm*» 
DKFZp586C1322) 


0.67 


4966 


AB047614 


Macaca fascicularis brain cDNA, clone:QnpA-12025 


0.077 


4967 


XM 043315 


Homo sapiens KIAA0229 protein (KIAA0229), mRNA 


6.2 


4968 


AK017495 


Mus musculus 8 days embryo cDNA, RIKEN full-length enriched 
library, clone:5730405K23, full insert sequence 


0.025 


4969 


AB012141 


Candida glabrata TEF3 gene for translation elongation factor3, 
complete cds 


0.003 


4970 


AK022914 


Homo sapiens cDNA FLJ12852 fis, clone NT2RP2003445 


0.0003 


4971 


AF133211 


Homo sapiens COT kinase proto-oncogene, partial cds, alternatively j 
spliced 


0.008 
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Table 3A Nearest Neighbor (BlastN vs. Genbank ) 


SEQID 
NO 


ACCESSN 


DESCRIP 


T> \7A T TTD 

r VALUr, 


4972 


AF349681 


Simian immunodeficiency virus strain SIVrcmNg409 from Nigeria 5* 
* vii o W1.UUXUU icpcdi, yaiuai ssctjueiicc, gag protein (gag) gene, 
complete cds; pol protein (pol) gene, partial cds; and vif gene, 
complete sequence 


0.68 


4973 


AF3 50254 


Homo sapiens fibrinogen gamma chain precursor (FGG) gene, 
complete cds. alternativelv snliced 


0.21 


4974 


AF3 14052 


RhlZObilim SD CIAT613 nifTTc ppnp riai-rial Cf*nvtf*Ttr>t* 


i a 
1.9 


4975 


XM_036277 


Homo sapiens inhibitor of DNA binding 4, dominant negative helix- 
loop-helix protein (ID4), mRNA 


0.076 


4976 


AF057740 


Homo saniens 17-beta-hvdrnxv^t^rfiirf t\f*\wATr\ot*T\**ce± t\t /ttcth td/i\ 
gene, exon 24 and complete cds 


0.00003 


4977 


XM 051777 


Homo saniens similar to NADfPi nWw»nrfont ct«vrwisi rioViir^i^rranno^ 

llwmw &o.xjl\*w3 ammoi ivr iiruy^r / ucpciiuclll SlClOlQ QenyuFOgenaSe— 

like (M. musculusi fLOC93517i mRNA 


4Jb-ii 


4978 


AF3 14052 


Rhizobium sp. CIAT613 nifHc gene, partial sequence 


2 


4979 


BC010911 


nOnift COr*ipi-|C Similar f r\ drl<jr»tr\7- nrAfotn r*/vn+oim TT J tvtt> 

aav,iiiaj oo|/iona, oixuiLuii iu ctuopLur protein containing pxi Gomain, Jr l J5 
domain and leucine zipper motif, clone IMAGE:4295177, mRNA 


0.076 


4980 


AF2 14529 


Homo Saniens dvStronhin fDMT)> ppnp intiYvn SI reirrtal cArtiiAnra 


n Afti 
U.UUl 


4981 


AK024235 


Homo saniens cDNA hlj 1417*3 fi« rinn^» "Nrr^pp^nno^s 

xxvuiv dayiwiu wJ^iin x liJlti / -Lid, AN 1 Zl\r /Jill/ / Jj 


e-iU4 


4982 


AB035089 


Homo sapiens SCCA2 gene for squamous cell carcinoma antigen 2, 
complete cds 


ft 0*3 


4983 


M97555 


Schistosoma mansoni fronnmvocin ml? XT A mmnlAtP 


ft 11 


4984 


AE002745 


Drosophila melanogaster genomic scaffold 142000013385404, 
comnlete seauence 


ft ftftT 

U.UUj 


4985 


X99288 


B.calamita repeat region, clone-lib micro-8 


0.23 


4987 


AJ308840 


Coffea arabica microsatellite TTNTA Hnn*» 1 fc-Af^TCi 


ft AOC 

U.U25 


4988 


AB049134 


Cucurbita moschata CmATSl;l gene for acyl-(acyl-carrier-protein): 
£lvcerol-3-nhosnhate acvltran^rferacp rnmn1f»tf» rvlc 


ft ^"5 


4989 


X03991 


Human glucagon gene 


ft AT/f 

U.U/4 


4990 


AY042871 


Arabidopsis thaliana zinc finger-like protein (F3C22__200) mRNA, 
comnlete cds 


ft ATT 

U.077 


4991 


AF2 14529 


HomO S3t)ienS dv^trOnllin /T)A/f 1 ")1 CPflP ITltrnn ^1 rtarftol ronuanr>a 
xxv/iuu oapiviio VfjauvJ|JllXll ^J^/XVLL/^ gCllG, XI11XUH Ji, jJill 11 al SCQUenCC 


ft ftfti 
U.UUl 


4992 


AF269319 


Staphylococcus epidermidis strain SRI clone step.l002a02 genomic 
seauence 


2 


4993 


AB009801 


HflTTIft CflHI PT1Q (TPTIP friT nct^wn-i rl/"»<«»n intmn 

iiuiuv aapitua gwuc xui UbLCOIULlUgcll, llUXOn Z 


A ^A 

U.69 


4994 


M69146 


Torulopsis glabrata metal-activated transcription fector (amtl) gene, 
complete cds 


0.65 


4995 


AL121822 


Human DNA sequence from clone 346P1 1 on chromosome Xq21.2- 
21.33, complete sequence [Homo sapiens] 


5E-09 


4996 


AF072273 


Lactuca sativa resistance protein candidate RGC2N (RGC2N) 
pseudogene, complete sequence 


0.22 


4997 


AL358273 


S.pombe chromosome n cosmid c29C10 i 


0.22 


4998 


BC010011 


Homo sapiens, Similar to RIKEN cDNA 4931428D14 gene, clone 
MGC:15407 IMAGE:4309613, mRNA, complete cds 


0.22 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


4999 


AF090889 


Homo sapiens clone HQ0092 


I 0.71 


5000 


M93140 


Glycine max cv Prize protein kinase mRNA 




5001 


AE005266 


Escherichia coli 0157:H7 EDL933 genome, contig 1 of 3, section 90 
of 155 


1 ft 

l.o 


5002 


AE007350 


Streptococcus pneumoniae section 33 of 194 of the complete genome 


0 23 


5003 


BC008973 


Homo sapiens, transmembrane protein with EGF-like and two 
follistatin-like domains 2, clone MGC: 17007 IMAGE:4 180709, 
mRNA, complete cds 


0.023 


5004 


X13439 


Cochliobolus heterostrophus mitochondrial gene for ATPase subunit 6 


0.19 


5005 


AF138069 


HFV-1 p5c028-h from USA envelope glycoprotein (env) gene, partial 
cds 


2.1 


5006 


NM_002847 


Homo sapiens protein tyrosine phosphatase, receptor type, N 
polypeptide 2 (PTPRN2), mRNA 


5 


5007 


XM 036827 


Homo sapiens hypothetical gene supported by X04201; X04588; 
AK026559* AY004867* BC000771- BC008407- BP00842S 
(L0C91191), mRNA 


1 


5008 


NM 031389 


Mus musculus ribonuclease/angiogenin inhibitor 2 (Rnh2), mRNA 


1.7 


5009 


BC010355 


Homo saoiens. Similar to unci ear nrntein rlrvnf* TM'AfrF-d.^d.ftSlfi 
mRNA 


5.4 


5010 


AE003909 


Xylella festidiosa 9a5c, section 55 of 229 of the complete genome 


5.1 


5011 


NM 031389 


Mus musculus ribonuclease/angiogenin inhibitor 2 (Rnh2), mRNA 


1.7 


5012 


XM 016747 


Homo saniens latent transform in e Prnwth istcfnr hpfa hinHinp - nmt**in 
1 (LTBP1), mRNA 


2E-10 


5013 


M85304 


Schistosoma mansoni homeodomain nrotein fcmox-S^ mRNA 
complete cds 


0.009 


5014 


NC_001398 


Saccharorrryces cerevisiae 2 micron circle plasmid, complete sequence 


0.65 


5015 


AF267746 


Drosonhila melano caster rad21 mitotic cohesin (dradHW 2ene exon^ 
3 through 8 and complete cds 


0.003 


5016 


XM 052430 


Homo sapiens E3 ubiouitin liease SMLTRF1 fSMURFl , i mRNA 


2E-30 


5017 


X67813 


C.femiliaris SRP72 mRNA for signal recognition particle 


0.16 | 


5018 


AF031848 


Dermacentor marginatum 1 rihriQrYmal BNA apnp mitochondrial 

gene for mitochondrial RNA, partial sequence 


0.061 


5019 


XM 003317 


Homo sapiens PDZ domain containing guanine nucleotide exchange 
factor(GEF)l (PDZ-GEF1), mRNA 


0.67 


5020 


XM 030673 


Homo sapiens methyl-CpG binding domain protein 4 (MBD4), 
mRNA 


0.6 


5021 


L38957 


Saccharomyces cerevisiae nuclear-encoded mitochondrial isoleucyl- 
tRNA synthetase (ISM1) gene, complete cds 


2.1 


5022 


AF345089 


Hepatitis C virus isolate SCpreSC2cl4 nonfunctional polyprotein 
gene, partial sequence 


0.59 
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|seqid| 
no accessn 

^023 AC091124 

5024 X66451 

5025 XM 010563 

5026 AF256722 

5027 AF385609 

5028 AF355103 

5029 I XM 034863 

5030 I AF374005 

5031 AK014447 

5032 AL355577 

5033 1 AB030616 



Table 3A Nearest n^m^ (BlastN vs Genbank) 

— -,, DESCRIP 

Caenorhabdius elegans cosmid Y47A ; - 
Wcar.natus RPA2 gene tor RNA polymer I s^dla^jT 

Homo sapiens Hypothetical gene supported by AB033071- ARnsman 
1AK000726;NM 015383 (LOC9239fi\ ggff A AB051480; 
Salmo salar clon e HHMS458 microti*,, o^ .^ 
Homo sapiens minisatelhte sequence ~ — " 



0.066 

1E-15 

0.5 
5.3 

0.18 

1.8 



5034 



S66671 



Rattus norveg.cus bradykinin B2 recent o ene , promoter semien rft 

E? "T?* 18 Pregnant adult female placenta and extra 
embryonic tissue cDNA, RKEN full-length enriched library 
|clone:3830421F13, full insert sequent 

Ij^Tnf N ^r UeDCe , fi0m cIoneR PH-299G2 on chromosome 6 
Contains a GSS, complete sequence m nmn ^ 

Mus musculus Chetk-alpha gene for cholme/etnanolamine kinase- 
alpha, exon 1 and 5'-flanking region 

pol I [simianMason-l^rD-typerebo^ SM -PDT, systemic Judus 
erythematosus pati ent TFA 68. Genntni . pxta 113 ' ^ enuc ^ 

I I^l^sfnt f - * " . « <■ ^ 1 ' 



5035 AB038481 

5036 1 U05013 



AF242384 
X52863 

XMJ)02625 
U00789 

5042 I AB017429 

5043 I U67518 



5044 



5045 



5046 



5047 
5048 



X06266 



lucasii chloroplast *cL gene for nbulose-l,5-bisphosphate 
carboxylase/oxygenase large snbunit, partial cds 
Rattus^ aozvegtcus Spmgue-Dawley heme oxygenase-2 non-redncing 
lsoform gene, complete cds 6 

se q rnS PienS (SUbClODe AJ ^ fr ° m P1 H69) DNA complete- 

Homo sapiens phytanoyl-CoA hydroxy ^ (PHYH) gene, e xm, 7 

G.max Oy4 gene for glycinin ~ — ■ 

Homosapiens ankynn repeat and SOCS box^ntaining 3 (ASB3), 

Mus musculus D pV l (DMD) gene. ^rtiaTcds" 
Oryaa sativa gene lo r mitochondrial ribosomal portein S14 succinate 
dehydrogenase iron-protein subunit renwm ^succinate 



8.3 

0.18 

0.000 3 

6.1 
0.008 




0.22 
1.9 
1.7 
1.9 

_6_ 
0.068 



Anthirrhumm majus DNA for t^nsposon tam2 



0.25 



0.007 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 



|SEQID| 
NO 



ACCESSN 



DESCRIP 



P VALUE 



5049 
5050 



AF272387 Homo sapiens leucine zipper nuclear factor (BLZF1) gene, exon 6 



7E-22 



NCJ)01587 [Human papillomavirus type 34, complete genome 



0.25 



5051 NM_017078 Rattus norvegicus Acetylcholine receptor alpha 5 (Chrna5). mRNA 



0.64 



5052 



XM_030510 Homo sapiens microfibrillar-associated protein 2 (MFAP2), mRNA 



5053 



Z4843 3 L. odomensis vicilin pseudogene 



0.047 



Rhybrida 5-enolpyruvylshikimate 3-phosphate synthase mRNA 
5054 | M21084 [complete cds 



0.2 



5055 



U67518 



5056 AF215555 



Methanococcus jannaschii section 60 of 150 of the complete genome 
jBlakea trinervia NADH dehydrogenase subunit F (ndhF) gene, partial 
[cds; chloroplast gene for chloroplast product 



0.002 



5057 AE001379 



Plasmodium falciparum chromosome 2, section 16 of 73 of the 
[complete sequence 



0.63 



5.4 



5059 AF362391 



Candida albicans heat shock protein Hspl04 (HSP104) gene, HSP104 
|b allele, complete cds 



5060 AJ000056 jHomo sapiens tyrosine aminotransferase gene, 5' flanking region 



4.4 



0.57 



5061 API 12541 



HIV-1 isolate 08108v3 from USA, envelope glycoprotein (em) gene, 
[partial cds 



.0.007 



5062 AF300534 



5063 



J00571 



IDiadasia consociata cytochrome oxidase subunit I gene, partial cds; 
tRNA-Leu gene, complete sequence; and cytochrome oxidase subunit 
II gene, partial cds; mitochondrial genes for mitochondrial products 



Mouse Ig kappa unproductively rearranged V-T2; V-J region 



0.22 



0.001 



5064 
5065 



X95275 iRfalciparum complete gene map of plastid-like DNA (n£A) 



0.062 



Y09746 [Holigactis mRNA for heat shock protein 70 



0.46 



5066 1 X95536 |H. sapiens earl gene 



5067 
5068 



C.ellipsoidea rbcL, rpsl4, trnM, trnG, trnD, trnS, rps4, atpE, atpB 
D10997 jgenes, complete cds 



2E-64 



1.8 



5069 
5070 



Z73968 Caenorhabditis elegans cosmid C05A2, complete sequence 



XM_012733 Homo sapiens KIAA1012 protein (K1AA1012), mRNA 



0.67 



0.24 



AE000745 Aquifex aeolicus section 77 of 109 of the complete genome 



0.21 



Diploptera punctata high-affinity Na+-dependent glutamate 
transporter (EAAT1) mRNA, complete cds 



5071 I A F208521 

JHomo sapiens mannosidase, alpha, class 2B, member 1 (MAN2B1) 

5072 | XM 032656 mRNA 



0.08 



1.8 



5073 
5074 



Z28063 S.cerevisiae chromosome XI reading frame ORF YKL063c 



6.2 



XMJ)50445 Homo sapiens KIAA0164 gene product (KIAA0164), mRNA 



0.66 



5075 



Pectobacterium chrysanthemi ArgG (argG) gene, partial cds; 
regulatory protein PecS (pecS), regulatory protein PecM (pecM), 
indigoidine systhesis protein IdgA (idgA), and indigoidine systhesis 
AF2652 1 1 Iprotein IdgB (idgB) genes, complete cds; and indigoidine sys> 



0.72 
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SEQIT 
NO 


) 

ACCESSN 


5076 


X87201 


5077 


AF120475 


5078 


AF270407 


5079 


U47138 


5080 


AB049420 


5081 


AF009415 


5082 


AE001395 


5083 


U62542 



Table 3A Neare st Neighbor (BlastN vs . Genhanlrt 

, DESCRIP 

IBorrelia burgdorferi plasmid, orfA, B, C, D, F, bdrV genes, clone 
I pOMB 14 and pOMB 1 7 

Mus musculus myelin-associated oligodendrocytic basic proteins 
MOBP170 and MOBP69 (Mobp) gene, exons 3 through 5 and 
[complete cds, alternatively spliced 

Staphylococcus epidermidis strain SRI clone step.4050g06 genomic 
I sequence 

Inkoo virus SW AR 83-161 nucleocapsid protein and nonstructural 
protein genes, complete cds 

Mus musculus DNA, clone:7-l, derived from 4x chromosome of 



5084 



5085 



5086 
5087 
5088 
5089 
5090 



Staphylococcus xylosus choline transporter (cudT), putative regulatory 
protein (cudC), glycine betaine aldehyde dehydrogenase (cudA) and 
choline dehydrogenase (cudB) genes, complete cds 
Plasmodium falciparum chromosome 2, section 32 of 73 of the 
complete sequence 

jDrosophila melanogaster dead ringer (dead ringer) mRNA, complete 
cds 



P VALUE 1 
0.076 



AL445527 



Human DNA sequence from clone RP1 l-i 15C2 on chromosome 1 
complete sequence [Homo sapiens! 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQH 
NO 


> 

ACCESSN 




P VALUE 


5103 


AB055370 


Macaca fasaculaxis brain cDNA, clone:QflA-12708 


0.079 


5104 


AK008762 


iviub iiiu^uuiub auuii maie stomacn cdn A, kikkn full-length 
enriched I ihrarv r^nn^'OOinCiO moo fiiii tnca^ «^» aMm 


2.1 


5105 


I M71245 


W Jit nrn^t Jiff*!!! C^'X CllVlllTtlt (TPno pnmnlata 

ixai pivjauxt&in ouuuiui gene, complete cos 


0.69 


5106 


I AY007089 


Homo sanietK clone TfTYTAnm in mPMA cpnnon^ 


0.000009 


5107 


AF1 13007 


Homo saniens PROOOfifi mPMA pnmniptp ™ic 


0.0007 


5108 


AF125461 


Caenorhabditis elegans cosmid Y8A9A, complete sequence 


0.69 


5109 


AF016179 


Rattus norvegicus putative pheromone receptor (Go-VN2) mRNA, 
complete cds 


0.071 


5110 


AK021604 


Homo sapiens cDNA FLJ11542 fis, clone HEMBA1002801 


0.026 


5111 


Z73968 


Caenorhabditis elegans cosmid C05 A2, complete sequence 


0.54 


5112 


U40995 


Danio rerio Hox-al gene, complete cds 


I 0.009 


5113 


NC 001728 


Odontoglossum ringspot virus, complete genome 


0.027 


5114 


2056 P10 07A 
3 268157 


8778469 


gi|8778469| 
gb|AAF794 
77.1|AC02 
2492_21 
(AC022492 
) F1L3.30 
Arabidops 
is thaliana] 


j 5115 


AF234932 


C 'Tiff n OTYtPrW ci/Milo r^l nnp Q en-tin nana «-irkW~in1 r^Ar> 
^✓.i. pi&i y a oiuuid wIUlIC d-t/LLll gene, Ual Hal COS 


0.24 


5116 


M60858 


Hlimfln nnplfilVHn apnp prim-nl^f-p* aHc 

illillUUl llUWICUliil gCllw, CVJILILHCLC ttlb 


0.66 


5117 


XM 003100 


Homo sapiens hypothetical protein FLJ1 1 142 (FLJ1 1 142), mRNA 


0.23 


5118 


AF129278 


Buchnera aphidicola natural-host Diuraphis noxia gluconate-6- 
phosphate dehydrogenase (gnd) gene, partial cds 


6.4 


5119 


AF283665 


Hetemrfpra pIvpitipq oimnvK/l rvr1<>cp ml? "MA mmniafa 
ix^iMuugia giywuica guauyiyi uyciaac UltvlNA, COmpieie CuS 


5.4 


5120 


AJ005205 


Homo sapiens 5HT3 gene for serotonin 3 receptor 


0.24 


5121 


AF135472 


vivouiiuiuu uwij w nitMi ivcp piuLcm gene, complete COS 


0.23 


5122 


XM 036007 


Homo Qanipnc trsncmpmKrQUA nmtoin i / r v\/rc\At \ m n\TA 
iiuiuv aapituia u<u&MiiGiiiUialie protein X \ 1 NIXltV/LLjy nucNA 


2.1 


5123 


AL031426 


Human DNA sequence from clone CTA-191D12 on chromosome 
z,zq± i uoniams two exons or tne ArULz gene for apolipoprotein L 
2, ESTs and GSSs, complete sequence [Homo sapiens] 


0 000001 

V . W W Vv X 


5124 


XM_049821 ] 


Homo sapiens hypothetical protein MGC14832 (MGC14832), mRNA 


5.4 


5125 


XM 028344 ] 


Homo sapiens ribosomal protein L5 (RPL5), mRNA 


0.16 


5126 


AF016653 I 


Caenorhabditis elegans cosmid C41D7, complete sequence 


0.7 


5127 


] 

Z96752 I 


H.sapiens telomeric DNA sequence, clone 8QTEL025, read 
SQTELOO025.seq 


0.006 


5128 


M61883 J 


3us scrofa (pig) apomucin mRNA, 3' end 


0.026 
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ISEQID 
NO 


ACCESSN 


1 5129 


AY005107 


5130 
j 5131 


AF 195044 
AK021963 


5132 


AF034085 


5133 
5134 


U91929 
NMJ)10781 


| 5135 | 


AB017918 


1 5136 
\ 5137 


U55243 
AF061296~ 


5138 


AL050314 



5139 
5140 



XM_036394 
XMJ)47419 
AJ404314 



Table 3A Nea rest Neighbor (BlastN vs. Genbank) 

DESCRIP _ 
Pentachondra pumila specimen-voucher Cheny 96/1 maturase (matK) 
gene, partial cds; chloroplast gene for chloroplast product 
Homo sapiens beaded filament component protein (CP49) gene 
partial cds 9 

Homo sapiens cDNA FLJ1 1901 fis, clone HEMBA1007347~ 
JCaenorhabditi s elegans UNC-45 (unc-45) gene, complete cds" 

Mus musculus LI neural cell adhesion protein gene, 5' sequence and 
|exon 1 

JMus musculus mast cell protease 6 (Mcpt6), mRNA* 

|Mus musculus gene for gross cystic disease fluid protein 15, complete 

I cds 

Dictyostelium discoideum glyceraldehyde-3-phosphate dehydrogenase I 

(GAPDH) gene, partial cds 

Homo sapiens sulfonylurea receptor (SUR2) gene, exonT 

Human DNA sequence from clone 100G10 on chromosome 22ql3.31- 
13.33. Conta ins GSSs, complete sequence [Homo sapiens] 

Homo sapiens calcium/calmodulin-dependent protein kinase IV 
1(CAMK4), mRNA 



2.2 

0.57 
0.0005 
0,000 1 

2.1 
0.68 

0.001 

0.24 
9E-45 



6.4 



0.003 



Homo sapiens LQC87974 (LOC87974), mRNA 
I Amphisorus hemprichii 18S rRNA gene, isolate 720 



5142 AF324889 



Homo sapiens myosin phosphatase target subunit 2 (MYPT2) 
lexon 1 



0.23 



5143 



Bacteriophage PBS2 (from B.subtilis) uracii-DNA glycosylase 
J04434 inhibitor gene, complete cds 



0.61 



b a . Homo ^ens inter-alpha (globulin) inhibitor, HI polypeptide 

5144 | XMJ)41213 JqiTHl), mRNA 



5.5 



5145 
5146 



5147 



AF187881 iHomo sapiens l eukointegrin alpha d gene, promoter and partial cds 

Z70180 P.quinquangularis chloroplast matK gene 
AK025364 jHomo sapiens cDNA: FLJ21711 fis. clone rOT.im 



0.026 



XM 045594 Homo sapiens hypothetical protein MGC4816 (MGC4816), mRNA 



AB029068 



5151 AF215555 



Mus muscul us gene for Ser/Thr kinase KKIAMRE, exon 1, 2, 3, 4, 5 
iBlakea trinervia NADH dehydrogenase subunit F (ndhF) gene, paxtM 
Jcds; chloroplast gene for chloroplast product 



5E-82 
0.63 



0.71 



5152 
5153 



XMJ)26967 Homo sapiens h ypothetical protein MGC2668 (MGC2668), m RNA 
M32328 IC.pipiens esterase Bl gene, complete cds" 



1.9 



0.008 



5154 | U58744 



Caenorhabditis elegans cosmid F55F1 

Escherichia coli AgaR (agaR), KbaZ (kbaZ), AgaV (agaV), AgaW 
?f ^i'^ff (agaE) > AgaF (agaF), AgaA (agaA), AgaS (agaS), 

AF22849K Sin ^f^' Ag * C (3gaC) ' AgaD (agaD) ' A ^ 

AF228498 |(agal) genes, complete cds 



5155 



0.72 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQJD 
NO 


ACCESSN 


DESCRIP 


P VALTJF 


5156 


L36898 


Saccharomyces cerevisiae mitochondrion transfer RNA-Glu (tRNA- 
Glu) gene 


0.69 


5158 


D88262 


Pisum sativum PsCHS5 gene for chalcone synthase, complete cds 


0.7 


5159 


AK018080 


Mus musculus 1 1 days embiyo head cDNA, RIKEN full-length 
enriched library, clone:6230403H02, full insert sequence 


6.4 


5160 


X95275 


P.falciparum complete gene map of plastid-like DNA (IR-A) 


0.022 


5161 


D16417 


Dictyostelium discoideum mRNA 


0.009 


5162 


AF267204 


Candida tus Carsonella ruddii natural-host Cacopsylla brunneipennis 
ATP synthase alpha subunit (atoA) eene oartial cds* ATP synthase 
gamma subunit (atpG) gene, complete cds; and ATP synthase beta 
subunit (atpD) gene, partial cds 


0.026 


5163 


AF264650 


Asterella tenella trnL gene, intron sequence; chloroplast gene for 
chloroplast product 


6.4 


5164 


AF003143 


Caenorhabditis elegans cosmid C53H9, complete sequence 


0.23 


5165 


M19459 


Plasmodium falciparum interspersed repetitive DNA and an open 
reading frame 


0.008 


5166 


AK001362 


Homo sapiens cDNA FLJ10500 fis, clone NT2RP2000369 


0.23 


5167 


AF170023 


Anopheles gambiae clone 138 genomic sequence 


0.57 


5168 


AJ012333 


Buchnera aphidicola plasmid pBTc2, trpE and tipG genes 


0.006 


5169 


AF158077 


Pisacha naga 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


0.019 


5170 


U68721 


Botiyotinia fuckeliana endopolygalacturonase 5 (BcPGAS) gene, 
complete cds 


0.21 


5171 


AK026372 


Homo sapiens cDNA: FLJ22719 fis, clone HSI14307 


e-116 


5172 


NM 031130 


Rattus norvegicus nuclear receptor subfamily 2, group F, member 1 
(Ni2fl), mRNA 


3 


5173 


AY034470 


Homo sapiens clone BGL2 mRNA sequence 


0.064 


5174 


AF108841 


Homo sapiens human endogenous retrovirus HERV-H10 pol protein 
(pol) gene, partial cds; env pseudogene and 3' LTR, complete 
sequence 


0.009 


5175 


AJ251957 


Ciona intestinalis mRNA for nuclear lamin (lamin L2 gene) 


1.9 


5176 


XM 045437 


Homo sapiens a disintegrin and metalloproteinase domain 33 
(ADAM33), mRNA 


0.71 


5177 


AF177983 


Homo sapiens NAD-Kdependent 1 5-hydroxyprostaglandin 
dehydrogenase (PGDH) gene, promoter, exons 1 and 2 and partial cds 


9.9 


5178 


AF181632 


Drosophila melanogaster BcDNA.GH05095 (BcDNAGH05095) 
mRNA complete cds 


0.23 


5179 


AK018339 


Mus musculus 10 days neonate cerebellum cDNA, RIKEN foil-length 
enriched library, clone:6530414F19, full insert sequence 


2.1 


5180 


XM 050838 


Homo sapiens wingless-type MMTV integration site family, member 
10A (WNT10A), mRNA 


0.22 


5181 


AJ308588 


Medicago truncatula partial mRNA for nodulin 25 (nod25 gene), 
clone 3 


4.3 
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Table 3 A Nearest Neighbor (BlastN vs. Genbanto 


oCy JJJ 

NO 


ACCESSN 


DESCRIP 


p VALUE 


5182 


AF229797 


Homo sapiens cis-golgi SNARE (GOSR1) gene, exon 9 and partial 
cds 


1 0 




76SL 1 fl*7 


Caenorhabditis elegans cosmid F48C5, complete sequence 


2.1 


JloO 


AYUZOU4D 


Mus musculus germ cell-specific unknown protein mRNA, complete 
cds 


0.079 


C1Q7 




Staphylococcus aureus serine protease operon, complete sequence 


1 


JlOO 


Ar3:>5103 


Pleurotus ostreatus linear mitochondrial plasmid mlp2, partial 
sequence 


0.069 


5189 


AF055079 


Panulirus argus inositol 1,4,5-trisphosphate receptor (IP3R) mRNA, 
complete cds 


5.2 


5190 


AC007184 


Arabidopsis thaliana chromosome II section 163 of 255 of the 
complete sequence. Sequence from clones Tl 1P1 1 


6.1 




Ad044J48 


Arabidopsis thaliana AtSUGl mRNA, complete cds 


1.2 


5192 


XM 003320 


Homo sapiens hypothetical protein FLJ1 1 155 (FIJI 1 155), mRNA 


5.3 ! 


5193 


AF148987 


Legionella fallonii macrophage infectivity potentiator (mip) gene, 
partial cds 


0.067 


5194 


X54200 


Chicken mRNA for GARS-AIRS-GART 


0.019 


5195 


AE005885 


Caulobacter crescentus section 211 of 359 of the complete genome 


0.62 


5iyo 


AF228498 


Escherichia coli AgaR (agaR), KbaZ (kbaZ) AeaV (aeaV> AraW 
(agaW), AgaE (agaE), AgaF (agaF), AgaA (agaA), AgaS (agaS), 
KbaY (kbaY), AgaB (agaB), AgaC (agaC), AgaD (agaD), and Agal 
(agal) genes, complete cds 


0.63 


5197 


U09859 


Bos tauras enterokinase mRNA, complete cds 


6.2 | 




Ar 057708 


Populus balsamifera subsp. trichocarpa PTD protein (PTD) gene 
complete cds 


0.68 


5199 


AF238381 


Homo sapiens PTOV1 (PTOV1) gene, complete cds 


0.18 


5200 


AF1 19554 


Plasmodium falciparum para-aminobenzoic acid synthetase gene, 
complete cds 


0.13 




Xo2675 


B.taurus BoLA-All gene (exon 6-8) 


0.71 


5202 


AF303087 


Tilapia mossambica glycoprotein alpha subunit mRNA, partial cds 


0.64 




JNM U0oo49 


Homo sapiens serologically defined colon cancer antigen 16 
(SDCCAG16),mRNA 


1.7 


5204 


AF240597 


Homo sapiens clone 7ptelcllt7 sequence 


e-112 


MAC 


OC006702 


Mus musculus, RJKEN cDNA 4930529008 gene, clone MGC:7935 
IMAGE:3583936, mRNA, complete cds 


6.3 


5206 


AF295769 


Mus musculus SMAD1 gene, intron 1, partial sequence 


0.007 


5207 


AF1 18099 


Toxoplasma gondii PITSLRE-like protein kinase (tpkl) gene, partial 
cds 


1.9 


5208 


AF143952 ] 


Homo sapiens PELOTA (PELOTA) gene, complete cds 


0.44 


5209 


: 

AF028710 i 


[nfluenza A vims (A/HongKong/156/97(H5Nl)) nucleoprotein 
mRNA, complete cds 


1.7 


5210 


XM 017097 ] 


Homo sapiens active BCR-related gene (ABR), mRNA 


0.7 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


5211 


M33312 


Rat hepatic steroid hydroxylase IIA1 (CYP2A1) gene, complete cds 


0.54 


5212 


L05616 


Dictyostelium purpureum (Dpp4) DNA sequence, repeat region 


0.061 


5213 


AE001425 


Plasmodium falciparum chromosome 2, section 62 of 73 of the 
complete sequence 


0.001 


5214 


XM 006574 


Homo sapiens similar to plakophilin 2 (H. sapiens) (LOC93271) 
mRNA 


0.009 


5215 


XM_011195 


Homo sapiens centromere protein E (3 12kD) (CENPE), mRNA 


0.002 


5216 


AF181881 


Helicobacter pylori TolB (tolB) and PAL (excC) genes, complete cds 


0.65 


5217 


U33208 


Human B-lymphocyte activation antigen (B7. 1) gene, enhancer region 
and 5'UTR 


0.0003 


5218 


AB052187 


Macaca fascicularis brain cDNA, clone:QnpA-12170 


0.7 


5219 


AY032741 


Drosophila melanogaster H.M.S. Beagle transposon long terminal 
repeat, complete sequence; and heat shock protein Hsp70Ab gene, 
promoter and partial cds 


0.63 


5220 


AJ276592 


Pisum sativum lipl mutant copl gene for constitutively 
photomorphogenic 1 protein, exons 1-19 


5.7 


5221 


XM_045136 


Homo sapiens similar to cadherin related 23 (H. sapiens) 
(LOC92444), mRNA 


2E-52 


5222 


AK014534 


Mus musculus 0 day neonate skin cDNA, RIKEN full-length enriched 
library, clone:4631424J17, full insert sequence 


2E-55 


5223 


U67572 


Methanococcus jannaschii section 114 of 150 of the complete genome 


0.65 


5224 


AK026607 


Homo sapiens cDNA FLJ22954 fis, clone KAT09813, highly similar 
to AF0103 15 Homo sapiens Pigl 1 (PIG1 1) mRNA 


0.21 


5225 


Y08580 


F.rubripes hsp70-3 gene, 3*1701 


1.8 


5226 


AL359819 


Human DNA sequence from clone RP1 1-109B17 on chromosome 1, 
complete sequence [Homo sapiens] 


0.76 


5227 


AF1 18099 


Toxoplasma gondii PITSLRE-like protein kinase (tpkl) gene, partial 
cds 


1.9 


5228 


U48937 


Homo sapiens amiloride-sensitive epithelial sodium channel gamma 
subunit gene, promoter region and exon 1 


5.1 


5229 


AKO 17057 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4933434I20, full insert sequence 


1.8 


5230 


AF044770 


Phaeoprogne tapera microsatellite HrU6 allele 2 repeat region 


4.6 


5231 


XM 012268 


Homo sapiens amino acid transporter 2 (KIAA1382), mRNA 


0.009 


5232 


BC008544 


Mus musculus, Similar to hypothetical protein, clone MGC:7703 
IMAGE:3497634, mRNA, complete cds 


7E-22 


5233 


AB046353 


Sapporo-like virus Yak2-2000-jp pol gene for RNA-dependent RNA 
polymerase, partial cds 


6.2 


5234 


L07305 


Histoplasma capsulatum (clone pMS3) H-ATPase gene, complete cds 


0.65 


5235 


NM 012828 


Rattus norvegicus Calcium channel subunit beta 3 (Cacnb3), mRNA 


2 


5236 


AF277959 


Helicobacter pylori strain 96-74 BabB (babB) gene, partial cds 


0.71 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 

\Trv 
ViiJ 


ACCESSN 


DESCREP 


P VALUE 


5237 


Z16681 


H. sapiens (D8S266) DNA segment containing (CA) repeat; clone 
AFM151ye3; single read 


0.003 


5238 


AB049452 


Staphylococcus epidermidis composite transposon DNA, complete 
sequence 


0.077 


5239 


X83673 


X.laevis SSB1 gene 


1.5 


5240 


XM 045182 


Homo sapiens serine (or cysteine) proteinase inhibitor, clade F (alpha- 
2 antiplasmin, pigment epithelium derived factor), member 2 
(SERPINF2), mRNA 


6.2 


5241 


AF263074 


Cobitis paludica cytochrome b gene, partial cds; mitochondrial gene 
for mitochondrial product 


0.15 


5242 


XMJ>29785 


Homo sapiens Kell blood group (KEL), mRNA 


2.1 


5243 


AE006148 


Pasteurella multocida PM70 section 115 of 204 of the complete 
genome 


5.5 


5244 


X81975 


B. vulgaris mRNA for sucrose 6-phosphate synthase 


1.3 


5245 


NM_033105 


Homo sapiens beta cysteine string protein (LOC85479), mRNA 


0.000006 


5246 


AC006164 


Homo sapiens clone UWGC:y28gap ftom 6p21, complete sequence 


0.47 


5247 


AJ279971 


Cams familiaris microsatellite DNA, clone DTRcn.20 


0.022 


5248 


AE005618 


Escherichia coli 0157:H7 EDL933 genome, contig 3 of 3, section 237 
of 290 


0.56 


5249 


AJ012338 


Listeria monocytogenes NotI restriction site number two and flanking 
sequences 


1.6 


5251 


D 14027 


Mouse DNA for TL antigen, complete cds 


5 


5252 


Y12285 


H.armigera mRNA for putative serine protease, clone SRI 06 


0.009 


5253 


AF281912 


Homo sapiens clone 16qtel_c89bt7 sequence 


1E-68 


5254 


NC_001790 


Ovine papillomavirus 2, complete genome 


0.61 


5255 ' 


XM 031110 


Homo sapiens kallikrein 10 (KLK10), mRNA 


0.041 


5256 


XMJ)42732 


Homo sapiens guanine nucleotide binding protein (G protein), alpha 
inhibiting activity polypeptide 3 (GNAI3), mRNA 


0.42 


5257 


XM 033541 


Homo sapiens KIAA0704 protein (ORP3), mRNA 


0.022 


5258 


AL512432 


Human DNA sequence from clone RP11-245H22 on chromosome 6 
Contains GSSs, complete sequence [Homo sapiens] 


0.0001 


5259 


S82508 


Human adenovirus type 9 E4 protein (E4), Or£2, Orf3, Orf4, and OrtfS 
genes, complete cds; and Orf7 gene, partial cds 


1.6 


5260 


XM 015579 


Homo sapiens hypothetical protein FLJ13105 (FLJ13105), mRNA 


12 


1 

5261 


AF009177 


Helicobacter pylori pfs protein homolog (pfs) and sigma 80 (rpoD) 
genes, complete cds 


0.6 


5262 


AB015470 


Arabidopsis thaliana genomic DNA, chromosome 5, TAC 
clone:KlB16 


5.2 


5264 


L08970 


Nematode choline acetyltransferase (cha-1) gene, exons 1-11 and 
complete cds 


0.29 


5265 


D88193 | 


Brassica rapa DNA for S-receptor kinase, complete cds 1 


0.69 


5266 


XM 046294 


Homo sapiens testis expressed sequence 15 (TEX15), mRNA 


0.21 


5267 


XMJ)27078 


Homo sapiens ATPase, Class V, type 10D (ATP10D), mRNA 


0.22 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


5268 


AK019309 


Mus musculus adult male hippocampus cDNA, RIKEN full-length 
enriched library, clone:2900009I19, full insert sequence 


2E-13 


5269 


NM 016985 


Mus musculus myotubularin related protein 1 (Mtmrl), mRNA 


0.0004 


5270 


AK020049 


Mus musculus 13 days embiyo male testis cDNA, RIKEN full-length 
enriched library, clone:6030408K18, full insert sequence 


1.8 


5271 


AF323989 


Danio rerio transcriptional intermediary factor 2 mRNA, complete cds 


6.3 


5272 


AF175275 


Pisum sativum trans-cinnamic acid hydrolase (CYP73 A9) gene, 
CYP73A9-vl allele, partial cds 


0.073 


5274 


AF327749 


Mus musculus smoothelin gene, promoter and partial sequence 


6E-11 


5275 


AJ276633 


Glomus mosseae partial TOR2 gene, exons 1-9 


6 


- 

5276 


AF110612 


Boophilus microplus cytochrome b apoenzyme (Cytb) gene, partial 
cds; tRNA-Ser, tRNA-Leu, tRNA-Cys, and tRNA-Met genes, 
complete sequence; and NADH dehydrogenase subunit 2 (ND2) gene, 
partial cds, mitochondrial genes for mitochondrial products 


0.072 


5278 


XM 043395 


Homo sapiens fatty acid desaturase 2 (FADS2), mRNA 


3E-36 


5279 


XM.046528 


Homo sapiens zinc finger protein 133 (clone pHZ-13) (ZNF133), 
mRNA 


2 


5280 


U20807 


Bos taurus protein tyrosine phosphatase BA14 mRNA, complete cds 


0.19 


5281 


XM 036368 


Homo sapiens hypothetical gene supported by AK026802 
(LOC91 132), mRNA 


0.026 


5282 


XM 007023 


Homo sapiens G protein-coupled receptor kinase-interactor 2 (GIT2), 
mRNA 


7E-44 


5283 


U86362 


Saccharomyces cerevisiae MAL gene divergent promoter region, and 
hypothetical protein, complete cds 


0.24 


5284 


Y11740 


H. sapiens whn gene, exon la and lb 


0.22 


5285 


S78915 


{Mul element insertion site, clone 11} [maize, Transposon, 300 nt] 


0.002 


5286 


AF3 10896 


Dictyostelium discoideum RacJ (racJ) gene, complete cds; LagG-like 
protein 0agC3) gene, partial cds; and unknown genes 


0.23 


5287 


AJ012333 


Buchnera aphidicola plasmid pBTc2, trpE and trpG genes 


0.027 


5288 


AF282046 


Homo sapiens clone 15qtel_c321bt7 sequence 


4E-30 


5289 


L25128 


Lycopersicon esculentum auxin-induced proteinase inhibitor (ARPI) 
gene, complete cds 


0.51 


5290 


XMJ)49605 


Homo sapiens KIAA1467 protein (KIAA1467), mRNA 


2E-46 


5291 


AE002114 


Ureaplasma urealyticum section 15 of 59 of the complete genome 


0.054 


5292 


AF208678 


Aedes aegypti clone p502, Pony-Aa-B6 METE repeat region 


3.9 


5293 


AK024656 


Homo sapiens cDNA: FLJ21003 lis, clone CAE03685 


5E-55 


5294 


NC 002355 


Bombyx mori mitochondrion, complete genome 


2.1 


5295 


AK023898 


Homo sapiens cDNA FLJ13836 lis, clone THYRO1000734 


0.008 


5296 


AE002342 


Chlamydia muridarum, section 70 of 85 of the complete genome 


1.9 
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Table 3A Nearest Neighbor (BlastN vs Onhanlr) 


SEQIE 
NO 


> 

ACCESSN 


DESCRIP 


P VALUE 


DZy / 


AM 032347 


Homo sapiens region containing TLS-associated serine-arginine 
protein 1; TLS-associated serine-arginine protein 2; TLS-associated 
serine-arginine protein 2; TLS-associated serine-arginine protein 2; 
TLS-associated serine-arginine protein 1; TLS-associa> 


o 


5298 


Z63116 


Hsapiens CpG island DNA genomic Msel ftagment, clone 78e4, 
reverse read cpg78e4.rtla 


0.24 


5299 


AE006833 


Sulfolobus solfetaricus section 192 of 272 of the complete genome 


6.4 


5300 


XM 032811 


Homo sapiens similar to ZINC FINGER PROTEIN 20 (ZINC 
FINGER PROTEIN KOX13) (DKFZP572P0920) (H. sapiens) 
(LOC90591), mRNA 


0.21 


5301 


AF202552 


Homo sapiens DNA methyltransferase (DNMT1) gene, exons 2, 3, 
and 4 


0.23 


5302 


U92963 


Pygathrix roxellana NADH dehydrogenase suhnnit ^ rwrm <yn^n 
partial cds, tRNA-Are gene, comnlete seauence NADH 
dehydrogenase subunit 4L (ND4L) and NADH dehydrogenase subunit 
4 (ND4) genes, complete cds, and tRNA-His, tRNA-Ser and tRNA- 
Leu genes, comp> 


0.17 


5303 


Z99833 


Euglena deses chloroplast psbC gene: complete group HI twintron, 
complete internal matl gene, partial 5' and 3' psbC exons 


4.7 


5304 


AB047962 


Macaca fascicularis brain cDNA, clone:QnpA-13041 


e-125 


5305 


AF159913 


Euplotes crassus transposon Tecl clone Tecl-2 orf 2 and orf 3 
pseudogenes, complete sequence 


0.5 


5306 


Y15435 


Kluyveromyces lactis PDC1 gene, promoter region | 


0.087 


5307 


U67506 


Methanococcus jannaschii section 48 of 150 of the complete genome 


1.4 


530© 


U73943 


Bacillus subtilis phosphofructokinase I (pfkl) gene, partial cds; and 
pyruvate kinase I gene, complete cds 


2.2 


5309 


U72027 


Bos taurus photoreceptor disk rim specific protein rom-1 fROMl) 

m. 4L Mr Mr *w * ■ ■ * -A. ^AW/ITU 1 

mRNA, partial cds 


0.06 


5310 


X16509 


Rice alpha-amylase gene 


2.1 


5312 


AB049900 


Macaca fascicularis brain cDNA, clone:QnpA-19713 


0.01 


5313 


Z24756 


S.pombe rhp51 and rpal genes, complete CDS's 


0.003 


C2 1 /I 

5314 


XMJJ02989 


Homo sapiens similar to hypothetical protein FU10546 (H. sapiens) 
(LOC93 548), mRNA 


5E-62 


5316 


AL358272 


S.pombe chromosome I cosmid c458 


0.61 


5317 


XM 044123 


Homo sapiens cadhenn 20, type 2 (CDH20), mRNA 


4E-48 


5318 


AF038866 


Bacteroides fragilis transposon Tn5520 transposase (bipH) and 
mobilization protein BmpH (bmpH) genes, complete cds 


0.73 


5319 


AF332577 


Homo sapiens prosomal P27K protein (PSMA6) gene, partial cds 


2E-51 


5320 


X68650 


O.cuniculus mRNA for ryanodine receptor 


0.55 [ 


5321 


XM 038450 ] 


Homo sapiens hypothetical protein FLJ20694 (FLJ20694), mRNA 


0 
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SEQID 
NO 


ACCESSN 


DESORIP 


P VALUE 


5322 


Z65960 


Rsapiens CpG island DNA genomic Msel fragment, clone 69d2, 
reverse read cpg69d2.rtlb 


1.7 


5323 


AF3 17661 


Candida albicans 60S acidic ribosomal protein type P2-A (p2A) gene, 
complete cds 


5.8 


5324 


AE001776 


Thermotoga maritima section 88 of 136 of the complete genome 


0.64 


5325 


NM_031987 


Rattus norvegicus carnitine octanoyltransferase (COT), mRNA 


0.62 


5326 


AB035500 


Rana rugosa gene for FTZ-F1, exon 1 


0.009 


5327 


U67490 


Methanococcus jannaschii section 32 of 150 of the complete genome 


0.2 


5328 


AB060000 


Glycine max mRNA for hypothetical protein, complete cds, 
clone:SSC2 


0.22 


5329 


U50840 


Borrelia garinii 70 kbp plasmid D6 protein gene, complete cds 


0.072 


533a 


U67535 


Methanococcus jannaschii section 77 of 150 of the complete genome 


2.1 


5331 


AK001125 


Homo sapiens cDNA FLJ10263 fis, clone HEMBB 1000991 


3E-43 


5333 


AF029114 


Ascogaster sp. 16S ribosomal RNA gene, mitochondrial gene for 
mitochondrial RNA, partial sequence 


0.068 


5334 


AF320594 


Homo sapiens PKD1P4 pseudogene, exons 2 through 15 


6.3 


5335 


AY021494 


Oryza sativa microsateUite MRG3819 containing (TA)X27, genomic 
sequence 


0.024 


5336 


AF146651 


Homo sapiens glyoxalase-I gene, complete cds 


0.056 


5337 


AB054516 


Globicephala melas DNA, SINE flanking sequence Mago22 locus 


0.024 


5338 


AC001052 


Homo sapiens (subclone l_f6 from PI H56) DNA sequence, complete 
sequence 


0.003 


5339 


U00951 


Human clone A9A2BR1 1 (CAC)n/(GTG)n repeat-containing mRNA 


0.65 


5340 


AF298180 


Caenorhabditis elegans tropomyosin isoform IV mRNA, complete cds 


0.64 


5341 


XM 042336 


Homo sapiens hypothetical gene supported by AK025398 
(LOC92036), mRNA 


2.1 


5342 


L44121 


Homo sapiens (clone pHK2.1D) CMT1A gene, repeat unit 


0.0003 


5343 


AE005556 


Escherichia coli 0157:H7 EDL933 genome, contig 3 of 3, section 175 
of 290 


0.23 


5344 


AB035185 


Homo sapiens RHD gene, intron 9, complete sequence 


0.63 


5345 


AC013455 


Homo sapiens BAC clone CTD-2347014 from 7pl2-pl4, complete 
sequence 


0.23 


5346 


AJ234722 


Hordeum vulgare genomic DNA fragment; clone MWG203 l.rev 


0.24 


5347 


X79699 


Rsapiens ALU repeat, 230bp 


0.0003 


5348 


XM 050246 


Homo sapiens secretory protein SEC8; KIAA1699 protein (SEC8), 
mRNA 


0 


5349 


AB063075 


Macaca fascicularis brain cDNA clone:QtrA-l 1888, full insert 
sequence 


2E-19 


5350 


Z56909 | 


H. sapiens CpG island DNA genomic Msel fragment, clone 152h9, 
reverse read cpgl52h9.rtla 


0.69 


5351 


L44121 


Homo sapiens (clone pHK2.1D) CMT1A gene, repeat unit 


0.069 
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jSEQn 
NO 

5352 


>J 

ACCESSN 
AE002722 


DESCRIP 

xyiubupiuia meianogaster genomic sca£fold 142000013385665 
complete sequence 


P VALUE 
; 1.5 


5353 

1 CO ^ A 

J 5354 


AF201021 
f Y17506 


Stachyarrhena sp. Jansen-Jacobs 4707 ribosomal protein S16 (rpsl6) 
- S ene > paniai intron sequence, chloroplast gene for chloroplast product 
Euplotes octocannatus tRNA-Cys gene 


0.072 ! 


5356 


XM_040513 


Homo sapiens hypothetical protein FLJ14454 (FU14454) mRNA 


0.024 

! 0.7 1 


[ 5357 


j AF287736 


iviua museums i^KHAM/caisemiin/KChIP3 gene, complete cds and 
exons 3, 4 and 5, alternatively spliced 


2 ( 


1 5358 


AF230807 


Danio reno PHP-less RdgB-like protein (plrdgB) mRNA, complete 
cds r 


6.1 


5359" 


1 U67563 


Methanococxais jannaschii section 105 of 150 of the complete genome 


0.24 j 


5360 


AE000935 


ivicuidnoDactenum inennoautotropmcum from bases 1655364 to 
1666496 (section 141 of 148) of the complete genome 


0.21 j 


5361 
5362 


NM_031819 
XM_005889 


xvduub uorvegicus rAi tumor suppressor (Drosophila) homolog (Fat) 
mRNA 


5.2 

3.6 I 


j 5363 
| 5364 


AF339829 
NM 007715 


Homo sapiens clone IMAGE:609847, mRNA sequence 

iviiu> uiubcmus circacuan locomoter output cycles kaput (Clock), 

mRNA 


0.022 
e-116 


[ 5365 | 


XM 016117 


nomo sapiens nypornetical protem DKFZp586Fl 122 similar to 
axotrophin (DKFZP586F1 122), mRNA 


0 


| 5366 j 


XM 005554 


Homo sapiens regulator of differentiation (in S. pombe) 1 (ROD1) 
mRNA 


6.3 


5368 


X76311 


Rsapiens endothelial nitric oxidase synthase gene, exons 15 and 16 


0.7 j 


5369 


XM 001738 


Homo sapiens proteoglycan 4, (megakaryocyte stimulating factor, 


0.63 1 


5370 
1 5371 1 


AE000935 
Y18011 


Methanobacterium thermoautotrophicum from bases 1655364 to 
1WOTyo ybeciion i^a oi i4»j oi the complete genome 
Phaedon cochleariae mRNA for chitinase 


0.23 
0.021 


5372 
5373 


Z99768 ] 

] 
j 

AF342852 ! 


Flaveria trinervia gdcsP pseudogene 

Fusobactenum ulcerans strain NCTC 12112 16S ribosomal RNA 
gene, partial sequence; internal transcribed spacer 1, complete 

seouence* anri nhncAmoi dm a „ a „- „ 

° noosomai ivJNA gene, partial sequence 


4.5 J 
0.18 


5374 


] 

BC003397 ] 


ftomo sapiens, hypothetical protein FLJ20505, clone MGC4960 
CMAGE:3448518, mRNA, complete cds 


1F-69 1 


5375 
5376 


] 

AF385088 i 
NC_000863 ] 


lomo sapiens NOD2 (NOD2) gene, NOD2-2722G>C allele, exon 8 
ind partial cds 


0.58 


5377 


1 

AF121217 c 


>apio cynocephalus provirus, complete genome 

*attus norvegicus pro-alpha-2(I) collagen (colla2) mRNA, complete 
;ds 


0.22 
5.5 | 


5378 | 


I 

BC009075 I 


tfus musculus, beta-l,3-N-ac^tylglucosaminyltransferase 1, clone 
rfGC:6892 IMAGE:2654354, mRNA, complete cds 


1.8 { 
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SEQIE 
NO 


ACCESSN 


DESCRIP 


P VALUE 


5379 


XM 001341 


Homo sapiens DKFZP547E1010 protein (DKFZP547E1010), mRNA 


3E-15 


5380 


XM 046408 


Homo sapiens KIAA1041 protein (KIAA1041), mRNA 


0.52 


5381 


AK020701 


Mus musculus 6 days neonate skin cDNA, RIKEN full-length 
enriched library, clone: A030009B 12, full insert sequence 


0.000001 


5383 


U32711 


Haemophilus influenzae Rd section 26 of 163 of the complete genome 


0.021 


5384 


XM 046598 


Homo sapiens DKFZP434K156 protein (DKFZP434K156), mRNA 


0.58 


5385 


L43551 


Buchnera aphidicola anthranilate synthase large subunit (trpE) gene 
and anthranilate synthase small subunit (trpG) gene, complete cds 


0.65 


5386 


XM 012233 


Homo sapiens MDS023 protein (MDS023), mRNA 


0.6 


5387 


AF139358 


Homo sapiens BUB1 protein gene, exon 17 


0.008 


5388 


AK020916 


Mus musculus adult retina cDNA, RIKEN full-length enriched 
library, clone: A930031L14, full insert sequence 


1.5 


5389 


AL390170 


Homo sapiens mRNA; cDNA DKFZp547E184 (from clone 
DKFZp547E184) 


1.8 


5390 


AK021659 


Homo sapiens cDNA FLJ1 1597 fis, clone HEMBA1003856 


0.21 


5391 


AK024200 


Homo sapiens cDNA FLJ14138 fis, clone MAMMA1002765 


0.00003 


5392 


AJO 11727 


Homo sapiens partial TOP2 beta gene, sequence G 


0.00003 


5393 


AFO 18079 


Homo sapiens leukemia inhibitory factor receptor (LIFG) gene, 
alternative promoter which is functional in non-placental tissues 


6.2 


5394 


AB063085 


Macaca fascicularis brain cDNA clone:QtrA-14876, full insert 
sequence 


0.026 


5395 


AF154673 


Homo sapiens olfactory receptor HPFHIOR (HPFHIOR) gene, 
complete cds 


0.23 


5396 


AE007505 


Streptococcus pneumoniae section 188 of 194 of the complete genome 


6.6 


5397 


Z75199 


S.cerevisiae chromosome XV reading frame ORF YOR291 w 


0.081 


5398 


AB042120 


Bombyx mori Pao-like retrotransposon Kamikaze DNA, similar to 
polyprotein coding region 


0.24 


5399 


AF379854 


Cloning vector pVLH/hsp, complete sequence ! 


0.0003 


5400 


XM 009648 


Homo sapiens HSPC072 protein (HSPC072), mRNA 


0.022 


5401 


AB056333 


Macaca fascicularis brain cDNA, clone:QflA-13486 


9E-14 


5402 


AK021824 


Homo sapiens cDNA FIJI 1762 fis, clone HEMBA100567O 


3E-46 


5403 


AL5 13010 


Human DNA sequence from clone RP1 1-30M17 on chromosome 6, 
complete sequence [Homo sapiens] 


0.026 


5404 


AK023753 


Homo sapiens cDNA FLJ13691 fis, clone PLACE2000100 


0.077 


5406 


AKO 14452 ( 


Mus musculus 18 days pregnant adult female placenta and extra 
embryonic tissue cDNA, RIKEN full-length enriched library, 
clone:3830422K02, full insert sequence 


0.17 


5407 


XM 003220 : 


Homo sapiens polybromo 1 (PB1), mRNA 


1E-35 


5408 


Z78918 


HLsapiens flow-sorted chromosome 6 Hindin fragment, SC6pAl 9C6 


1.5 



INSDOCID: <WO 0214500A2J_> 



WO 02/14500 



PCT7US01/25840 
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SEQIE 
NO 


> 

ACCESSN 




P VALUE 


5409 


L13289 


Mycoplasma pirum purine phosphorylase gene, 3* end, 
aeoxynooaiaoiase, tnymiaine phosphorylase, cytidine deaminase, 
phosphomannomutase genes, complete cds, and triose phosphate 

laUIllCldaC gene, D end 


0.06 


5410 


I AC024203 


Caenorhabditis elegans cosmid Y72A2A, complete sequence 


6.3 


5411 


) AF373586 


nomo sapiens translocation breakpoint MLLT1/MLL fusion gene, 
partial sequence 


0.54 


5412 


Z46774 


A.pjeuropneumoruae troA gene for transiernn-binding protein (1949 
bp) 


0.025 


5413 


AF3 19044 


ivicuidiiosarcina aceuvorans strain C2A MtaF (mtaF) and MtaG 
(mtaG) genes, complete cds 


0.44 


5414 


AE001933 


i^cmococcus racuoaurans ivi section 70 ol 229 of the complete 
chromosome 1 


6.3 


5415 


j M22734 


jTuiiMii pidieiei-aenvea growtn iactor A type receptor mRNA, 
comnlefp ivIq 


0.59 


5416 


AF264650 


Asterella tenella trnL gene, intron sequence; chloroplast gene for 

chlnmnljict r\rnHiir*t 


6.4 


5417 


AF378571 


Sambucus nigra clone VBL85 thaumatin-like protein (TLP1) mRNA, 


1.6 


5418 


U53183 


Kanus norvegicus zdu KUa estrous-specific protein mRNA, partial cds 


0.7 


5419 


AK026673 


nuino sapiens ci^ina. rl-JZJU2u ns, clone .LNG00943 


0.24 


5420 


XM 041367 


Homo sapiens NADH dehydrogenase (ubiquinone) 1 beta subcomplex, 


2E-10 


5421 


AY039939 


Arabidopsis thaliana unknown protein (F11F12.23) mRNA, complete 

Uid 


5.5 


5422 


AF250135 


ivuis museums splicing iactor Sc35 (Pr264) mRNA, partial cds, 

j)1ff*i"n£rHi/«*1v er*1tr»^H 
<xi Lti ll<lt l v Ciy aJL/llUCU 


2.1 


5423 


M32328 


C.pipiens esterase Bl gene, complete cds 


0.003 


5424 


U95369 


Bos taurus inward rectifier potassium channel BIK mRNA, complete 


0.009 


5425 


AE002157 


ureapiasma ureaiyucum section 58 or 59 or the complete genome 


5.7 


5426 


AF270633 


Homo sapiens clone 5qtel c25t3 sequence 


9E-56 


5427 


Z28252 


o.cerevisiae enromosome XI reading name ORF YKR027w 


0.18 


5428 


AE002601 


Drosophila melanogaster genomic scaffold 142000013385253, 
uvuiipicLc sequence 


0.2 


5429 


AJ403990 ] 


ciyut diuurea micFosaieinie j-jjna, clone WrlAl-61 


3.8 


5430 


J 

NMJU9253 { 


Rattus norvegicus protein tyrosine phosohatase non-recentor tvne 5 
[Ptpn5),mRNA 


0.73 


1 1 1 

34.51 [ 


AE006543 1 


Streptococcus pyogenes Ml GAS strain SF370, section 72 of 167 of 
the complete genome 


0.61 


5432 


] 

AF146363 


ftomo sapiens islet cell autoantigen ICA69 (ICA1) gene, exon 1 and 
martial cds 


0.073 


5433 


M96943 


3uman profilaggrin gene exons 1-3, 5' end 


0.066 


5434 1 


AK025246 ] 


iomo sapiens cDNA: FLJ21593 fis, clone COL07050 


0.18 
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SEQK) 
NO 


ACCESSN 


DESCRIP 


P VALUE 


5435 


XM 027835 


Homo sapiens general transcription factor IIIC, polypeptide 4 (90kD) 
(GTF3C4), mRNA 


5.9 


5437 


AJ223441 


Branchiostoma floridae Pa\6 gene (AmphiPax6), clone J2 


0.24 


1 5438 


AB048345 


Equus caballus DNA, chromosome Xp21, microsatellite TKY39 


0.73 


! 5439 


AB049844 


Macaca fascicularis brain cDNA, clone:QnpA-18434 


2.2 


5440 


XM 036740 


Homo sapiens similar to nuclear pore membrane glycoprotein 210 (M. 
musculus) (LOC91181), mRNA 


0.19 


5441 


AK019537 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4921505C17, full insert sequence 


0.081 


5442 


AE003964 


Xylella iastidiosa 9a5c, section 1 10 of 229 of the complete genome 


6 


5443 


L X95276 


P.falciparum complete gene map of plastid-like DNA (IR-B) 


0.008 


5444 


AF3 10264 


Ovis aries leptin mRNA, partial 3* UTR 


2.2 


5445 


XM 048688 


Homo sapiens hypothetical protein FLJ10420 (FU10420), mRNA 


2.2 


5446 


AC007133 


Arabidopsis thaliana chromosome n section 209 of 255 of the 
complete sequence. Sequence from clones T6A23, F13I13 


2 


5447 


AJ011727 


Homo sapiens partial TOP2 beta gene, sequence G 


0.00003 


5448 


AF2 14665 


Homo sapiens cAMP responsive element modulator (CREM) gene, 
exonH 


0.52 


5449 


AF269319 


Staphylococcus epidermidis strain SRI clone step.l002a02 genomic 
sequence 


1.9 


5450 


AE001393 


Plasmodium falciparum chromosome 2, section 30 of 73 of the 
complete sequence 


0.72 


5451 


XM 002437 


Homo sapiens cAMP-regulated guanine nucleotide exchange factor n 
(CAMP-GEFII), mRNA 


7E-32 


5452 


AF298224 


Homo sapiens RPCI-H 289E13 CD21 gene, partial cds 


0.007 | 


5453 


AF254641 


Homo sapiens oxytocin/vasopressin intergenic region 


0.01 


5454 


XM_010303 


Homo sapiens phosphate cytidylyltransferase 1, choline, beta isoform 
(PCYT1B), mRNA 


2 


5455 


AKO 14447 


Mus musculus 18 days pregnant adult female placenta and extra 
embryonic tissue cDNA, RIKEN full-length enriched library, 
clone:3830421F13, full insert sequence 


0.22 


5456 


M21084 


P.hybrida 5-enolpyruvylshikimate 3-phosphate synthase mRNA, 
complete cds 


0.24 


5457 


XM_006648 


Homo sapiens Microfibril-associated glycoprotein-2 (MAGP2), 
mRNA 


1.4 


5458 


AF175434 


Gallus gallus T cell receptor delta chain (TCRD) pseudogene, partial 
sequence 


0.76 




AE000775 


Aquifex aeolicus section 107 of 109 of the complete genome 


0.0008 


5460 


XM 031261 


Homo sapiens similar to KIAA0377 gene product (H. sapiens) 
(LOC90371), mRNA 


1.6 


5461 


BC008389 


Homo sapiens, Similar to hypothetical protein LOC57821, clone 
IMAGE:4076369, mRNA 


0.24 
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Table 3A Nearest Neighbor (BlastN vs. Genbanlri 


SEQ ID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


5462 


AF327739 


Streptococcus thermophilic Pebl (pebl), GlnQ (glnQ), response 
regulator Rrl (rrl), Hpk2 (hpk2), YycJ-like protein (yycJ), and MurM 
(murM) genes, complete cds 


0.63 


5463 


AY030939 


HTV-1 isolate NC3940-1998 from USA pol polyprotein (pol) gene, 
partial cds 


0.026 


5464 


XM 049237 


Homo sapiens KIAA0841 protein (KIAA0841), mRNA 


0.053 


5465 


AY017147 


Paracheilognathus imbeibis mitochondrial control region, partial 
sequence 


0.25 


5466 


X81451 


Nfrontalis enolase gene, promotor region 


0.009 


5467 


AF271036 


Mus musculus clone rF12m odorant receptor gene, partial cds 


2E-08 


! 5468 


J05637 


Yeast (S.cerevisiae) 70 kDa heat shock protein (SS A4) gene, complete 
cds 


0.22 


5469 


X75420 


P.falciparum (FCR3) cpn60 gene 


0.025 


5470 


AF213017 


Acinetobacter calcoaceticus KHP18 partial pKLH2 plasmid including 
aberrant mercury resistance transposon TnPKLH2, truncated insertion 
sequence IS101LD1, and determinants for CinH resolution system 


1.9 


5471 


AC024806 


Caenorhabditis elegans cosmid Y52E8A, complete sequence 


0.026 


5472 


AF183178 


Drosophila melanogaster filamin isoforms A and B (Filamin) gene, 
exons 1 and 2 


0.043 


5473 


AF209925 


Plasmodium falciparum STARP antigen (STARP) gene, complete cds 


2.1 


5474 


AJ410864 


Ectomycorrhizal isolate ECM 133 internal partial internal transcribed 
spacer 1 (TTS1), 5.8S ribosomal RNA gene, and partial internal 
transcribed spacer 2 (TTS2); isolate ECM 133 


2 


5475 


NC.001889 


Dictyostelium discoideum plasmid Ddp5, complete sequence 


2.1 


5476 


AY009594 


Boophilus microplus clone PNC 189 isolate Kilkivan2 microsatellite 
sequence 


0.19 


5477 


U55723 


Human ataxia-telangiectasia (ATM) gene, exon 25 


0.69 


5478 


AF197229 


Saguinus oedipus MHC class II antigen (Saoe-DRBl) gene, Saoe- 
DRB1*0303 allele, partial sequence 


0.00004 


5479 


AE001290 


Chlamydia trachomatis section 17 of 87 of the complete genome 


0.2 


5480 


XM 016611 


Homo sapiens similar to 8-oxoguanine DNA glycosylase (H. sapiens) 
(LOC93577), mRNA 


0.069 


5481 


AJ224148 


Saccharomyces cerevisiae mitochondrial Tyr-tRNA, Asn-tRNA and 
Met-tRNA genes 


0.026 


5482 


AK022243 


Homo sapiens cDNA FLJ12181 fis, clone MAMMA1000746 


0.17 


5483 


U07354 


Legionella pneumophila Philadelphia- 1 intracellular multiplication 
region (icmV, icmW, icmX, lphB) genes, complete cds 


0.33 


5484 


AE00I366 


Plasmodium falciparum chromosome 2, section 3 of 73 of the 
complete sequence 


0.14 


5485 


AE001385 


Plasmodium falciparum chromosome 2, section 22 of 73 of the 
complete sequence 


1.5 


5486 


L81686 


Homo sapiens (subclone l_a3 from PI H54) DNA sequence, complete 
sequence 


0.62 


5487 


XM 017648 


Homo sapiens leucine-rich repeat-containing 2 (LRRC2), mRNA 


0.0006 
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5496 
5497 
5498 

5499 
5500 



U68246 
AK009952 
AF190794 
AF336796 



1.9 



AB035844 

AF127240 

AY019789 
AF106953 
XM_0Q7172 

NMJ)069 15 
X95961 

XMJ)4142 8 
S50200 



5503 
5504 
5505 



AE00117 8 
Z74797 
X15979 



Dictyostelium discoideum WacA (wacA) gene, complete cds 
f? 8 musciuus adult male tongue cDNA, REKEN full-length enriched 

library, clone:2310057C01, full insert sequence 

Arabidopsis thaliana subtilisin-type serine endopeptidase XSP1 
mRNA, complete cds 

Plasmodium falciparum dynamin-Iike protein (dyn) mRNA, complete 
cds 



Coturnix coturnix japonica DNA, microsateUite GUJ0034 sequence 
Nicotiana tabacum cultivar Xanthi arginine decarboxylase 1 (ADC1) 
gene, complete cds 

Oryza sativa microsatellite MRG21 14 containing (AT)X78, genomic 
sequence 

Homo sapiens SOS1 (SOS1) gene, partial cds 
Homo sapiens mab-21 (C. elegans)-like 1 (MAB21L1), mRNA 
Homo sapiens retinitis pigmentosa 2 (X-linked recessive) (RP2), 

mRNA 

A. thaliana CER3-like gene 

Homo sapiens inositol 1,4,5-triphosphate receptor, type 1 (ITPR1) 
mRNA 

dopamine beta-hydroxylase [mice, Genomic/roRNA, 2274 ntl 



5506 AF221538 



5507 AF083226 



5508 I AF262729 



5509 AF263057 



5510 | AF061244 



5511 



X64612 

AF269472 
AF297021 



Borrelia burg dorferi (section 64 of 70) of the complete genore 
S.cerevisiae chromosome XV reading frame ORF YOL 055c 
Plasmodium falciparum alpha-tubulin I gene 

Cryptosporidium felis heat shock protein 70 (HSP70) gene, partial cds 



Caenorhabditis elegans nuclear receptor NHR-8 (nhr-8) mRNA, 
complete cds 

Ornithorhynchus anatinus olfac tory receptor pla32 gen e, partial cds 

PaU^A nMA l.tt - A - — 



0.003 



0.08 



1.9 



Cobitis arachthosensis ATP synthase 8 (ATP 8) and ATP synthase 6 
(ATP 6) genes, complete cds; mitochondrial genes for mitochondrial 
products 

Agrocybe aegerita B type DNA polymerase (Mtpol) gene, complete 
cds; tRNA-Asn gene, complete sequence; and unknown genes 
mitochondrial genes for mitochondrial products 



Hsapiens dystrophin-gene (introns 13/42) 

Staphylococcus epidermidis strain SRI clone step. 1005al2 genomic 
sequence 

Homo sapiens GE36 gene, exon 8 



9.6 



0.003 
0.43 
0.014 

0,003 
0.071 



0.026 

2.1 
0.0008 

1.8 



0.003 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 


ACCESSN 


DESCRIP 


P VALUE 


5517 


AF361817 


Arabidopsis thaliana Atlg06730/F4H5__22 gene, complete cds 


0.69 


trio 


Z48243 


Athaliana PARP mRNA for PARP protein 


0.69 


5519 


AL390169 


Homo sapiens mRNA; cDNA DKFZp547D064 (from clone 
DKFZp547D064) 


e-180 


5520 


NM 032141 


Homo sapiens hypothetical protein DKFZp434K1421 
(DKFZP434K1421), mRNA 


e-169 


5521 


XM 037214 


Homo sapiens laminin, alpha 5 (LAMA5), mRNA 


0.62 


5522 


AL1 10675 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen 
deprivation 


0.008 


5S23 


AF124216 


Dasyurus vivemnus/maculatus mixed DNA library microsatellite 
Q4.4.10 sequence 


1.9 


5524 


AL583762 


Human chromosome 14 DNA sequence Partial sequence from BAC R- 
1109N18_PCR1 of library RPCI-11 from chromosome 14 of Homo 
sapiens (Human), complete sequence 


6E-42 


5525 


AC024758 


Caenorhabditis elegans cosmid Y37E11AM, complete sequence 


1.4 


5526 


AF260528 


Homo sapiens MER receptor tyrosine kinase (MERTK) gene, exons 
17 and 18 


3E-35 


5527 


AF246997 


Plasmodium falciparum malaria exported protein-1 (EXP-1) gene, 
complete cds 


0.019 


5528 


XM 011810 


Homo sapiens sigma receptor (SR3 1747 binding protein 1) (SR-BP1), 
mRNA 


3.9 


5529 


XM 011100 


Homo sapiens complement component 7 (C7), mRNA 


5.9 


5530 


X64315 


S.scrofa mRNA for Na/D-glucose cotransporter regulatoiy-subunit 


0.0007 


5531 


XM 048945 


Homo sapiens bromodomain adjacent to zinc finger domain, 2A 
(BAZ2A), mRNA 


3.4 


5532 


XM 046506 


Homo sapiens DKFZP566I1024 protein (DKFZP566I1024), mRNA 


1E-10 


5533 


XM 002977 


Homo sapiens ring finger protein 13 (RNF13), mRNA 


2.8 


5534 


AJ300359 


Homo sapiens partial RYR2 gene for ryanodine receptor 2, exon 20 


1E-36 


5535 


BC008590 


Homo sapiens, hypothetical protein FLJ21313, clone MGC:16820 
IMAGE:4 148772, mRNA, complete cds 


e-154 


5536 


XM 050246 


Homo sapiens secretoiy protein SEC8; KIAA1699 protein (SEC8), 
mRNA 


0 ! 


5537 


AL442094 


Homo sapiens mRNA; cDNA DKFZp547E024 (from clone 
DKFZp547E024) 


2 


5538 


AF225975 


Tribolium castaneum Tc -tailless gene, upstream regulatory region and 
partial cds; tRNA-Tyr gene, complete sequence; and unknown gene 


0.24 


5539 


AJ304934 


Sidalcea malviflora x Sidalcea oregana 5.8S rRNA gene, internal 
transcribed spacer 1 (ITS1) and internal transcribed spacer 2 (ITS2) 


2.1 


5540 


AE001528 


Helicobacter pylori, strain J99 section 89 of 132 of the complete 
genome 


0.009 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


5541 


XM 046698 


Homo sapiens putative DNA/chromatin binding motif (PLU-1), 
mRNA 


2.5 


5542 


AF101309 


Caenorhabditis elegans cosmid H24G06, complete sequence 


2.1 


5543 


AL023839 


Caenorhabditis elegans cosmid Y39A1C, complete sequence 


1.4 


5544 


XM 017673 


Homo sapiens hypothetical protein FLJ21934 (FLJ21934), mRNA 


0 


5545 


U27317 


Homo sapiens 11 beta-hydroxysteroid dehydrogenase 2 (HSD11B2) 
gene, complete cds 


2E-92 


5546 


XM 045095 


Homo sapiens KIAA1320 protein (KIAA1320), mRNA 


0.6 


5547 


AF143066 


Lepidopilum scabrisetum ribosomal protein system 4 (rps4) gene, 
partial cds 


0.25 


5548 


BC010880 


Homo sapiens, Similar to dihydropyrimidinase-like 2, clone 
MGC:1757 IMAGE:3542012, mRNA, complete cds 


5.2 


5549 


L25128 


Lycopersicon esculentum auxin-induced proteinase inhibitor (ARPI) 
gene, complete cds 


0.55 


5550 


NM 030665 


Homo sapiens retinoic acid induced 1 (RAI1), mRNA 


9E-57 


5552 


AF127699 


Erythroxylum argentinum chloroplast atpB-rbcL spacer 


5.5 | 


5553 


U97192 


Caenorhabditis elegans cosmid C01F4, complete sequence 


2.7 


5554 


AF145302 


Arabidopsis thaliana 14-3-3 protein GF14 epsilon (GRF10) gene, 
complete cds 


8.9 


5555 


AF169241 


Citrus sinensis capsanthin/capsorubin synthase (CCS) gene, complete 
cds 


0.21 


5556 


AF153268 


Tetrahymena thermophila dynein heavy chain (DYH6) gene, partial 
cds 


1E-15 


5557 


L13966 


Mouse delta/YYl/NF-E 1/UCRBP transcription factor, exon 3 


0.65 


5558 


AF172730 


Streptococcus gordonii Sagp-like protein gene, complete cds; and Oct- 
like protein gene, partial cds 


1.9 


5559 


AL591422 


Human DNA sequence from clone RP1 1-124P3 on chromosome X, 
complete sequence [Homo sapiens] 


1.8 


5560 


AF328433 


Homo sapiens clone 2qtel c275 24t3 sequence 


4.8 


5561 


AF052640 


Podarcis pityusensis cytochrome b (cytb) gene, mitochondrial gene 
encoding mitochondrial protein, partial cds 


2.3 


5562 


AF133000 


Cicindela marutha 16S large subunit ribosomal RNA, mitochondrial 
gene for mitochondrial product, complete sequence 


0.024 


5563 


AJ239057 


Antirrhinum majus far gene for farinelli protein 


5.1 


5564 


AL032651 


Caenorhabditis elegans cosmid Y6D1A, complete sequence 


2 i 


5565 


AJ278974 


Homo sapiens partial SNAP-23 gene for synaptosome associated 
protein-23. exons 6-8 


1.8 


5566 


M33379 


Guinea pig lipoprotein lipase (gplpl) gene, exon 3-7 


0.46 


DjO 1 


AF303443 


Rheum reticulatum tRNA-Leu (trnL) gene and tmL-trnF intergenic 
spacer, partial sequence; chloroplast gene for chloroplast product 


0.018 


5568 


L81826 


Homo sapiens (subclone l^glO from PI H39) DNA sequence, 
complete sequence 


4E-16 


5569 


NM 005958 


Homo sapiens melatonin receptor 1 A (MTNR1A), mRNA 


2 


5570 


t AJ133356 


Homo sapiens MSMB gene (partial), exons 2-3 


0.66 
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Table 3A Nearest Neighbor (BlastN vs. Geribank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


5571 


D86577 


Human herpesvirus type 7 U38 gene, U39 gene for glycoprotein B and 
U40 gene for DNA polymerase, complete and partial cds 


0.5 


5572 


AE006578 


Streptococcus pyogenes Ml GAS strain SF370, section 107 of 167 of 
the complete genome 


0.008 


5573 


U31170 


Streptococcus mutans ATPase operon, complete sequence 


0.041 


5575 


AF301004 


Schistosoma mansoni thioredoxin peroxidase 3 (TPx3) gene, complete 
cds 


0.23 


5576 


AF269295 


Staphylococcus epidermidis strain SRI clone step.l000d09 genomic 
sequence 


2 


5577 


AF1 14934 


Saccharomyces sp. S6U ATP synthase subunit 8 (ATP8) gene, 
mitochondrial gene encoding mitochondrial protein, complete cds 


0.008 


5578 


AF261143 


Homo sapiens chromosome 9 lung cancer-associated mRNA 


2 


5579 


X95276 


P.falciparum complete gene map of plastid-like DNA (IR-B) 


0.00003 


5580 


NM 030844 


Rattus norvegicus islet cell autoantigen 1, 69 kDa (leal), mRNA 


0.63 


5581 


XM 001738 


Homo sapiens proteoglycan 4, (megakaryocyte stimulating factor, 
articular superficial zone protein) (PRG4), mRNA 


0.21 


5582 


AB060860 


Macaca fascicularis brain cDNA clone:QtrA-l 1624, full insert 
sequence 


5.1 


5583 


NC_001566 


Apis mellifera ligustica mitochondrion, complete genome 


0.15 


5584 


AF239566 


Polycycnis ornata tRNA-Leu (trnL) gene, partial sequence; and trnL-F 
intergenic spacer region, complete sequence; chloroplast gene for 
chloroplast product 


0.008 


5585 


X92893 


C.roseus GGPP synthase gene 


1.6 


5586 


XM 002601 


Homo sapiens desmin (DES), mRNA 


5.7 


5587 


NM_0 12745 


Rattus norvegicus CD94 antigen (located within the rat natural killer 
gene complex) (Klrdl), mRNA 


1.4 


5588 


AK022035 


Homo sapiens cDNA FLJ11973 fis, clone HEMBB1001221 


0.023 


5589 


AB020595 


Plasmodium falciparum 1-CysPxn mRNA for 1-cys peroxidoxin, 
complete cds 


1.4 


5590 


AL121778 


Human DNA sequence from clone RP5-839B1 1 on chromosome 20. 
Contains the first coding exon of the gene for a novel protein with a 
Kunitz/Bovine pancreatic trypsin inhibitor domain and WAP-type 
(Whey Acidic Protein) Tour-disulfide core 1 domains and an e> 


0.07 


5591 


AF222338 


Rattus norvegicus voltage gated N-type calcium channel alpha IB 
gene, partial cds, alternatively spliced 


5.8 


5592 


AF247193 


Mus musculus endobrevin (Vamp8) gene, exon 1 


0.18 


5593 


AC026061 


Homo sapiens BAC clone RP1 1-223K9 from Y, complete sequence 


e-163 


5594 


XM 030132 


Homo sapiens ubiquitin specific protease 9, X chromosome 
(Drosophila fet facets related) (USP9X), mRNA 


0.21 


5595 


AK015052 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
libraiy, clone:4930402K13, full insert sequence 


0.22 


5596 


AK023333 


Homo sapiens cDNA FLJ13271 fis, clone OVARC1001000 


0.002 


5597 


AB058702 


Homo sapiens mRNA for KIAA1799 protein, partial cds 


0.007 


5598 


Z36800 


H.sapiens (xsl49) mRNA, 250bp 


0.0003 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


5599 


XM 044123 


Homo sapiens cadherin 20, type 2 (CDH20), mRNA 


4E-48 


5600 


XM 030272 


Homo sapiens LOC88835 (LOC88835), mRNA 


0.3 


5601 


AF030944 


Brugja malayi microfilarial sheath protein SHP3a (Bmshp3a) and 
microfilarial sheath protein SHP3 precursor (Bmshp3) genes, 
complete cds 


0.089 


5602 


U59882 


Mus musculus MLH1 gene, exon 2 


5.2 


5603 


AL591498 


Human DNA sequence from clone RP1 1-1 13L12 on chromosome 13, 
complete sequence [Homo sapiens] 


1.8 


5604 


AB002169 


Rattus norvegicus RT1.P1 pseudogene for TL antigen 


2 


5605 


AB009907 


Luciola kuroiwae mitochondrial DNA for 16S rRNA, partial sequence 


0.003 


5606 


AF336829 


Rattus norvegicus NEDD8-actrvating enzyme mRNA, complete cds 


0.58 


5607 


AJ3 10344 


Scyliorhinus canicula partial mRNA for follicle stimulating hormone 
beta subunit (fsh beta gene) 


0.22 


5608 


D00653 


Homo sapiens gene for enteric smooth muscle gamma-actin, exon 8 


2 


5609 


AF203972 


Shuttle vector pBA complete sequence 


2 


5610 


AK004804 


Mus musculus adult male lung cDNA, RIKEN full-length enriched 
library, clone:1200016Bll, full insert sequence 


5.3 


5611 


AF270009 


Staphylococcus epidermidis strain SRI clone step.l040e03 genomic 
sequence 


0.037 


5612 


BC006448 


Homo sapiens, clone IMAGE:3637085, mRNA 


1.9 1 


5613 


AF298202 


Dictyostelium discoideum putative transposon DDT-B, complete 
sequence 


0.019 


5614 


AF272001 


Ebola virus subtype Zaire strain Mayinga complete genome 


0.002 


5615 


M86544 


Cow prostaglandin F synthetase II (PGFSII) mRNA, complete cds 


0.075 


5616 


XM_039684 


Homo sapiens putative G-protein coupled receptor (SH120), mRNA 


0.16 


5617 


AJ299399 


Medicago truncatula mRNA for putative AUXl-like permease (lax3 
gene) 


6.4 


5618 


L39214 


Ovis aries glucose transporter type 3 (GLUT-3) mRNA, complete cds 


0.23 


5619 


XM 010126 


Homo sapiens stromal antigen 2 (STAG2), mRNA 


e-127 


5620 


Z23801 


H. sapiens (D17S93 1) DNA segment containing (CA) repeat; clone 
AFM248tg5; single read 


• 5E-19 


5621 


I AF2 10842 


Homo sapiens HARP (HARP) gene, exon 17 and complete cds 


0.65 


5622 


AB047239 


Staphylococcus aureus DNA, complete structure of cassette 
chromosome(SCC)-like element, strain: ATCC25923 


0.25 


5623 


AB035193 


Homo sapiens RHCE gene, intron 7, complete sequence 


2.1 


5624 


XM 045035 


Homo sapiens hypothetical protein FLJ21343 (FLJ21343), mRNA 


2.4 


5625 


XM 052223 


Homo sapiens DKF2P564G092 protein (DKFZP564G092), mRNA 


0.67 


5626 


Z28007 


S.cerevisiae chromosome XI reading frame ORF YKL007w 


0.23 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCREP 


P VALUE 


5627 


M14080 


Herpesvirus saimiri thymidylate synthase gene, complete cds 


0.025 | 


5628 


AF067218 


Caenorhabditis elegans cosmid K10E9 


0.071 


5629 


AK017102 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4933437K13, full insert sequence 


0.76 


5630 


XM 038524 


Homo sapiens hypothetical protein FLJ23059 (FLJ23059), mRNA 


e-118 


5631 


AB052773 


Bombyx mori Bmdsx gene, exon 1, 2, 3, 4, female-specific splicing 
product complete cds 


0.24 


5632 


AF233069 


Galdieiia sulnhuraiia maturase CmatK) eene. oartial cds* 50S 
ribosomal protein, ribulose-l,5-bisphosphate carboxylase/oxygenase 
large subunit (rbcL), ribulose-l,5-bisphosphate carboxylase/oxygenase 
small subunit (rbcS), acetohydroxy-acid synthase small> 


0.55 


5633 


AK027218 


Homo sapiens cDNA: FLJ23565 fis, clone LNG10846 


0.24 


5634 


AF198462 


HTV-1 isolate WM710-5 from Australia envelope glycoprotein (env) 
gene, partial cds 


0.61 


5636 


AF213717 


Euhadra quaesita haplotype Mana-B 16S ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


0.21 


5637 


AE002730 


Drosophila melanogaster genomic scaffold 142000013385730, 
complete sequence 


1.4 


5638 


AF1 10027 


Candida albicans ATP-dependent transporter (YCF) gene, complete 
cds 


0.069 


5639 


Z73978 


Caenorhabditis elegans cosmid ZC302, complete sequence 


0.21 


5640 


AF379877 


Aclista sp. M226 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


0.52 


5641 


AF364544 


Larix lyallii microsatellite LLY13 sequence 


2.2 


5642 


AF346905 


Dermatophagoides farinae allergen Def f II precursor, mRNA, partial 
cds 


0.008 


5643 


AE001426 


Plasmodium falciparum chromosome 2, section 63 of 73 of the 
complete sequence 


0.001 


5644 


XM_002850 


Homo sapiens TU3 A protein (TU3 A), mRNA 


0.078 


5645 


AB032899 


Rattus norvegicus PIPK2 alpha mRNA for phosphatidylinositol 5- 
phosphate 4 -kinase alpha, complete cds 


4.4. 


5646 


AK017370 


Mus musculus 6 days neonate head cDNA, RIKEN full-length 
enriched library, clone:5430431E21, full insert sequence 


6E-64 


5647 


XM 042126 


Homo sapiens similar to GTP-RHO BINDING PROTEIN 1 
(RHOPHUJN) (M musculus) (LOC91997), mRNA 


3E-47 


5648 


XM 048666 


Homo sapiens runt-related transcription factor 1 (acute myeloid 
leukemia 1; amll oncogene) (RUNX1), mRNA 


6.3 


5649 


| AJ000259 


Caenorhabditis elegans osm-6 mRNA 


0.6 


5650 


AK017713 


Mus musculus 8 days embryo cDNA, RIKEN full-length enriched 
library, clone: 5730488L07, full insert sequence 


0.087 


5651 


XM 010294 


Homo sapiens arylsulfatase E (chondrodysplasia punctata 1) (ARSE), 
mRNA 


8E-36 ! 


5652 


XM 039942 


Homo sapiens Meis (mouse) homolog 3 (MEIS3), mRNA 


0.53 



421 



3NSDOCID: <WO 0214500A2J_> 



WO 02/14500 



PCT/US01/25840 





Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


5653 


AK019524 


Mus musculus 0 day neonate head cDNA, RIKEN full-length enriched 
library, clone:4833439017, full insert sequence 


1.7 


5654 . 


XM 045122 


Homo sapiens KIAA145 1 protein (KIAA145 1), mRNA { 


5E-38 


5656 


AK009345 


Mus musculus adult male tongue cDNA, RIKEN full-length enriched 
library, clone:2310014M14, full insert sequence 


3E-35 


5657 


AF192496 


Homo sapiens PDX-1 gene, 5 ! flanking region 


1.5 


5658 


XM_052620 


Homo sapiens KIAA0935 protein (KIAA0935), mRNA 


0.0003 


5659 


XMJ)49514 


Homo sapiens hypothetical protein FLJ13153 (FLJ13153), mRNA 


5.6 


5660 


AJ400965 


Proteus mirabilis ORF1 DNA, katA gene and menE gene, strain PR 


6.5 


5661 


NM 031809 


Rattus norvegicus cyclic nucleotide-gated channel beta subunit 1 
(Cngbl), mRNA 


0.63 


5662 


AF230741 : 


Notothenia coriiceps alphaNCPl (alphaNCPI), betaNCPl (betaNCPI), 
alphaNCP2 (alphaNCPH), and betaNCP2 (betaNCPH) genes, 
complete cds 


1.6 


5663 


AC079878 


Homo sapiens BAC clone RP11-343P21 from 7, complete sequence 


0.086 ! 


5664 


XM 029904 


Homo sapiens hypothetical gene supported by AK023961 
(LOC90205), mRNA 


1.9 


5665 


AY042792 


Arabidopsis thaliana translin-like protein (At2g37020; T1J8.20) 
mRNA, complete cds 


4.6 


5666 


AB018347 


Homo sapiens mRNA for KIAA0804 protein, partial cds 


0.068 


5667 


AY018017 


Oryza sativa microsatellite MRG0342 containing (AG)X22, closest to 
marker G1185, genomic sequence 


0.059 


5668 


AE005310 


Escherichia coli 0157:H7 EDL933 genome, contig 1 of 3, section 134 
of 155 


0.49 


5669 


XM 034498 


Homo sapiens farnesyl diphosphate synthase (farnesyl pyrophosphate 
synthetase, dimethylallyltranstransferase, geranyltranstransferase) 
(FDPS), mRNA 


5.3 


5670 


1 AK027123 


Homo sapiens cDNA: FIJ23470 fis, clone HSI11950 


0.17 ! 


5671 


NM 013696 


Mus musculus thyrotropin releasing hormone receptor (Trhr), mRNA 


0.23 


5673 


L23758 


Bos taurus lysozyme 7 A mRNA 


1 Q.8 ! 


5674 


AY034963 


Arabidopsis tha1i?Tia putative polypeptide chain release factor 
(F13K23.17) mRNA, complete cds 


L4 


5675 


XM_028009 


Homo sapiens KIAA1563 protein (KIAA1563), mRNA 


2E-71 


5676 


XM 046473 


Homo sapiens Rho GDP dissociation inhibitor (GDI) alpha 
(ARHGDIA), mRNA 


! 0.59 


5677 


NMJ)00148 


Homo sapiens fucosyltransferase 1 (galactoside 2-alpha-L- 
fucosyltransferase, Bombay phenotype included) (FUT1), mRNA 


0.0000001 


5678 


AF203477 


Drosophila melanogaster Sticky chl (stichl) mRNA, partial cds 


2.1 


5679 


AF269242 


Plasmodium falciparum transmission-blocking target antigen P&230 
(s230) gene, complete cds 


0.23 
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NO 
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P VALUE 


5680 


AE006954 


Mycobacterium tuberculosis CDC1551, section 40 of 280 of the 
complete genome 


2.1 


5681 


Z93386 


Caenorhabditis elegans cosmid R11H6, complete sequence 


0.023 


5682 


XMJ)12723 


Homo sapiens chromosome 18 open reading frame 1 (C18orfl), 
mRNA 


5.8 


5683 


AF171785 


Drosophila melanogaster clone 69 mRNA sequence 


0.73 


5684 


AF159105 


Homo sapiens intestinal trefoil fector gene, 5* flanking region 


0.02 


5685 


NM011889 


Mus musculus septin 3 (Sept3), mRNA 


4.6 


5688 


NMJ)08541 


Mus musculus MAD homoiog 5 (Drosophila) (Madh5) ? mRNA 


0.0000002 


5689 


XM 043345 


Homo sapiens similar to inner centromere protein antigens (135kD, 
155kD) (H. sapiens) (LOC92175), mRNA 


0.075 


5690 


AL590109 


Human DNA sequence from clone RP1 1-405 J10 on chromosome 10, 
complete sequence [Homo sapiens] 


0.003 


5691 


XMJ)51593 


Homo sapiens hypothetical gene supported by AF055004 
(LOC93477), mRNA 


0.55 


5692 


U22345 


Human chromosome 20ql2 locus-specific repeat 


0.008 


5693 


X63385 


B.thuringiensis IS23 IF DNA 


0.72 


5694 


AF288224 


Brachyspira hyodysenteriae DNA Gyrase B subunit gene, complete 
cds 


0.006 


5695 


AF279456 


Lycopersicon hirsutum sesquiterpene synthase 2 (SSTLH2) mRNA, 
complete cds 


0.06 1 


5696 


AF055992 


Homo sapiens Dufify antigen/chemokine receptor (FY) gene, FY*X 
allele, complete cds 


1.5 


5697 


AF167306 


Homo sapiens synuclein alpha interacting protein (SNCAIP) gene, 
exon 10 and complete cds 


0.99 


5698 


AJ277985 


Nicotiana tabacum dreppl gene, exons 1-4 


0.22 


5699 


NMJB0238 


Mus musculus dynein, cytoplasmic, heavy chain 1 (Dnchcl), mRNA 


4E-25 


5701 


X95343 


N. tabacum mRNA for HSR201 protein 


5.9 


5702 


AE002116 


Ureaplasma urealyticum section 17 of 59 of the complete genome 


0.68 


5704 


XM 029907 


Homo sapiens anillin (Drosophila Scraps homoiog), actin binding 
protein (ANLN), mRNA 


0.72 


5705 


AJ304804 


Homo sapiens ANKRD2 gene for skeletal muscle ankyrin repeat, 
exons 1-9 


0.22 


5706 


AF371373 


Cricetulus griseus hypothetical protein Al-3 mRNA, complete cds 


0.72 ! 


5707 


AE007446 


Streptococcus pneumoniae section 129 of 194 of the complete genome 


0.18 


5708 


L44121 


Homo sapiens (clone pHK2. ID) CMT1 A gene, repeat unit 


6.6 


5709 


X90848 


S.scrofa ppk98 gene 


0.23 


5710 


AF397903 


Pisum sativum AAA-metalloprotease FtsH (FTSH) mRNA, complete 
cds; nuclear gene for mitochondrial product 


3.8 


5711 


XM 027365 


Homo sapiens ADP-ribosylauon fector-like 6 interacting protein 
(ARL6IP), mRNA 


1E-30 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


5712 


■ 

L01574 


Homo sapiens P-selectin CD62 (GRMP) gene, 5* end and promoter 
region 


0.00001 


5713 


AY023618 


Oryza sativa microsatellite MRG5943 containing (11 A)X55, closest 
to marker RG908, genomic sequence 


0.00009 


5714 


AE004377 


Vibrio cholerae chromosome n, section 34 of 93 of the complete 
chromosome ! 


0.58 


5715 


AL1 15472 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen 
deprivation 


0.2 


5716 


XM 042855 


Homo sapiens hypothetical gene supported by BC006369; BC008013 
(LOC92105), mRNA 


8E-23 


5717 


XM 015203 


Homo sapiens region containing deleted in azoospermia 2; deleted in 
azoospermia (LOC90902), mRNA 


1E-19 


5718 


AY019789 


Oryza sativa microsatellite MRG2114 containing (AT)X78, genomic 
sequence 


0.0000003 


5719 


AB010300 


Garlic virus A genomic RNA, complete sequence ! 


0.001 


5720 


AF171785 


Drosophila melanogaster clone 69 mRNA sequence 


0.73 


5721 


AF325188 


Phaseolus coccineus suspensor-specific protein (C541-like) gene, 
complete cds 


0.24 


5722 


XM_048692 


Homo sapiens similar to hypothetical protein (H. sapiens) 
(LOC92998), mRNA 


1E-16 


5723 


AE006878 


Sulfolobus solfataricus section 237 of 272 of the complete genome 


2.2 


5724 


AF038919 


Dictyostelium discoideum PslA (psLA) gene, complete cds 


0.026 


5725 


AF368257 


Bacillus thuringiensis plasmid encoded CrylBa (crylBa) gene, 
complete cds 


2.1 


5726 


AF148319 


Homo sapiens imprinting center, AS-SRO region 


1.8 


5727 


D49412 


Human gene for interleukin 3 receptor alpha subunit, exon 11 


0.23 


5728 


AJO 12488 


Mus musculus gene encoding serotonin receptor 5-HT2B, exons 1-3 


0.25 


5729 


AL445224 


Human DNA sequence from clone RP1 1-259P20 on chromosome 6, 
complete sequence [Homo sapiens] 


2 


5730 


AK025021 


Homo sapiens cDNA: FLJ21368 fis, clone COL03056, highly similar 
to AF158555 Homo sapiens glutaminase C mRNA 


e-171 


5731 


D38252 


Raphanus sativus dark-inducible and senescence-associated gene, 
exon 1 to 5 


0.62 


5732 


XM 051880 


Homo sapiens hypothetical gene supported by AK026792 
(LOC93533), mRNA 


5.6 


5733 


K00769 


yeast (s.pombe) 5s rrna gene and flanks, clone psprl 1 


0.023 | 


5734 


AL031254 


Caenorhabditis elegans cosmid 4R79, complete sequence 


0.082 


5735 


AK001945 


Homo sapiens cDNA FLJ11083 fis, clone PLACE1005232 


2.1 | 


5736 


XM 012452 


Homo sapiens ryanodine receptor 3 (RYR3), mRNA 


0.008 


5737 


XM 046442 


Homo sapiens hypothetical protein FLJ23045 (FLJ23045), mRNA 


6E-33 


5738 


AF072513 


Illex argentinus microsatellite Ia203 


0.69 


5739 


XM_050747 


Homo sapiens TClO-like Rho GTPase (TCL), mRNA 


5E-20 


5740 


AC079757 


Homo sapiens clone RP11-109N2, complete sequence 


0.62 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


5741 


XM_042128 


Homo sapiens KIAA1691 protein (KIAA1691), mRNA 


4.4 i 


5742 


NM_004342 


Homo sapiens caldesmon 1 (CALD1), mRNA 


4E-67 


5743 


BC002303 


Mus museums, clone IMAGE:3591199, mRNA 


0.69 


5744 


AF273045 


Homo sapiens CTCL tumor antigen sel4-3 mRNA, complete cds 


5E-28 


5745 


AE001798 


Thermotoga maritima section 1 10 of 136 of the complete genome 


0.71 


5746 


AF273225 


Homo sapiens Fanconi anemia complementation group D2 protein 
(FANCD2) gene, exon 7 


0.009 


5747 


AL589920 


Human DNA sequence from clone RP1 1-1 17A20 on chromosome 6, 
complete sequence [Homo sapiens] 


0.026 


5748 


AF101092 


Homo sapiens collagen type XI alpha- 1 (COL1 1 Al) gene, exon 29 


0.019 


5749 


S80521 


dMax=basic helix-loop-helix/leucine zipper protein {alternatively 
spliced} [mice, WEHI 231 B-lymphoma cells, mRNA Partial, 752 nt] 


0.003 


5750 


AF291825 


Gorilla gorilla von Hippel-Lindau tumor suppressor (VHL) gene, 
promoter region 


0.078 


5751 


AF120716 


Kluyveromyces lactis cytochrome c oxidase subunit HI (COX3) gene, 
mitochondrial gene encoding mitochondrial protein, complete cds 


0.027 


5752 


NM_001423 


Homo sapiens epithelial membrane protein 1 (EMP1), mRNA 


5E-31 


5753 


AB045575 


Danio rerio DNA, fkk-7c locus, insertion site of the medaka fish Tol2 
element 


L7 


5754 


AL050120 


Homo sapiens mRNA; cDNA DKFZp586D21 1 (from clone 
DKFZp586D211); partial cds 


2 


5755 


AP000468 


Homo sapiens genomic DNA, chromosome 21q22.3, clone:f60A9, 
complete sequence 


0.009 


5756 


AK001649 


Homo sapiens cDNA FLJ10787 fis, clone NT2RP4000481, weakly 
similar to ATP-DEPENDENT RNA HELICASE DOB1 


0 


5757 


NM_031511 


Rattus norvegicus Insulin-like growth factor II (somatomedin A) 
(Ig£2), mRNA 


0.025 


5758 


AL162036 


Homo sapiens mRNA; cDNA DKFZp434B249 (from clone 
DKFZp434B249) 


0.71 


5760 


AF3 17787 


Rattus norvegicus Cd36 gene, promoter region 


2.1 


5761 


XM 032021 


Homo sapiens calreticulin (CALR), mRNA 


6E-41 


5762 


XM_037847 


Homo sapiens neurexophilin 3 (NXPH3), mRNA 


4.2 


5763 


AF288818 


Rattus norvegicus cytochrome P450 4F5 gene, complete cds 


0.026 


5764 


AF269237 


Cloning vector pTX-GFP, complete sequence 


0.71 


5765 


AK021493 


Homo sapiens cDNA FLJ11431 fis, clone HEMBA1001094 


8E-76 


5766 


AF034077 


Equus caballus alpha- 1 -antitrypsin (Spi2) gene, complete cds 


0.0003 


5767 


XM_030851 


Homo sapiens G protein-coupled receptor kinase-interactor 1 (GIT1), 
mRNA 


5.4 | 


5768 


AY039919 


Arabidopsis thaliana unknown protein (AT4g09340) mRNA, 
complete cds 


4.8 
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SEQID 
NO 
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5769 


AK018211 


Mus musculus adult male medulla oblongata cDNA, RIKEN full- 
length enriched library, clone:6330510M09, full insert sequence 


6.4 


5770 


XM 006622 


Homo sapiens PTPRF interacting protein, binding protein 1 (liprin 
beta 1) (PPFEBP1), mRNA 


0.079 


5771 


U72335 


Human platelet activating factor acetylhydrolase, brain isoform, 45 
kDa subunit (LIS1) gene, exons 3 and 4 


0.2 


5772 


XM_048810 


Homo sapiens hypothetical protein FLJ22 116 (FU22116), mRNA 


0 


5773 


AF218257 


Arabidopsis thaliana 12-oxo-phytodienoate reductase gene, complete 
cds 


0.23 


5774 


AKO 14450 


Mus musculus 18 days pregnant adult female placenta and extra 
embryonic tissue cDNA, RIKEN foil-length enriched library, 
clone:3830422A13, full insert sequence 


0.083 


5775 


AK005342 


Mus musculus adult male cerebellum cDNA, RIKEN full-length 
enriched library, clone: 1500032H18, fall insert sequence 


1.6 


5776 


M99412 


Human interleukin-8 receptor (IL8RB) gene, complete cds 


0.063 


5777 


NC 001619 


Canine oral papillomavirus, complete genome 


1.8 


5778 


NM 030238 


Mus musculus dynein, cytoplasmic, heavy chain 1 (Dnchcl), mRNA 


0 


5779 


AL031254 


Caenorhabditis elegans cosmid 4R79, complete sequence 


0.084 


5781 


Z74797 


S.cerevisiae chromosome XV reading frame ORF YOL055c 


2.2 


5782 


XM 040841 


Homo sapiens cullin 5 (CUL5), mRNA 


0.23 


5783 


U20807 


Bos taurus protein tyrosine phosphatase BA14 mRNA, complete cds 


0.22 


5784 


AE006648 


Sulfolobus solfataricus section 7 of 272 of the complete genome 


0.007 


5785 


AYO 18666 


Oiyza sativa microsateUite MRG0991 containing (AT)X19, closest to 
marker R1713, genomic sequence 


0.008 


5786 


AK026299 


Homo sapiens cDNA: FLJ22646 fis, clone HSI07178 


0.23 


5787 


X91787 


L.luteus mRNA for tRNA-glutamine synthetase 


0.17 


5788 


XM_017281 


Homo sapiens endosulfine alpha (ENSA), mRNA 


| 1.9 


5789 


XM 017281 


Homo sapiens endosulfine alpha (ENSA), mRNA 


1.9 


5790 


! AF229090 


Homo sapiens myosin VI (MY06) gene, exon 11 


0.69 


5791 


AK001945 


Homo sapiens cDNA FLJ11083 fis, clone PLACE1005232 


2.1 


5792 


AF189157 


Foot-and-mouth disease virus (strain Ol) polyprotein gene, complete 
cds 


6.5 


5793 


AK013313 


Mus musculus 10, 11 days embryo cDNA, RIKEN full-length 
enriched library, clone:2810449G22, fall insert sequence 


6.1 


5794 


AJ005205 


Homo sapiens 5HT3 gene for serotonin 3 receptor 


O.oV 


5795 


NM 019083 


Homo sapiens hypothetical protein (FLJ10287), mRNA 


I 6E-63 


5796 


AK021451 


Homo sapiens cDNAFLJ11389 fis, clone HEMBA1000557 


2E-54 


5797 


Z34885 


T.pyriformis (CGL) gene for TCPlgamma protein 


0.6 


5798 


AF288408 


Rattus norvegicus transient receptor potential Trp4beta mRNA, 
complete cds 


0.081 


5799 


XM 015806 


Homo sapiens hypothetical protein FU22621 (FLJ22621), mRNA 


0 
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NO 
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5800 


] 

AF332918 j 


Plasmodium falciparum Indochina El/CDC erythrocyte binding 
protein BAEBL (baebl) gene, complete cds 


3.9 I 


5801 


3 

XM 052063 { 


Somo sapiens hypothetical protein DKFZp547 A023 
[DKFZp547A023), mRNA 


0 


5802 


AB013796 < 


Ceratopteris richardii mRNA for CRHB6, complete cds 


0.024 


5803 


] 

AF288408 < 


Rattus norvegicus transient receptor potential Trp4beta mRNA, 
complete cds 


J 

0.082 


5804 


AF174213 


rerebra subulata 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


2.8 


5805 


AF308286 


Homo sapiens serologically defined breast cancer antigen NY-BR-18 
mRNA partial cds 


0.59 


5806 


AF030199 


Mus musculus type 1 sigma receptor gene, complete cds 


0.49 I 


5807 


M99412 


Human interleukin-8 receptor (IL8RB) gene, complete cds 


0.51 ] 


5808 


Z34885 


T.pyriformis (CGL) gene for TCPlgamma protein 


0.58 


5809 


AF305539 


Mus musculus B lymphocyte induced maturation protein 1 (Prdml) 
gene, exon 8 and complete cds 


1E-67 


5810 


AF288408 


Rattus norvegicus transient receptor potential Trp4beta mRNA 
complete cds 


0.087 


5811 


AE001181 


Borrelia burgdorferi (section 67 of 70) of the complete genome 


0.001 1 


5812 


AY027894 


Homo sapiens voltage-dependent calcium channel beta 2 subumt 
(CACNB2) gene, exon 2 


0.00008 


5813 


XM 029481 


Homo sapiens similar to KIAA0732 protein (H sapiens) (LOC90154), 
mRNA 


0.68 | 


5814 


| AK021659 


Homo sapiens cDNA FLJ11597 fis, clone HEMBA1003856 


2.1 j 


5815 


\ U39721 


Mycoplasma genitalium section 43 of 51 of the complete genome 


0.19 


5816 


Z73267 


S cerevisiae chromosome XII reading frame ORF YLR095c 


0.009 j 


5817 


XM 029453 


Homo sapiens oculocerebrorenal syndrome of Lowe (OCRL), mRNA 


0.26 


5818 


AF027728 


Xenopus laevis kinesin-related protein (XCENP-E) mRNA complete 
cds 


J 

0.052 


5819 


AE001407 


Plasmodium falciparum chromosome 2, section 44 of 73 of the 
complete sequence 


0.18 


5820 


AL157491 


Homo sapiens mRNA; cDNA DKFZp434Kl 111 (from clone 
DKFZp434Kllll) 


1 

9E-50 


5821 


XM 002437 


Homo sapiens cAMP-regulated guanine nucleotide exchange factor n 
(CAMP-GEFH), mRNA 


4E-37 1 


5822 


XM_050146 


Homo sapiens KIAA0472 protein (KIAA0472), mRNA 


8E-20 


5823 


AY032600 


Carica papaya xyloglucan endo-transglycosylase mRNA complete cds 


0.19 


5824 


Z36061 


S.cerevisiae chromosome II reading frame ORF YBR192w 


3 


5825 


! AK021459 


Homo sapiens cDNAFLJ11397 fis, clone HEMBA1000622 


0.046 


5826 


AF261017 


Nicotiana tabacum putative chloroplast RNA helicase VDL (VDL) 
gene, exons 1 through 5 and complete CDS, alternatively spliced; 
nuclear gene for chloroplast products 


2.1 


5827 


1 AJ277162 


Beta vulgaris mRNA for putative Cdc2-related protein kinase CRK2 


! 0.009 1 
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SEQID 
NO 
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DESCRIP 


P VALUE 


5828 


AL390874 


Human DNA sequence from clone RP1 1-238K16 on chromosome 9, 
complete sequence [Homo sapiens] 


0.026 


5829 


S84872 


Solanum tuberosum type A phytochrome (c-phyA) mRNA, complete 
cds 


0.43 


5830 


AF271370 


Homo sapiens interphotoreceptor matrix proteoglycan 200 (IMPG2) 
gene, exon 10 


0.078 


5831 


XM 051183 


Homo sapiens embiyonic ectoderm development (EED), mRNA 


0.00002 


5832 


BC006523 


Homo sapiens, serum/glucocorticoid regulated kinase 2, clone 
IMAGE:2988475, mRNA 


3E-34 


5833 


AF100658 


Caenorhabditis elegans cosmid H08G01 


0.44 


5834 


XM 003725 


Homo sapiens putative ribonuclease HI (RNASE3L), mRNA 


0.008 


5835 


M58029 


Amoeba proteus symbiotic bacterixnn 29kDa protein (s29x) gene, 
complete cds 


0.71 


5836 


U46541 


Staphylococcus aureus sarA gene, complete cds 


0.003 


5837 


AJ225333 


Acidianus ambivalens plasmid pDLlO 


0.17 


5838 


X15702 


Fruitfly LA9 mRNA for DNA binding protein 


0.077 


5839 


AL031590 


Human DNA sequence from clone CTA-232D4 on chromosome 
22ql3.1 Contains a GSS, complete sequence (Homo sapiens] 


0.16 


5840 


X07946 


Yeast plasmid DNA coding for RNA polymerase subunit 


0.082 


5841 


AL590324 


Human DNA sequence from clone RP1 1-479D1 1 on chromosome 10, 
complete sequence [Homo sapiens] 


0.082 


5842 


XM 004947 


Homo sapiens paraoxonase 2 (PON2), mRNA 




5843 


AF262582 


Dolichorhinotermes sp. 'Manaus 1 16S mitochondrial ribosomal RNA 
gene, partial sequence; mitochondrial gene for mitochondrial product 


0.067 


5844 


L81839 


Homo sapiens (subclone 2_h3 from PI H43) DNA sequence, complete 
sequence 


0.00006 


5845 


U67550 


Methanococcus jannaschii section 92 of 150 of the complete genome 


0.087 


5846 


XM_044987 


Homo sapiens hypothetical gene supported by U00951 (LOC92415), 
mRNA 


6.1 


5847 


AE001420 


Plasmodium falciparum chromosome 2, section 57 of 73 of the j 
complete sequence 


0.71 


5848 


AF3 15823 


Stylonychia mytilus DNA-dependent RNA polymerase II largest 
subunit RPB1 (RPB1) gene, partial cds 


0.51 


5849 


AB066207 


Physcomitrella patens chloroplast genes for ribulose-l,5-bisphosphate 
carboxylase/oxygenase large subunit, tRNA-Arg and acetyl-CoA 
carboxylase beta subunit, complete cds 


0.65 


5850 


AF286898 


Plasmodium chabaudi adami carbamoyl phosphate synthetase n gene, 
partial cds 


0.002 


5851 


AL583888 


Human DNA sequence from clone RP11-132E18 on chromosome 10, 
complete sequence [Homo sapiens] 


0.028 


5852 


AB011811 


Chromatium vinosum genes for light-harvesting and reaction center 
proteins, partial and complete cds 


6.4 


j 5853 


AF153448 


Zea mays nitrate reductase (NR1) gene, complete cds 


0.067 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


5854 


XM 012762 


Homo sapiens hypothetical protein FLJ10891 (FLJ10891), mRNA 


0 


5855 


AF248814 


Maoricicada cassiope isolate TB-MI-004 tRNA-Asp gene, complete 
sequence; ATPase subunit 8 gene, complete cds; and ATPase subunit 
6 gene, partial cds; mitochondrial genes for mitochondrial products 


0.029 


5856 


D01021 


Anthocidaris crassispina mRNA for dynein beta-heavy chain, 
complete cds 


2E-22 


5857 


AE001401 


Plasmodium falciparum chromosome 2, section 38 of 73 of the 
complete sequence 


0.54 


5858 


BC007548 


Homo sapiens, clone IMAGE:2959994, mRNA 


1.9 


5859 


AB063065 


Macaca fascicularis brain cDNA clone:QmoA- 12555, full insert 
sequence 


0.18 


5860 


NMJ)08538 


Mus musculus myristoylated alanine rich protein kinase C substrate 
(Macs), mRNA 


1.5 


5861 


XM_029266 


Homo sapiens sarcoglycan, epsilon (SGCE). mRNA 


0.66 


5862 


AE006685 


Sulfolobus solfataricus section 44 of 272 of the complete genome 


0.37 


5863 


BC008007 


Homo sapiens, clone IMAGE.3510538, mRNA 


6E-34 


5864 


AB024511 


Nicotiana tabacum mRNA for TMV response-related gene product, 
complete cds 


2.1 


5865 


U25027 


Glycine max phosphatidylinositol-specific phospholipase C mRNA, 
complete cds 


0.077 


5866 


U50715 


Mus musculus alpha-galactosidase A gene, complete cds 


0.024 


5867 


U64465 


Stylophora pistillata L-type calcium channel alpha- 1 subunit 
(STPCACHL) mRNA, complete cds 


0.072 


5868 


XM_040043 


Homo sapiens similar to hypothetical protein DKFZp566D1346 (H. 
sapiens) (LOC91690), mRNA 


0.023 


5869 


U60135 


Arabidopsis thaliana serine/threonine protein phosphatase 2 A-3 
catalytic subunit gene, complete cds 


0.027 


5870 


AF275745 


Lycopersicon esculentum plasma membrane H+-ATPase (LHA2) 
mRNA, complete cds 


1.5 


5871 


Z29521 


C.crispus Chloroplast gene encoding 16S rRNA, tRNA-Ile, tRNA- 
Ala, and 23S rRNA (partial) 


0.003 


5872 


AJ132557 


Oryctolagus cuniculus CYP19 gene, ovarian promoter region 


0.24 


5873 


AF124524 


Arabidopsis thaliana gamma-adaptin 1 gene, complete cds 


0.24 


5874 


AK013131 


Mus musculus 10, 1 1 days embryo cDNA, RIKEN full-length 
enriched library, clone:2810422B04, full insert sequence 


0.72 


5875 


AB044413 


Sus scrofa MMP-3 mRNA for matrix metalloproteinase-3, partial cds 


0.23 


5876 


AF247039 


Porcine adenovirus 3 171R (El A), 202R (E1B-1), 474R (E1B-2), 
288R(288R), 198R(pIX), 163R* (163R*), 162R(162R), 97R(97R) 
and 184R (184R) genes, complete cds; and IVa2 (IVa2) gene, partial 
cds 


5.2 


5877 


AF060579 


Gossypium barbadense clone pXP020 repetitive DNA sequence 


0.055 


5878 


AE006499 


Streptococcus pyogenes Ml GAS strain SF370, section 28 of 167 of 
the complete genome 


2.1 
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SEQID 

1 NO 


ACCESSN 


DESCRIP 


P VALUE 


5879 


AL591498 


Human DNA sequence from clone RP11-113L12 on chromosome 13, 
complete sequence [Homo sapiens] 


0.003 


5880 


NM 017066 


Rattus norvegicus Pleiotrophin (Heparine binding factor, Hbnf, in the 
mouse) (Ptn), mRNA 


0.21 


5881 


XM 047505 


Homo sapiens hypothetical gene supported by AK022483 
(LOC92818), mRNA 


0.7 


5882 


J00912 


chicken brain tubulin, alpha chain mrna 


0.24 


5883 


M15840 ! 


Human interleukin 1-beta (IL1B) gene, complete cds 


0.08 


5884 


X92683 


Rsapiens DNase I hypersensitive site (HSS-1) 


2.1 


5885 


AL390767 


Human DNA sequence from clone RP1-68P15 on chromosome llpl3- 
14.2 Contains GSSs and ESTs. Contains part of a novel gene, 
complete sequence [Homo sapiens] 


1.8 


5886 


XM 012386 


Homo sapiens KIAA0377 gene product (KIAA0377), mRNA 


1E-48 


5887 


AL137466 


Homo sapiens mRNA; cDNA DKFZp434H1322 (from clone 
DKFZp434H1322) 


0.00001 


5888 


X62745 


Z.mays OBF1 mRNA for ocs-element binding factor 1 


0.23 


5890 


AB048902 


Macaca fascicularis brain cDNA, clone:QnpA-17448 


0.24 


5892 


AF221070 


Ophiostoma piliferum strain CBS129.32 18S ribosomal RNA gene, 
partial sequence; internal transcribed spacer 1, 5.8S ribosomal RNA 
gene, and internal transcribed spacer 2, complete sequence 


0.003 


5893 


AE006530 


Streptococcus pyogenes Ml GAS strain SF370, section 59 of 167 of 
the complete genome 


0.22 


5894 


AB018698 


Broad bean wilt virus 2 gene for precursor polyprotein, complete cds, 
isolate: IP 


0.025 


5895 


U67537 


Methanococcus jannaschii section 79 of 150 of the complete genome 


0.026 


5896 


AJ000742 


Homo Sapiens hisHl gene, 5' UTR 


0.008 


5897 


AF247039 


Porcine adenovirus 3 171R (E1A), 202R (E1B-1), 474R (E1B-2), 
288R(288R), 198R(pIX), 163R* (163R*), 162R(162R), 97R(97R) 
and 184R (184R) genes, complete cds; and IVa2 (IVa2) gene, partial 
cds 


5.2 


5898 


AJ290290 


Asphodeline lutea chloroplast trnL-trnF intergenic spacer 


0.23 


5899 


XM 042440 


Homo sapiens UDP-Gal:betaGlcNAc beta 1,4- galactosyltransferase, 
polypeptide 3 (B4GALT3), mRNA 


6.4 


5900 


U78090 


Rattus norvegicus potassium channel regulator 1 mRNA, complete cds 


2.1 


5901 


NM 017053 


Rattus norvegicus Neuromedin K receptor (Neurokinin B/Tachikin 3) 
(Tac3r), mRNA 


1.9 


f AAA 

5902 


AF208658 


Tapesia yallundae eburicol 14 alpha-demethylase (CYP51) gene, 
complete cds 


6.2 


5903 


AF281786 


HIV-1 isolate DK-g3 from USA gag polyprotein (gag) gene, partial 
cds 


0.63 


5904 


AJ011019 


Capra hircus csn2 gene, exons 1 to 9, allele 0(null) 


2 


5905 


AJ3 11054 


Chironomus pallidivittatus 4L ORF for putative recombinase 
subtelocentric clone, Cp5.5 


0.023 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


5906 


Y17466 


Fugu rubripes SIAT3C gene, last three exons 


0.65 ! 


5907 


AK022127 


Homo sapiens cDNA FLJ12065 fis, clone HEMBB 1002249 


0.001 


5908 


X79100 


C.vicina (Bl 1) mRNA for aiylphorin receptor 


0.6 


5909 


AK023079 


Homo sapiens cDNA FLJ13017 fis, clone NT2RP3000628 


0.23 


5910 


AK014448 


Mus musculus 18 days pregnant adult female placenta and extra 
embryonic tissue cDNA, RIKEN full-length enriched library, ' 
clone:3830421J05, full insert sequence 


1.4 


5911 


AK020293 


Mus musculus adult male cecum cDNA, RIKEN full-length enriched 
library, clone:9130404Hll, full insert sequence 


0.23 


5912 


AE006285 


Lactococcus lactis subsp. lactis IL1403 section 47 of 218 of the 
complete genome 


0.25 


5913 


U41292 


Human prolargin (PRELP) gene, 5* flanking sequence and exon 1 


0.001 


5914 


AL109707 


Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 
295344 


0.00004 


5915 


NM_024154 


Rattus norvegicus proton gated cation channel ASIC1 (Accn2), 
mRNA 


6.2 


5916 


U79732 


Plasmodium berghei extrachromosomal plastid PB-2, tRNA-Pro, 
tRNA-Glu, tRNA-Lys, tRNA-Asp, tRNA-Ser, tRNA-Tyr, tRNA-Met, 
tRNA-Leu, tRNA-Cys, and tRNA-His genes, complete sequence, rps4 
gene, complete cds, tRNA-Thr gene, complete sequence, and large 
subu> 


0.023 


5917 


AF098715 


Coniglobus mercatorius 16S ribosomal RNA gene, mitochondrial 
gene encoding mitochondrial product, partial sequence 


0.095 


5918 


AF095904 


Toxoplasma gondii yc£24 protein (yc£24) gene, partial cds; DNA 
dependent RNA polymerase beta subunit (rpoB) gene, complete cds; 
and DNA dependent RNA polymerase beta 1 subunit (rpoCl) gene, 
plastid genes encoding plastid proteins, partial cds 


0.026 


5919 


XM_027416 


Homo sapiens mesothelin (MSLN), mRNA 


0.7 


5920 


BC004772 


Mus musculus, Similar to inhibitor of kappa light polypeptide gene 
enhancer in B-cells, kinase beta, clone IMAGE:3256890, mRNA 


1.6 


5921 


AK000935 


Homo sapiens CDNAFLJ10073 fis, clone HEMBA100 1731 


0.18 


5922 


BC009942 


Homo sapiens, clone MGC: 12595 IMAGE:4303422, mRNA, 
complete cds 


0.001 


5923 


J05008 


Homo sapiens endothelin-1 (EDN1) gene, complete cds 


0.5 


5924 


AE002174 


Chlamydophila pneumoniae AR39, section 10 of 94 of the complete 
genome 


0.69 


5925 


AJ223616 


Caehorhabditis elegans mRNA for calcium ATPase 


0.7 


5926 


AF3 10894 


DictyosteUum discoideum prespore-specific protein (pspC) gene, 
partial cds; RacH (racH) gene, complete cds; and BOP (bopA) gene, 
partial cds 


0.027 


5927 


AE003910 


Xylella fastidiosa 9a5c, section 56 of 229 of the complete genome 


0.73 


5928 


AF335424 


Homo sapiens sperm protein Spl7-2 pseudogene, complete sequence 


1.8 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


5929 


XM 016924 


Homo sapiens hypothetical protein MGC4054 (MGC4054), mRNA 


4.3 | 


5930 


U62057 


Mycoplasma capricolum NADH oxidase (naox) gene, partial cds, and 
lipoate-protein ligase (lpla), pyruvate dehydrogenase EI alpha subunit 
(odpa), pyruvate dehydrogenase EI beta subunit (odpb), pyruvate 
dehydrogenase EE (odp2), dihydrolipoamide dehydroge> 


0.72 


5931 


AF202552 


Homo sapiens DNA methyltransferase (DNMT1) gene, exons 2, 3, 
and 4 


2.2 


5932 


AF010604 


Homo sapiens SMAD5 (Smad5) gene, exon 3 


0.0001 


5933 


AC084157 


Caenorhabditis elegans cosmid Y46E12BR, complete sequence | 


0.71 


5934 


XM 027907 


Homo sapiens hypothetical protein DKFZp564D 172 
(DKFZP564D172), mRNA 


0.18 


5935 


AF360191 


Arabidopsis thaliana unknown protein (T9C5.150/AT3g49560) 
mRNA complete cds 


6.3 


5938 


AF282728 


Culex pipiens Twin-Cp5 SINE retroposon, genomic sequence and v- 
SNARE-like gene, partial sequence 


0.066 


5939 


M81702 ; 


Candida boidinii methanol oxidase (AOD1) gene, complete cds 


0.008 i 


5940 


AF057740 


Homo sapiens 17-beta-hydroxysteroid dehydrogenase IV (HSD17B4) 
gene, exon 24 and complete cds 


0.21 


5941 


XM 031463 


Homo sapiens exonuclease NEF-sp (LOC81691), mRNA 


0.072 


5942 


AB022048 


Mus musculus gene for prolyl oligopeptidase, exon 3, 4, 5, 6 


2.1 


5943 


Z48165 


Kviolacea chloroplast genes for alpha and beta subunit of 
phycocyanin 


2.1 


5944 


AF181251 


Rattus norvegicus lung Kruppel-like factor (Lklf) gene, complete cds 


5.9 


5945 


AP001057 


Homo sapiens genomic DNA, chromosome 21, clone:D40Gll, MX1- 
D21S171 region, complete sequence 


0.00001 


5946 


AK016374 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
Ubrary, clone:4930589L23, full insert sequence 


5.9 


5947 


M27697 


Pig urate oxidase mRNA, complete cds 


2.1 j 


5948 


XM_032603 


Homo sapiens similar to DESTRIN (ACTTN DEPOLYMERIZING 
FACTOR) (ADF) (H. sapiens) (LOC90563), mRNA 


0.69 


5949 


U39688 


Mycoplasma genitalium section 10 of 5 1 of the complete genome 


0.077 


5950 


AF320250 


Lactobacillus delbrueckii YmdA (ymdA) gene, partial cds; and 
putative undecaprenyl-phosphate N-acetyl-glucosaminyl transferase 
(rgpG), YvyE (yvyE), putative ComFl protein (comFl), putative 
ComF3 protein (comF3), YvyD (yvyD), putative preprotein translo> 


0.69 


5951 


Y09852 


H.sapiens FGFR3 gene, partial 


2E-10 


5952 


Z79103 


H.sapiens flow-sorted chromosome 6 Hindm fragment, SC6pA22Fll 


0.00002 


5953 


AP000321 


Homo sapiens genomic DNA, chromosome 21q22.1, D21S226-AML 
region, clone:Q82F5, complete sequence 


0.00004 


5954 


AK025042 


Homo sapiens cDNA: FLJ21389 fis, clone COL03455 


2E-66 
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SEQID 
NO 
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P VALUE 


5955 


U14576 


Dictyostelium discoideum filopodin (talA) gene, complete cds 


0.21 


5956 


AE001405 


Plasmodium falciparum chromosome 2, section 42 of 73 of the 
complete sequence 


0.052 


5957 


M64861 


D.discoideum TFHD mRNA, complete cds 


0.002 j 


5958 


AJ245435 


Plasmodium falciparum gc-alpha gene for guanylyl cyclase alpha 


0.0001 


5959 


XM 031443 


Homo sapiens phosphodiesterase 8 A (PDE8A), mRNA 


8E-13 


5960 


AC079875 


Homo sapiens clone CTD-2333B13, complete sequence 


0.00003 


5961 


AF275272 


Schizophyllum commune imknown gene; mitochondrial gene for 
mitochondrial product 


0.44 


5962 


Z60119 


H.sapiens CpG island DNA genomic Msel fragment, clone 18el, 
reverse read cpgl8el.rtlb 


0.008 


5963 


AF100654 


Caenorhabditis elegans cosmid C24E9 


0.026 


5964 


AE001399 


Plasmodium falciparum chromosome 2, section 36 of 73 of the 
complete sequence 


0.001 


5965 


U89019 


Hepatitis C virus polyprotein gene, complete cds 


0.2 


5966 


XM_052512 


Homo sapiens ARP3 (actin-related protein 3, yeast) homolog 
(ACTR3), mRNA 


2E-32 


5967 


U17634 


Human chromosome 21, Down syndrome critical region transcript, 
SP6 end of clone 3-3-f8 


0.7 


5968 


AF197477 


Homo sapiens TRBP pseudogene, partial sequence 


0.22 


5969 


U78599 


Streptococcus mutans putative D,D-carboxypeptidase and putative N- 
acetyl-muramidase genes, partial cds 


2.1 


5970 


AP001414 


Homo sapiens genomic DNA, chromosome 21q22.2, clone:D30, 
LB7T-ERG region, complete sequence 


2.1 


5971 


AJ277067 


Homo sapiens partial SSH3BPl/e3Bl gene for spectrin SH3 binding 
protein, exon3-5 


5.7 


5972 


X13294 


Human mRNA for myb-related gene A-myb 5*-iegion 


6E-74 


5973 


NC_001796 


Human parainfluenza virus 3, complete genome 


0.053 


5974 


AJ222787 


Hordeum vulgare mRNA for alpha-keto acid dehydrogenase-like 
protein, clone RG136 


0.073 


5975 


AF284034 


Melanogrammus aeglefmus vitellogenin B (VtgB) mRNA, complete 
cds 


2.1 


5976 


AF127565 


Arabidopsis thaliana ubiquitin-protein ligase 2 (UPL2) gene, complete 
cds 


4.3 


5977 


AP001419 


Homo sapiens genomic DNA, chromosome 21q22.2, clone:PAC24K9, 
LB7T-ERG region, complete sequence 


0.0002 


5978 


AL590384 


Human DNA sequence from clone RP1 1-349 A16 on chromosome 
Xq22.3-24, complete sequence [Homo sapiens] 


0.02 


5979 


AB014460 


Homo sapiens TSC2, NTHL1/NTH1 and SLC9A3R2/E3KARP genes, 
partial and complete cds 


0.69 


5980 


XM 018334 


Homo sapiens hypothetical protein FLJ22418 (FLJ22418), mRNA 


0.026 


5981 


AF229850 


Arabidopsis thaliana ferritin (Ferl) gene, complete cds 


0.69 


5982 


AKO 17445 


Mus musculus 10 days neonate head cDNA, RJKEN full-length 
enriched library, clone:5530402H04, full insert sequence 


2.1 


5983 


AF197159 


Mus musculus cubilin mRNA, partial cds 


0.7 
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NO 
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5984 


XM 029346 


Homo sapiens hypothetical gene supported by U79302; AK026607 
(LOC90138), mRNA 


0.026 


5985 


AK021517 


Homo sapiens CDNAFLJ11455 fis, clone HEMBA1001497 


0.05 


5986 


Z74823 


S.cerevisiae chromosome XV reading frame ORF YOL081w 


0.23 


5987 


XM 003091 


Homo sapiens KIAA0001 gene product; putative G-protein-coupled 
receptor; G protein coupled receptor for UDP-glucose (K1AA0001), 
mRNA 


1.9 


5988 


XM 039450 


Homo sapiens similar to NONHISTONE CHROMOSOMAL 
PROTEIN HMG-14 (H. sapiens) (LOC91606), mRNA 


4E-47 


5989 


XM 045103 


Homo sapiens G protein coupled receptor interacting protein, 
complement-clq tumor necrosis factor-related (ZSIG37), mRNA 


0.23 


5990 


AK026086 


Homo sapiens cDNA: FLJ22433 fis, clone HRC09172 


0.078 


5991 


Z22800 


H.sapiens microsatellite repeat 


3E-45 


5992 


AF165136 


Mycoplasma mycoides mycoides LC hypothetical surface located 
membrane protein and lipoprotein B precursor (lppB) genes, partial 
cds 


0.23 


5993 


AF269465 


Staphylococcus epidennidis strain SRI clone step.l004h05 genomic 
sequence 


0.23 


5994 


XM 049474 


Homo sapiens KIAA1538 protein (KIAA1538), mRNA 


0.23 


5995 


AC001478 


Homo sapiens (subclone 1 Ji7 from BAC HI 12) DNA sequence, 
complete sequence 


0.63 


5996 


AF380163 


Influenza A virus (A/Goose/Guangdong/3/97(H5Nl)) segment 3 
polymerase (PA) gene, complete cds 


0.077 


5997 


Z74263 


S.cerevisiae chromosome IV reading frame ORF YDL215c 


0.36 


5998 


AF282059 


Homo sapiens clone 2qtel edcl7fil sequence 


7.9 


5999 


AF233898 


Escherichia coli strain KI1218 BfpA (bfpA) gene, partial cds 


0.073 


6000 


M92086 


Wasp venom protein mRNA sequence 


1.7 


6001 


XM 006601 


Homo sapiens hypothetical protein FLJ10659 (FLJ10659), mRNA 


0 


6002 


D14074 


Bovine mRNA for cyclophilin, complete cds 


0.23 


6003 


AC079391 


Homo sapiens clone CTD-2243M11, complete sequence 


4 


6004 


AE006055 


Pasteurella multocida PM70 section 22 of 204 of the complete genome 


0.077 


6005 


Y09000 


Rnorvegicus mRNA for dendrin 


| 0.18 


6006 


AK006991 


Mus musculus adult male testis cDN A, KlKJiN tuii-iengtn enncnea 
library, clone: 1700084F23, full insert sequence 


2.1 


6007 


AF319636 


Pelargonium peltatum cultivar Guenievre microsatellite Pp3a 
sequence 


0.001 


6008 


D 17668 


Clostridium septicum gene for alpha-toxin, complete cds 


0.077 


6009 


AK025193 


Homo sapiens cDNA: FLJ21540 fis, clone COL06156 


0.025 


6010 


X17191 


E gracilis chloroplast RNA polymerase rpoB-rpoCl-rpoC2 operon 


0.22 
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SEQID 
NO 


N 


DESCRIPTION 


P VALUE 


4 


12721131 


gi|iz/zll^l|go|AAKUzy(J5.1| (AEOuolzl) unknown 
[Pasteurella multocida] 


0.14 


6 


i <r\Af\i & 

IDUH-UIO 


gi|1504016|dbj|BAA13207.1| (D86971)no similarities 
to reported gene products [Homo sapiens] 


A AAAO 

0.0003 


7 


12838708 


gi|12838708|dbj|BAB24303.1| (AK005889) putative 
[Mus musculus] 


5E-74 


8 


12854246 


gi|12854246|dbj|BAB29972.1| (AK015778) putative 
[Mus musculus] 


2.2 


9 


1/1/71 ^^7/1 


gi|14715574|dbj|BAB6204Ll| (AB065434) PRPK 
mutant 1 [Homo sapiens] 


/ft OA 


10 


oo /<$yo4 


gi|6678964|ref|NP_032671.1|murinoglobulin 1 [Mus 
musculus] sp|P28665|A2Ml JVIOUSE 
MURINOGLOBULIN 1 PRECURSOR (MUG1) 
gb|AAA73048.1| (M65736) [Mouse murinoglobulin 
mRNA, complete cds.], gene product [Mus musculus] 


4.2 


14 


11351090 


gi| 1 135 1090|pir||E82999 probable beta-ketoacyl 
synthase PAS 174 [imported] - Pseudomonas aeruginosa 
(strain PAOl) gb|AAG08559.1|AE004930_5 
(AE004930) probable beta-ketoacyl synthase 
[Pseudomonas aeruginosa] 


9.1 


15 


9506761 


gi|9506761|ref|NP_062298. 1| glucosamine-phosphate N- 
acetyltransferase; gJucosamine-6-phosphate 
acetyltransferase [Mus musculus] emb|CAA04463.1| 
(AJ001006) EMeg32 protein [Mus musculus] 
dbj|BAB22120.1| (AK002466) putative [Mus musculus] 
dbj|BAB25161.1| (AK007647) putative [Mus musculus] 
dbj|BAB25212.1| (AK007722) putative [Mus musculus] 
dbj|BAB25240.1| (AK007764) putative [Mus musculus] 
aDj|Jt>/\±>ZD (AJvuuoOoo) putative [Mus musculus] 
dbj|BAB27395.1| (AK011098) putative [Mus musculus] 
dbj)BAB31241.1| (AK018499) putative [Mus musculus] 


1E-84 


20 


9369403 


gi|9369403|gb|AAF87151.1|AC002423_16(AC002423) 
T23E23.19 [Arabidopsis thaliana] 


7.4 


24 


12862430 


gi|12862430|dbj|BAB32466.1| (AB047280) Gag-like 
protein [Tricholoma matsutake] 


2.6 
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25 


7434102 


gi|7434102|pir||H71620 aspartate transaminase (EC 
2.6.1.1) PFB0200c [similarity] - malaria parasite 
(Plasmodium falciparum) gb|AAC71829. 1| (AE001380) 
aspartate aminotransferase [Plasmodium falciparum] 


0.73 


26 


14721245 


gi|14721245|ref|XP_044013.1| kinesin-like protein 2 
[Homo sapiens] 


8E-91 


27 


lOUO / 1 J 


gi|9506713|reflNP_061856.1| nucleolar protein femily 
A, member 1; H/ACA small nucleolar RNPs protein 1 
Homo sapiens] ref|XP_003601.1| nucleolar protein 
amily A, member 1 (H/ACA small nucleolar RNPs) 
Homo sapiens] ref|XPJ)39016.1| nucleolar protein 
amily A, member 1 (H/ACA small nucleolar RNPs) 
Homo sapiens] rei|Njr_lZ/4oU.i| nucleolar proiem 
kmily A, member 1; H/ACA small nucleolar RNPs 
protein 1 (Homo sapiens] emb|CAB76563.1| 
;AJ276003) GAR1 protein [Homo sapiens] 

rrKIA AT-rn^Al^ 1 1A AWfRdl^ fRPfHV^I ^ nucleolar 

protein family A, member 1 (H/ACA small nucleolar 

RNPc\ rHrvmo «u^nif*Ti<il 


4 


28 


14150082 


gi)14150082|reflNP_115691.1| hypothetical protein 

MnrAIQQ rPTnmn Q5mif*nQl crhlAAH04991 1IAAH04991 

(BC004991) Similar to RUCEN cDNA 5730438N18 
gene [Homo sapiens] 


0.001 


29 


12861366 


cril1?R61^66ldbilBAB32182 11 (AK020701) putative 
[Mus musculus] 


5E-42 


32 


14739173 


gi|14739173|reflXP 03753 1.1| 59395 Porno sapiens] 


3E-23 


36 


/D JO J J£ 


ml7^ 1 ^S79lnirll^794^R nhn<5nhatidvl<?erine 
decarboxylase (EC 4.1.1.65) precursor, mitochondrial - 

\w/JJLUJLC2>w lldXlioLw 


e-112 


37 


7488381 


gi|7488381|pir)|T08408 transcription factor homolog 
_r 1 odj . l ju - .Arauiuopsis inaiiniia cmu|\^/iu*t*'7 iu. i| 
(AL049862) transcription factor-like protein 
[Arabidopsis thaliana] 


8.1 


38 


12849752 


gi|12849752|dbj|BAB28466.1| (AK0 12782) putative 
[Mus musculus] 


5E-89 


39 


14715574 


gi|14715574|dbj|BAB62041.1| (AB065434) PRPK 
mutantl [Homo sapiens] 


1E-88 


40 


12832845 


gi|12832845|dbj|BAB22281.1| (AK002682) putative 
[Mus musculus] 


4E-51 
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SEQID 
NO 


ACCESS 
N 
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P VALUE 


43 


12832845 


gi|12832845|dbj|BAB22281.1| (AK002682) putative 
[Mus musculus] 


6E-45 


47 


11558040 


gi|l 1558040|emb|CAC17800. 1| (AJ245890) 
hypothetical protein [Polymyxa betae] 


7.6 


48 


8052306 


gi|8052306|emb|CAB92239.1| (AJ242721) AEP37 
protein [Mus musculus] 


1.3 


49 


13812380 


gi| 138 123 80|reflNP_l 1 3498. 1 1 N-myristoyltransferase 
[Guillardiatheta] emb|CAC27067.1| (AJ010592)N- 
myristoyltransferase [Guillardia theta] 


7 


51 


8885549 


gi|8885549|dbj|BAA97479.1| (AB025604) telomere 
repeat-binding protein [Arabidopsis thaliana] 


1.5 


52 


14727244 


gi|14727244|ref|XP_035596.1| hypothetical protein 

AP'lJ.n??^ rHnmn <jar»ienQl rpfl'XP 11 4Q489 

AT 1*tv^aJ (_X1\JIIMJ acLJJICIlaj ICtpVT vJJjy/.l| tytOz, 

[Homo sapiens] gb|AAH07829.1|AAH07829 
(BC007829) Similar to hypothetical protein AF 140225 
[Homo sapiens] 


6E-58 


57 


14737646 


fHll47^7646lrefnXP 018048 11 similar tn APTCAL 
ENDOSOMAL GLYCOPROTEIN PRECURSOR (R. 
norveeicusl fHomo saoiensl 


0.003 


58 


13811699 


gi|13811699|refpSfP_109668.1| zinc finger protein 313; 

7'inn "fincxer nrotein 998 rMnQ tniiQeiilii^l 

fill lv J-LLi^wX piVLVlil Z*£*\J |^-LV_» Uo lilUDwUiUijj 

gblAAG01141.1|AF282919_l (AF282919) Zfp228 
[Mus musculus] 


0.00002 


59 


14042394 


gi|14042394|dbj|BAB55226.1| (AK027604) unnamed 
protein product [Homo sapiens] 


e-112 


60 


14769036 


gi|14769036|reflXPJ)36091.1| 19140 [Homo sapiens] 


9E-97 


61 


13648339 


gi|13648339|ref|XP_003 184.2| zinc finger protein 
ANCJ2H01 [Homo sapiens] 


0.0006 


64 


13638702 


ffill36387021reflXP 015971 11 hvnothetical nrotein 
FLJ23384 |Homo sapiens] refpflP_048858.il 
hypothetical protein FLJ23384 [Homo sapiens] 


2E-86 


I 65 


14150082 


gj|14150082|ref|NP_115691.1| hypothetical protein 
MGC4399 Porno sapiens] gb|AAH04991.1|AAH04991 
(BC004991) Similar to RIKEN cDNA 5730438N18 
gene [Homo sapiens] 


0.00003 


67 


14727244 


gi|14727244|reflXP_035596.1| hypothetical protein 
AF140225 [Homo sapiens] ref|XP_035597.1| 49482 
[Homo sapiens] gb|AAH07829.1|AAH07829 
(BC007829) Similar to hypothetical protein AF 140225 
[Homo sapiens] 


2E-38 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


68 


14727244 


mll4727244IreflXP 035596 l! hvnothetical Drotein 
AF140225 [Homo sapiens] ref|XP_035597.1| 49482 
[Homo sapiens] gb|AAH07829.1|AAH07829 
[BC007829) Similar to hypothetical protein AF140225 
Homo sapiens] 


6E-60 


69 


14715574 


gi|14715574|dbj|BAB62041.1| (AB065434) PRPK 
mutant 1 [Homo sapiens] 


2E-98 


70 


7491888 


gi|7491888|pir||T41272 hypothetical protein 

05 - fission veast t^Schizosaccharomvces 
pombe) emb|CAB40785.1| (AL049609) hypothetical 
protein [Schizosaccharomyces pombe] 


9.3 


71 


12843000 


gi|12843000|dbj|BAB25820.1| (AK008666) putative 
[Mus musculus] 


1E-86 


72 


13542310 


gi| 135423 10|ref]NP 111998.1] Uncharacterized 
conserved protein [Thermoplasma volcanium] 


6.6 


73 1 


14728906 


gi|14728906|ref|XPJ)03327.3| hypothetical protein 
FLJ10858 [Homo sapiens] 


7E-98 


74 


14749106 


gi|14749106|ref|XP_03 1512.1) 14599 [Homo sapiens] 
dbj|BAB15298.1| (AK025963) unnamed protein product 
[Homo sapiens] 


1E-26 


76 


1176169 


gi|l 176169|sp|P28636IYHBE_ECOLI 
HYPOTHETICAL 35.0 KDA PROTEIN IN DACB- 
RPMA INTERGENIC REGION pir||B65 109 
hypothetical 35. OK protein (dacB-rpmA intergenic 
region; - Jtiscnencnia con gu\j\j\r\D i yoD . 1 1 \\j 1 077 / y 
ORFJ321 [Escherichia coli] gb|AAC76216.1| 
(AE000399) or£ hypothetical protein [Escherichia coli 
K171 <rhlAAG58318 1IAE005547 4 fAE005547) or£ 
hypothetical protein [Escherichia coli 0157:H7 
EDL933] dbj|BAB37486.1| (AP002564) hypothetical 
protein [Escherichia coli 0157:H7] 


0.23 


77 


13161357 


gi|13161357|dbj|BAB32948.1| (AP002908) hypothetical 
protein—similar to Arabidopsis thaliana F1P2.140 
[Oryza sativa] 


6.3 


78 


8922806 


gj|8922806|ref|NP_060761.1| hypothetical protein 
FLJ10986 [Homo sapiens] dbj|BAA91940.1| 
(AK001848) unnamed protein product (Homo sapiens] 
gb|AAH00610.1|AAH00610 (BC000610) hypothetical 
protein FLJ10986 [Homo sapiens] 


5E-83 


82 


14603356 


gi|14603356|gb|AAH10136.1|AAH10136 (BC010136) 
Unknown (protein for MGC: 19513) [Homo sapiens] 


1E-74 
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83 


12804667 


gi| 128 04667|gb| AAH0 1 75 8 . 1 |AAH0 1 75 8 (BC00 1 75 8) 
hypothetical protein FLJ20758 [Homo sapiens] 


0.000000005 


Z7\J 


9795608 


gi|9795608|gb|AAF98426. 1|AC021044_5 (AC021044) 
Unknown protein [Arabidopsis thaliana] 


0.7 


01 


13631937 


gi|1363 1937|ref|XP_p01412.3| metal-regulatory 
transcription factor 1 [Homo sapiens] 


0.0004 


92 


14574479 


gi|14574479|gb|AAF60869.2| (AC024856) Hypothetical 
protein Y71G10AR.2 [Caenorhabditis elegans] 


0.15 


93 


14970562 


gi|14970562|emb|CAC44371.1| (AJ292465) WDR9 
protein, form A [Homo sapiens] 


2E-16 


98 


12855751 


gi|12855751|dbj|BAB30445.1| (AK016814) putative 

[Mus musculus] i 


3.1 


1ft? 


11498284 


gi|11498284|ref|NP_069510.1| adenylate kinase (adk) 
[Archaeogtobus fulgidus] sp|029581|KAD__ARCFU 
ADENYLATE KINASE (ATP-AMP 
TRANSPHOSPHORYLASE) pir|[D69334 adenylate 
kinase (EC 2.7.4.3) - Archaeoglobus fulgidus 
gb|AAB90565.1| (AE001058) adenylate kinase (adk) 
[Archaeoglobus fulgidus] 


4.00E-13 


104 


10245 


gi|9910248|ref|NP_064579.1| GL004 protein (Homo 
sapiens] gb|AAF86949.1|AF226049J. (AF226049) 
CjJL004 [Homo sapiens] 


1.00E-85 


105 


730219 1 


gi|7302191|gb|AAF57287.1| (AE003784) CG7856 gene 
product [Drosophila melanogaster] 


LOOE+OO 


106 


12005513 


gi| 120055 13|gb|AAG44486. 1|AF246239_1 (AF246239) 
AD030 [Homo sapiens] gb|AAG44658.1|AF258660JL 
(AF258660) AD033 [Homo sapiens] 


2.00E-73 


108 


13874435 


gi|13874435|dbj|BAB46923.1| (AB000782) cerebral 
protein- 10 [Homo sapiens] 


4.00E-04 


109 


14783375 


gi|14783375|ref|XP_045533.1| protein predicted by 
clone 23627 [Homo sapiens] 


1.60E-01 


110 


7229532 


gl| /2zy>JZ|gD|AA1^42ooD.l|Ar^2D924_l (Ar 223924) 
staufen [Drosophila virilis] 


3.30E+00 


112 


14775931 


gi|14775931|ref|XPJ)49935.1| hypothetical protein 
FLJ14950 [Homo sapiens] 


3.00E-77 


114 


10140758 


gi|10140758|gb|AAG13589.11AC051633_5 

(AC05 1633) putative ubiquitin protein [Oryza sativa] 


2.10E+00 
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N 
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115 




gi|4ziu^o|pir||b34Z0j nypotneticai protein la - 
Escherichia coli emb|CAA51036.1| (X72295) ORFla 
jDScnencnia couj 


6 50E+00 

VJ . ~J \J i—i < \J\J 


116 


14775931 


gi|14775931|reflXP_049935.1| hypothetical protein 
FLJ14950 [Homo sapiens] 


2.00E-74 


117 


14781316 


gi|14781316|refpCP_032533.1| hypothetical protein 

FLI 12787 [Homo sapiens] ' 


e-101 


118 


13161938 


gi|13161938|emb|CAC32991. 1| (AJ309861) putative 
protein kinase WNK4 [Homo sapiens] 


1.00E-30 


110 


7662028 


gi|7662028|re^NP_055557.1| KIAA0255 gene product 
[Homo sapiens] reflXP_009540.1| KIAA0255 gene 
product [Homo sapiens] sp|Q92544jT9S4_HUMAN 

TRANSMEMBRANE 9 S UPERFAMLLY rKU 1 JbUN 
MEMBER 4 dbj|BAA13385.1| (D87444) Similar to 

S.cerevisiae EMP70 protein precursor (S25 110) [Homo 
sapiens] emD|\^AU /dou i.z\ \j\iJJ e \7JDy) cuoooi> i / 
(KIAA0255 protein) [Homo sapiens] 


7.00E-14 


120 


10179324 


gi|10179324|dbj|BAB13674.1| (AB041351) type IV 
collagen alpha 6 chain [Mus musculus] 


7.9 


122 


7160119 


gi|7160119|emb|CAB76306.1| (AL158057) putative 
acetyltransferase (fragment). [Streptomyces coelicolor 
A3(2)] 


0.1 


123 


14 


gill4732244lrefpOP_039960.il PC3-96 protem [Homo 
sapiens] ref|XPJ)39961.1| PC3-96 protein [Homo 

sapiens] pir||T46276 hypothetical protem 
DKFZp564Ml 178. 1 - human emb|CAB70781. 1| 
(ALU /->! j) hypothetical protem [xiomo sapiens j 
gb|AAG35611.1|AF202092_l (AF202092) PC3-96 
jnomo sapiens j 


9E-81 


124 


7020475 


gi|7020475|dbj|BAA91 144.1| (AK000407) unnamed 
protein product [Homo sapiens] 


2.6 


125 


14775931 


pill 477593 llreflXP 049935 11 hvDothetical protein 
FLJ14950 [Homo sapiens] 


2E-72 


127 


226135 


gi|226 135 tprf}[ 14113 03 A GABA receptor alpha2 [Bos 
taurus] 


0.046 


136 


4995818 


gi|4995818|emb|CAB44313.1| (AJ131899) proline rich 
synapse associated protein 1 [Rattus norvegicus] 


0.00003 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


138 


13357869 


gi|13357869|ref|NP_078143.1| unique hypothetical 
[Ureaplasma urealyticum] pir||D82907 hypothetical 
protein UU309 [imported] - Ureaplasma urealyticum 
gb|AAF30718.1|AE002128_6 (AE002128) unique 
hypothetical [Ureaplasma urealyticum] 


2.7 


139 


1076802 


gi|1076802|pir||S49915 extensin-like protein - maize 
cuiu|\^xTjr\o*Tz.o\j. i| ^jhhoj ) cxicnsiii - iiK.e protein \jl&€L 
mays] prfl|21 1 1476A extensin-like domain [Zea mays] 


1.9 


140 


11466208 


gi|l 1466208|refINPJ)66531.1| NADH dehydrogenase 
subunit 4 [Naegleria gruberi] 

0hlAAfr17ROQ 1IAF98R00? ^4 ^AF7RR0Q9MMAnH 

dehydrogenase subunit 4 [Naegleria gruberi] 


6.6 


145 


7490990 


gi|74yU29U|pir|t 1 38644 conserved hypothetical protein 
SPAC323.07c - fission yeast (Schizosaccharomyces 
pombe) emb|CAB53410.1| (AL109988) conserved 
hypothetical protein; UPF0013 [Schizosaccharomyces 


0 78 

v. / O 


147 


5813770 


gi|5813770|gb|AAD52006J|AF017304_l (AF017304) 
FMVBB [Morone saxatilis] 


6.8 


149 


2497311 


^h/IO'7'2 1 1}**** L )T>Z<.Qf\'2\'\&f^/~ K T>/~v-\ 7TXT A/IA7T7T TAT 

gi|z4y73i l(Sp|r!>5o0i|MU(j_DUVliN MYliLllN- 
OLIGODENDROCYTE GLYCOPROTEIN 
PRECURSOR pir||A47712 myelin/oligodendrocyte 
glycoprotein precursor - bovine 


0.33 


152 


609342 


anaplastic lymphoma kinase fusion protein [Homo 
sapiens] 


2.7 


153 


7481152 


gi|7481 152(pir||T36795 probable penicillin acylase - 
Streptomyces coelicolor emb|CAB46792. 1| (AL0968 1 1) 
putative penicillin acylase [Streptomyces coelicolor 
A3(2)] 


8 
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154 


12229694 


gi|12229694|sp|O93830|BET2_CANAL TYPE H 
PROTEINS GERANYLGERANYLTRANSFERASE 
BETA SUBUN1T (TYPE II PROTEIN GERANYL- 
GERANYLTRANSFERASE BETA SUBUNIT) 
(GGTASE-H-BETA) (PGGT) (YPT1/SEC4 
PROTEINS GERANYLGERANYLTRANSFERASE 
BETA SUBUNIT) dbj|BAA35193.1| (AB021171) Beta 
subunit of geranylgeranyl transferase type2 [Candida 
albicans] 


4.7 


156 


9630058 


gj|9630058|ref|NP_046276.1| unknown [Orgyia 
pseudotsugata single capsid nuclear polyhedrosis virus] 
sp|O10359|Y120_NPVOP HYPOTHETICAL 9.3 KD 
PROTEIN (ORF120) pir||T10389 hypothetical protein 
120 - Orgyia pseudotsugata nuclear polyhedrosis virus 
gb|AAC591 19.1| (U75930) unknown [Orgyia 
pseudotsugata single capsid nuclear polyhedrosis virus] 


8.5 


157 


9965966 


gi|9965966|gb|AAG10219. 1|AF294433_1 (AF294433) 
coat protein [Alfalfa mosaic virus] ! 


7 


158 


7296047 


gi|7296047|gb|AAF51343.1| (AE003585) CG17711 
gene product [Drosophila melanogaster] 


4.6 


161 


13507856 


gi|13507856|reflNP_109805.11 ribosomal protein L20 
[Mycoplasma pneumoniae] sp|P78023|RL20_MYCPN 
SOS RIBOSOMAL PROTEIN L20 pir||S73363 
ribosomal protein L20 - Mycoplasma pneumoniae 
(stram ATCC 29342) gb|AA(j34734.1|Ab0UUuU4_3 
(AE000004) ribosomal protein L20 [Mycoplasma 
pneumoniae] 


0.86 


162 




gi|8393641|ref|NP_058889.1| kynurenme 
aminotransferase II [Rattus norvegicus] 
emb|CAA90507.1| (Z50144) Icynurenine/alpha- 
aminnaHinate aminotransferase [Rattus norvefiicusl 


4.2 


169 


1515448 


gi|1515448|gb|AAC45559.1| (U63096) Description: 
pBF4 gene involved in Bacteroides spp. conjugal 
transfer [Bacteroides fragilis] 


2.6 


175 


422832 


gi|422832|pirl|B46629 mucin 6, gastric (3-repeat clone) - 
human (fragment) gb|AAB61945.1| (L07518) mucin 
[Homo sapiens] 


3.2 
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176 


7504075 


gi| / j Kj^tsj i j |pir| j i ©** nypoLncuCai proLein rjjrH.o - 
Caenorhabditis elegans emb|CAB01215.1| (Z77663) 

J. »/Jl T.V/ |^\^CIV^11VJ1 11CIL/1_IIL1l5 t/lt/^dlloj 




178 


13385538 


gi|13385538|ref|NPJ)80316.1| RIKEN cDNA 

ZOlUv/JUJVUJl gene [IVlUb lUUbCUlUSJ UDj|X>/\J3Zo.JZU.l| 

(AK012865) putative [Mus musculus] 


2E-48 


180 


902377 


gi|902377|gb|AAA8298Ll| (U18059) polyprotein 
[pestiviras type 1] 


0.53 


181 


13421499 


gi|13421499jgb|AAK22335.1| (AE005708) conserved 
hypothetical protein [Caulobacter crescentus] 


6.2 


187 


10803574 


gi|10803574|refINP_045972.1| GvpN [Halobacterium 
sp. NRC-1] sp|Q9HI16|GVNl_HALNl GVPN j 
PROTEIN 1 pir||A47053 gas-vesicle operon protein 
gvpN [imported] - Halobactenum sahnarum pui|T08244 
gas-vesicle operon protein gvpN - Halobacterium sp. 
(strain NRC-1) plasmid pNRClOO gb|AAC828 1 1 . 1| 
(Ar U1o4oj) CjvpJN [Halobacterium sp . JNKL- 1 J 
gb|AAD 15044.1| (L03361) gas vesicle protein [Plasmid 

nT\TRPinm €tU\ A ACrOmiR 11 /'AFOfHIA'^ CUmU 
piNJVV^lUUJ gD[/V-f\VJZ-U /Z.O. JL| \/\JDUU.} VJVpiN 

protein, cluster A; GvpNl [Halobacterium sp. NRC-1] 


7.40E-01 


192 


7662532 


gi|7662532|ref|NP_054838.1| PRO0195 protein [Homo 
sapiens] gb|AAF24025.1|AF090901_l (AF090901) 
PRO0195 [Homo sapiens] 


2.60E-01 


194 




gi|5360226|dbj|BAA36472.1| (AB015177) FO-ATPase 

dir»ii"ni+ f\ 1 Rptfsi imlrraTicl 
bUU LULU U \J |^J_>CUi VlUgcUloJ 




196 


3024921 


gi|3024921|sp|Q583 12|Y902_METJA 
HYPOTHETICAL PROTEIN MJ0902 pir||F64412 

hypothetical protein MJ0902 - Methanococcus 
jannaschii gb|AAB98907.1| (U67533) M. jannaschii 
predicted coding region MJ0902 [Methanococcus 
jannaschii] 


6.5 


198 


7522108 


gi|7522108|pir||T29097 pro-pol-dUTPase polyprotein - 
murine endogenous retrovirus ERV-L (fragment) 
emb|CAA73251.1| (Y12713) protease; reverse 
transcriptase; RNaseH; integrase; dUTPase; Pro-Pol- 
dUTPase polyprotein [Mus musculus] 


1.7 



443 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


N 


DESCRIPTION 


P VALUE 


199 


9955592 


gi|9955592|emb|CAC00653.1| (AJ2925 19) N-acylamino 
acid racemase [Amycolatopsis orientalis subsp. lurida] 


4.5 i 


203 


4097932 


oi]4097932leblAAD 10321 11 <TJ72240") unknown 
Choristoneura fumiferana nucleopolyhedrovinis] 


6.5 j 


i 205 


11252252 


gi|l 1252252|pir||D82079 pyruvate dehydrogenase, El 
component VC2414 [imported] - Vibrio cholerae (group 
\j i sixain in Loyox ) gv\i\s\r j j i , x\ \t\jji\j\j*Tj x x) 
syruvate dehydrogenase, El component [Vibrio 
cholerae] 


3.5 i 


206 


12963481 


MGC5560; hypothetical protein [Homo sapiens] 
HHilR AR14^4? 1 1 f AK022978i unnamed nrotein nroduct 
Homo sapiens] 


3E-73 


207 


14737646 


gi|14737646|ref|XP_038048.1| similar to APICAL 
ENDOSOMAL GLYCOPROTEIN PRECURSOR (R. 
norvegicus) [Homo sapiens] 


4 


209 ' 


14742770 


oil 1 4749 770lrHT5n > 039393 1! KIAA1550 nrotein 
[Homo sapiens] 


8.4 


I 213 


12643499 


gijl2643499|sp|P89202|RRPO_SHMV RNA- 
DIRECTED RNA POLYMERASE (186 KDA 
PROTEIN) [CONTAINS: 
METHYLTRANSFERASE/RNA HELIOASE 
(MT/HEL) (128 KDA PROTEIN)] 


6.5 


223 




gi|9629141|ref|NP_044299. 1| putative transport protein 
[Soybean chlorotic mottle virus] 
sp|P15631|VMP_SOCMV MOVEMENT PROTEIN 
(CELL-TO-CELL TRANSPORT PROTEIN) (ORF 
IA) pir||JS0379 hypothetical 35. 5K protein - soybean 
cniorouc moixic virus eixiD|^/ijrw->oj«7.x| ^auo^o^ 
putative transport protein [Soybean chlorotic mottle 
virus] 


4.3 


226 


7494266 


gi|7494266|pir||T18485 hypothetical protein C0840w - 
maiana parasite \r lasnroui uiii ioiwipcu uxuj 
emb|CAB11124.1| (Z98551) putative P-type ATPase 
[Plasmodium falciparum] 


2.3 


228 


14042283 


gi|140422831dbj|BAB55184.1| (AK027535) unnamed 
protein product [Homo sapiens] 


2E-30 


229 


14744326 


gi|14744326|ref|XP_044812.1| NIMA (never in mitosis 
gene a)-related kinase 6 [Homo sapiens] 


4E-13 


232 


98014 


gj|98014|pir||S11148 amiA protein - Streptococcus 
pneumoniae 


10 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE | 


233 


10178317 


gljlUl /oil /|emD[L/AL/Uo40i.I| (AL12iy2o) 
bA18I14.5.1 (novel protein isoform 1) [Homo sapiens] 


7 


234 


630444 


gi|630444|pir||S43955 probable NADH dehydrogenase 
(ubiquinone) (EC 1.6.5.3) chain 3, kinetoplast - 
Trypanosoma brucei mitochondrion (fragment) 
gb|AAA20887.1| (L26251) CR5 [Trypanosoma brucei] 


6.2 


239 




gi|7505826|pir]fr25816 hypothetical protein K12D9.2 - 
Caenorhabditis elegans 


1.4 


240 


7434912 


gi|7434912|pir||H7 1934 phosphatidylglycerophosphate 
synthase - Helicobacter pylori (strain J99) 
gb|AAD05990.1| (AE001475) 
PHOSPHATIDYLGLYCEROPHOSPHATE 
SYNTHASE [Helicobacter pylori J99] 


5.8 


243 


• 

6136155 


gi|6136155|sp|O70546|UTX„MOUSE 
UBIQUITOUSLY TRANSCRIBED X 
CHROMOSOME TETRATRICOPEPTTDE REPEAT 
PROTEIN (UBIQUITOUSLY TRANSCRIBED TPR 
PROTEIN ON THE X CHROMOSOME) 
emb|CAA05692.1| (AJ002730) UTX [Mus musculus] 


8.90E+00 


246 


2143962 


gi|2143962[pir||I59422 rsec8 - rat (fragment) 
gb|AAC52265.1| (U32498) rsec8 [Rattus norvegicus] 


1E-93 


250 


6136155 


gi|6136155|sp|O70546|UTX_MOUSE 
UBIQUITOUSLY TRANSCRIBED X 
CHROMOSOME TETRATRIC OPEPTIDE REPEAT 
PROTEIN (UBIQUITOUSLY TRANSCRIBED TPR 
PROTEIN ON THE X CHROMOSOME) 
emb|CAA05692.1| (AJ002730) UTX [Mus musculus] 


7.1 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


255 


12737603 


gi|12737603|ref]XP_006839.2| activin A receptor type H 
like 1 [Homo sapiens] ref|XP_050707.1| activin A 
receptor type II-like 1 [Homo sapiens] 
sp|P37023|KIR3_HUMAN SEPJNE/THREONINE- 
PROTEIN KINASE RECEPTOR R3 PRECURSOR 

/CXTT>0\ / A f*'l*\ \ 7TTVT T> T7 /~ 1 T?D r T/^X> T TVT2 VTTVTACC7 1 \ 

(AC 11VUN ivbCJir 1 UK-J_/lJSJti JsJLXN AoJd 1 ) 

(ALK-1) (TGF-B SUPERFAMILY RECEPTOR TYPE 
I) (TSR-I) pir||A49431 activin/TGF-beta-like type I 
rprpntnr - human <*hlAAA 16160 11 fL17075> TGF-b 
^iinerfarnilv recentor tvoe I [Homo sanieiisl 
gb|AAB6 1900.1| (U77713) activin receptor like kinase 1 
[Homo sapiens] 


7.50E+00 


265 




gi|2133974|pir||S68425 SoxPl protein - rainbow trout 
dbj|BAAl 1868.1) (D83256) SoxPl [Oncorhynchus 
mykiss] 


0.18 


266 


9838427 


gi|9838427|reflNP_064041.1| orf214 [Beta vulgaris] 


2.2 


268 


5731737 


ml5731737ldbilBAA83337 11 (AB021878) similar to 
yeast sodium/proton exchanger [Oryza sativa] 


3.3 


271 


13642312 


gi|13642312|ref|XPJ)02895.2| parathyroid hormone 

receptor i |_rxomo sapicnbj rcij-^vr kjdd ±\ 

parathyroid hormone receptor 1 [Homo sapiens] 
ref|XP_033743.1| parathyroid hormone receptor 1 
[Homo sapiens] 


9 


272 


11545138 


gi|11545138|emb|CAC08390.2|(AL121886) 
dJ1028D15.1 (continued from dJ138B7.1 in 
Em:Z98752) [Homo sapiens] 


0.0000008 


275 


6323677 


gl|632io /7|rei|JNjr_Ul3 /4o.l| imru^4cp 
[Saccharomyces cerevisiae] sp|Q05131|YMS4_YEAST 

JlT X x\J 1 JTuZr 1 IV^xVL* HO.H x\JJ x xWJ x xlixiri iiN /Viva zf— 

IMP2 INTERGENIC REGION pir||S53951 probable 
mpmhrnnp nrntf»in YA/TR0^4f* - vea<rt* ( SRCcharomvces 
cerevisiae) emb|CAA89150.1| (Z49213) unknown 
[Saccharomyces cerevisiae] 


6.6 


277 


7297043 


gi|7297043|gb|AAF52312.1| (AE003612) CG13992 
gene product [Drosophila melanogaster] 


8.1 


282 


1616595 


gi|1616595|emb|CAA41710.1| (X58907) steroid 21- 
monooxygenase [Homo sapiens] 


6.8 


283 


14089610 


gi|14089610|emb|CAC13370.1| (AL445563) unknown; 
predicted coding region [Mycoplasma pulmonis] 


2.1 
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1 


fable 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) j 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


284 


139809 


gi|139809|sp|P27571|XIST_MOUSE X INACTIVE 
SPECIFIC TRANSCRIPT PROTEIN pir||S 15433 

nyp ometiCai protein - mouse emD|v^/\iY*riy /o.ij 
(X59289) ORF [Mus musculus] prf||171 1440A xist 
gene [Mus musculus] 


0.18 


285 


11355576 


gi|l 1355576|pir||T44331 hypothetical protein wblD 

[imported j - v lono cnoierae qdj | jd aaj j o j j . i | 
(ABO 12957) probable NADH dehydrogenase [Vibrio 
cholerae] 


0.23 


297 


14389443 


gi|143oy443|ret|JNJr_l lo/ /o.i| XNAi/ri aenyarogenase 
subunit 4 [Rana nigromaculata] dbj|BAB5 8996.1) 
(AB043889) NADH dehydrogenase subunit 4 [Rana 
nigromaculata] 


9.2 


300 


4493987 


gi|4493987|emb|CAB39046.1| (AL034559) hypothetical 
protein, PFC1030w [Plasmodium falciparum] 


2.1 


306 


7494200 


gi|7494200|pir||T18434 hypothetical protein C0375c - 
malaria parasite (Plasmodium falciparum) 
emb|CAB 1 1 1 1 1 . 1 1 (Z98547) predicted using hexExon; 
MAL3P3.10 (PFC0375c), Homolpgue ot t^.elegans 
T08A11.2 protein, len: 1387 aa; Similarity to C.elegans 
T08A11.2 protein. C.elegans T08A11.2 protein 
(WP:T08A1 1.2) BLAST Score: 2901, sum P(2) = 0.0; 
66% identity in 839 aa > 


1.3 


308 


10728064 


gene product [Drosophila melanogaster] 


4.7 


317 


9631682 


gi|9631682|ref|NP_048461.1| all3L [Paramecium 
bursana CMorella virus lj pir||i 17603 hypothetical 
protein al 13L - Chlorella virus PBCV-1 
gb|AAC96481.1| (U42580) all3L [Paramecium 
bursaria Chlorella virus 1] 


7.7 


318 


4733986 


gi|4733986|gb|AAD28666. 1|AC007209_2 (AC007209) 
hypothetical protein [Arabidopsis thaliana] 


5.30E+00 


319 


461649 


gi|461649|sp|Q05004|BB61_RABIT BRUSH BORDER 
61.9 KD PROTEIN PRECURSOR pir||B45665 adult- 
specific 61 .9K brush border protein precursor - rabbit 
emb|CAA78302.1| (Z 12840) protein of unknown 
function [Oryctolagus cuniculus] 


3.00E-16 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


322 


1 1466552 


gi|11466552|ref|NP_044801.1|NADH dehydrogenase, 

»l"vi -i *-i t-#- /I 1 D a/tli^ran/moe <ampn/*OTiol mr n78 1 5c ^ A If 1— 1 

SUDUmt T" [xxcClOOIuOrlaS aJuCOCcUlciJ pirjfo /oloj INxvL/rx 

dehydrogenase (ubiquinone) (EC 1.6.5.3) chain 4 - 
Reclinomonas americana (ATCC 50394) mitochondrion 
gb|AADl 1916.1| (AF007261) NADH dehydrogenase, 
subunit 4 [Reclinomonas americana] 


9.1 


323 


13603897 


gi|13603897|gb|AAK31986.1|AF295594_l (AF295594) 
SKTxl precursor [Buthus martensii] 


9.3 


325 


11497767 


gi|l 1497767|ref|NP_068989.1| hypothetical protein 
[Archaeoglobus fiilgidus] sp|O30087[Y150_ARCFU 
HYPOTHETICAL PROTEIN AF0150 pir||F69268 
VivnntViPtiral nrntein AF0150 - Archaeofflotms fulsidus 
gb|AAB91087.1| (AE001096) A. fiilgidus predicted 
coding region AF0150 [Archaeoglobus fiilgidus] 


7.1 


332 


7*506939 


tril7S069^9lrrirllT24356 hvnothetical orotein T02D1 3 - 
Caenorhabditis elegans emb|CAB05908.1| (Z83319) 
T02D1 3 ICaenorhabditis eleeansl 


3.9 


336 


7510675 


gi|7510675|phi|T29689 hypothetical protein ZC266.2 - 
Caenorhabditis eleeans eblAAA96101 11 (TJ53140) 
ZC266.2 gene product [Caenorhabditis elegans] 


2.4 


337 


12697923 


gi|12697923|dbj|BAB21780.1| (AB051476) KIAA1689 
protein [Homo sapiens] 


3E-21 


338 


630465 


ffil630465tairllS47042 orotein kinase (EC 2.7.1.37) cdc2- 
related 1 - malaria parasite (Plasmodium falciparum) 


5.7 


340 


7488318 


gi(74883 18|pir||T01 191 RNA-directed DNA polymerase 

gb|AAC13599.1| (AF058914) similar to reverse 
transcriptase (Pfam: transcript_fact.hmm, score: 72.31) 
[Arabidopsis thaliana] 


5 


341 


5804818 


gi|5804818|emb|CAB52872.1| (AL021712) putative 
protein [Arabidopsis thaliana] emb|CAB79171.1| 
(AL161556) putative protein [Arabidopsis thaliana] 


5 


343 


14579376 


gi|14579376|gb|AAK69253.11AF336309_48 
(AF336309) SpyB [Yersinia enterocolitica] 


8.2 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 

"MO 


ACCESS 
N 


DESCRIPTION 


PVALUE 


344 


11284162 


gi|l 1284162|pir||H81077 hypothetical protein NMB 1490 
imported] - Neisseria meningitidis (group B strain 
MD58) gb|AAF41846.1| (AE002498) hypothetical 
protein [Neisseria meningitidis MC58] \ 


3 


347 


11361512 


gi|l 13615 12|pir||H82781 hypothetical protein XF0626 
imported] - Xylella fastidiosa (strain 9a5c) 
gb|AAF83436.11AE003908_4 (AE003908) hypothetical 
proteni [Xylella tastidiosa ya5cj 


7 fk 

/.o 


348 


7497054 


gil7497054|pui]T19739 hypothetical protein C35A5.4 - 
Caenorhabditis elegans emb|CAA94905.1| (Z71185) 
contains similarity to Pfam domain: PF00635 (MSP 
(Major sperm protem) domain), Score=66.4, Jb-value— 2e 
16, N=l [Caenorhabditis elegans] 


0.064 


349 


6531601 


gi|6531601|gb|AAF15500.1| (AF167672) 
immunoglobulin heavy chain variable region [Homo 
sapiens] 


1.6 


352 


12858471 


gi| 1285847 l|dbj|BAB3 1327.1| (AK018655) putative j 
Mus musculus] 




353 


2128157 


gi|2128157|pir||C64319 hypothetical protein MJ0154 - 
Methanococcus jannaschii 


0.98 


355 


14195575 


gi| 14 195575|sp|P58 1 14[YV78_CAUCR 
HYPOTHETICAL PROTEIN CC3178 
gb|AAK25 140,1| (AE005981) pirin-related protein 
[Caulobacter crescentus] 


4.20E+00 


359 




gi|7468435|pir||B72015 metalloproteinase, insulinase 
family CP0903 [imported] - Chlamydophila pneumoniae 
(strains CWL029 and AR39) gb|AAD19093.1| 
(AE001675) Insulinase femily/Protease III 
[Chlamydophila pneumoniae CWL029] 
gb|AAF38689.1| (AE002249) metalloprotease, 
insulinase family [Chlamydophila pneumoniae AR39] 
dbj|BAA99165.1| (AP002548) insulinase 

forniKz/rvrrvteacp TTT rf^lilarnvHrn^Hila THIftllTnoriiajB J 13 81 


4.1 


360 


461649 


gi|461649|sp|Q05004|BB61_RABIT BRUSH BORDER 
61.9 KD PROTEIN PRECURSOR pir||B45665 adult- 
specific 61 .9K brush border protein precursor - rabbit 
emb|CAA78302.1| (Z12840) protein of unknown 
function [Oryctolagus cuniculus] 


8E-45 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


363 




gi|12859685|dbj|BAB3 1736.1| (AK019464) putative 

rMiiQ mi infill In 


6.1 


365 


11055982 


gi|l 1055982|ref|NP_067633.1| PR domain containing 
io 5 fJK-domain containing protein id [riomo sapiens j 
ref]XP 011460.1) PR domain containing 13 [Homo 
sapiens] gb|AAG13448.1| (AY004253) PR-domain 
containing protein 13 [Homo sapiens] 


3.4 


368 


3915305 


gi|3915305|sp|Q96597|VP5_AHSV9 OUTER CAPSID 
PROTFTN VPS ablAAB 17570 11 ^74489^ VP5 
[African horse sickness virus] 


0.63 


370 


11466986 


gi| 1 1 H-ooy o o |rei|iN r^y e * i o y o . i j urro 3 j [cugieud 
gracilis] sp|P3 19 16[YCX3__EUGGR HYPOTHETICAL 
7« n If n PPOTFTN TNT PQRf TNTRON 2 (ORF635^ 
emb|CAA50080.1| (X70810) Orf635 [Euglena gracilis] 


1.7 


373 


14766918 


gill4766918lrefpCP_034599.ll acid sphingomyelinase- 
like phosphodiesterase [Homo sapiens] 


2E-99 


375 




gi|5732610|gblAAD49190.1|AFl 15423_2 (AF115423) 

rr%TP* r\Tfxtf*ir\ tX-f&r\ZitTt\ <! R vitiiqI 


3.4 


377 


Z /U /OOl 


gi|2707661|gb|AAB94699.1| (AF029934) IgM heavy 

z^Vioin \/T^T rpmrtn rO r\/f"*i*n1 n mi c PiiTiir*ii1n<jl 

C/Xic 11X1 V J— ' J lCtilUll 1 Wl J^LVJICl^Ui WrillllwLU.lJ.dJ 


4.7 


379 




gi|7516819|pir||C72580 hypothetical protein APE1923 - 
Aeropyrum pernix (strain Kl) dbj|BAA80928.1| 
(AP000062) 105aa long hypothetical protein 
ly^eropyiuiii peniixj 


6.80E+00 


380 


3859670 


gil3859670|emb|CAA22008.1| (AL033502) hypothetical 
membrane protein [Candida albicans] 


9.90E+00 


382 


14784562 


r*i11 /lTQ/l^/COli-i-flYP ftACSAI** llmutT CP rrsiW V»r»Tnr»lr»cr -i 
glll't /o^DOZ|rei|_\Jr u*fU*Hj.l| inuLL-f V*-** cv/iij nonwjiug -> 

[Homo sapiens] gb|AAF23905.1| (AF195658) DNA 
mismatch repair protein [Homo sapiens] 


2E-25 


383 


14194519 


gi|141945 19|sp|Q9K920|CMGB_BACHD COMG 
OPERON PROTEIN 2 HOMOLOG dbj|BAB06550.1| 
(AP0015 16) DNA transport machinery [Bacillus 
halodurans] 


2.7 


386 


7509624 


gi|7509624|pir||T26676 hypothetical protein Y38F1A.1 - 
Caenorhabditis elegans emb|CAA21628.1| (AL032639) 
predicted using Genefinder-cDNA EST 
EMBLAW057282 comes from this gene 
[Caenorhabditis elegans] 


9.4 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 

vJuU VasXVJJL J. A V_/i. >( 


P VAT TTP 


388 


7508443 


gi| /3U5443|pir|| 1 ZdZg 1 hypothetical protein r25£12. 1 1 « 
Caenorhabditis elegans emb|CAB04829.1| (Z82052) 
contains sinuiaurny xo x rani uomain. rruio// (iJomain 
of unknown function), Score=130.6, E-value=4.6e-36, 
N=l [Caenorhabditis elegans] 


3.4 


389 


7295832 


gi|7295832|gb|AAF51133.1| (AE003580) CG3347 gene 
product [Drosophila melanegaster] 


0.032 


390 


7510359 


gi|7510359|pir||T27283 hypothetical protein Y64G10A.f 

V_/ AwllU 1 Lid. \J kAl Ho vlvKfl 1 1 0 


X 9 
o .<&. 


395 


6679381 


gj|6679381|reflNP_032903.1| plasminogen [Mus | 
musculus] sp|P20918|PLMN_MOUSE 
PLASMINOGEN PRECURSOR [CONTAINS: 
ANGIOSTATIN] pir||PLMS plasmin (EC 3.4.21.7) 
precursor - mouse gb|AAA50168.1| (J04766) 
plasminogen [Mus musculus] 


2.2 


396 


OH* / zrj 1 0 


gi|84795 18|sp|Q89853|VGP_EBORS STRUCTURAL 

GLYCOPROTEIN) [CONTAINS: GP1; GP2] 
go(AA^D4ooy. i| vUZj41oj virion spike glycoprotein 
[Ebola virus] gb|AAC54891.1| (U23417) virion spike 




403 ! 


14133247 


gi|14133247|dbj|BAA86564.2| (AB033076) KIAA1250 
protein [Homo sapiens] 


6E-19 


412 


3293234 


gi|3293234|gb|AAC25913.1| (U96413) T-DNA 
oncoprotein [Agrobacterium tumefaciens] 


9.1 


415 


7023033 


gi| /uz^u^jjaujji5/v/\yiouiy. i[ ^ATk.uuiOH-yy unnameu 
protein product [Homo sapiens] 


9E-85 


416 


12514130 


gi|125 14130|gb|AAG55434. 1|AE005285_1 (AE005285) 
putative oxidoreductase [Escherichia coli 0157:H7 
EDL933] dbj|BAB34455.1| (AP002554) putative 
oxidoreductase [Escherichia coli 0157:H7] 


9.40E-01 


418 


4514359 


gi|4514359|dbj|BAA75394.1| (AB013377) ComGB 
[Bacillus halodurans] 


2.6 


419 


7510324 


gi|75 10324|pir||T27250 hypothetical protein Y5F2A.3 - 
Caenorhabditis elegans emb|CAA21648.1| (AL032641) 
Y5F2A.3 [Caenorhabditis elegans] 


1.6 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


421 


6981358 


gi|6981358|reflNP_037137.1| phosphoinositide 3-kinase 
p85 (other splicing variants: p55 and p50) [Rattus 
aorvegicus] sp|Q63787|P85A_RAT 
PHOSPHAITDYLINOSITOL 3-KINASE 
REGULATORY ALPHA SUB UNIT (PI3-KINASE 
P85-ALPHA SUBUNTT) (PTDINS-3-KINASE P85- 
ALPHA) (PI3K) dbj|BAA18932.1| (13o4U43) 
phosphatidylinositol 3-kinase p85 alpha subunit [Rattus 
norvegicus] 


2 4 


423 


6576738 


gi|6576738|dbj|BAA88337.1| (AB005891) ORF2 
[Platemys spixii] 


3.4 


4.94 




~\£.nic\Z^'2\ n V%\ A 11 ( AT71 A&A/7Qi\ T\f*\tziC 
gl|673y5!>3|gD|AArZ/zyy.l| {J\r iW £ +£z?) LJQiiaV^ luanio 

rerio] 


3.9 


ATI 




gi|13816283|gDlAAK4iUzu.l| (AJqUUoocS^J ouuate 
adenylyltransferase (sat) [Sulfolobus solfetaricus] 


3.6 


428 


11034774 


gi|l 1034774|gb|AAG27071 . 1|AF105225_7 (AF030414) 
NifQ [Gluconacetobacter diazotrophicus] 


3.6 


433 


13364676 


gi|13364676|dbj|BAB38622.1| (AP002568) hypothetical 
protein [Escherichia coli 0157:H7] 


5.8 


434 


12845784 


gi|12845784|dbj|BAB26898.1| (AK010380) putative 
[Mus musculus] 


0.075 




■ 

19^1^017 


gi|12513917Igb|AAG55266.1|AE005270_2(AE005270) 
Z1121 gene product [Escherichia coh 0157:H7 
EDL933] gb|AAG55675.1|AE005306_4 (AE005306) 
Z1560 gene product [Escherichia coli 0157 :H7 
EDL933] dbj|BAB34723.1| vArUUZD34) putative 
membrane protein [Escherichia coh 0157:H7] 


8.3 


436 


7661654 


gi|7661654)ref|NP 056444. 1| DKFZP566J153 protein 
[Homo sapiens] emb|CAB43677. 1| (AL050369) 
hypothetical protein [Homo sapiens] 


0.42 


437 


1150678 


gi|1150678|emb|CAA5097Ll| (X72086) ORF20R; 
B21R in citation [3] [Variola virus] 


5.10E+00 


438 


8928456 


gi|8928456|sp|05 1039|Y006_BORBU 

conserved hypothetical integral membrane protein 
BB0006 - Lyme disease spirochete gb|AAC66397.1| 
(AE001 1 15) conserved hypothetical integral membrane 
protein [Borrelia burgdorferi] 


4.2 


440 


7297273 


gi|7297273|gb|AAF52536.1| (AE003618) Myo28Bl 
gene product [Drosophila melanogaster] 


1 3.3 



452 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 




448 


7495690 


gi| /^t^joyuipirii i iy\jy\j nypouiciicai protein L>uoJr i 1.0 - 
Caenorhabditis elegans emb|CAB05674.1| (Z83216) 
C08F11.3 [Caenorhabditis elegans] 


4.9 


449 




gi|10438579|dbj|BAB15281.1| (AK025916) unnamed 

tiiYYt"^in tiiwiiiH' iHrvmrt cqtxiphc 1 


^ 7 


451 


112200 


gi|l 12200|pir||A32868 prolactin receptor - rat 
(fragments) 


8.6 


452 


114987 


gi|114987|sp|P17885|BIMA_EMENI BDMA PROTEIN 
pji j o nuciear protein oiirLrv - nmenccua niQU.ia.ns 
emb|CAA41959.1| (X59269) bimA [Emericella 
nidulans] 


6.60E+00 


457 


4885039 


gi|4885039|gb|AAD3 1932. 1|U00058_6 (U00058) 
contains sirnilarity to DNAJ domains (Pfam: PF00226, 
Score=44.9, E=1.8e-09, N=l) [Caenorhabditis elegans] 


4.3 


459 


4505121 


gi|4505121|reflNP_003916.1|methyl-CpG binding 
domain protein 4; 3,N(4)-ethenocytosine glycosylase; 
G/T mismatch glycosylase; G/U mismatch glycosylase; 
G/5-fluorouracil mismatch glycosylase with Diphasic 
kinetics [Homo sapiens] gb|AAC68879.1| (AF072250) 
methyl-CpG binding protein MBD4 [Homo sapiens] 
gb|AAD22195. 1[AF1 14784_1 (AF1 14784) methyl-CpG 
binding endonuclease [Homo sapiens] gb|AAD50374. 1| 
iz.\jyyzfj iiicuiyi"*v^p\j uinuiiig protein *t [rxuxiiv/ 

sapiens] 


2.1 


463 


12644455 


gi|12644455|sp|Q60751|IGlR_MOUSE BSfSULIN- 

LIKE GROWTH FACTOR I RECEPTOR 
PRECURSOR gbjAAC 12782. 1 1 (AF056187) insulin- 
like growth factor I receptor; IGF-I receptor [Mus 
musculus] 


3.1 


465 


11465398 


gi|11465398|refINP_045211 4 l| unknown; N-acetyl- 
glutamate-gamma-semialdehyde dehydrogenase 

rPiTsmiHiiiTn r»5i1 Hariri ml enlOQTT flSHMTQ^ PVATA 

AMIDOTRANSFERASE HISH 

gb|AAF12883.1|AF022186_5 (AF022186) unknown; N- 
acetyl-glutamate-gamma-semialdehyde dehydrogenase 
[Cyanidium caldarium] 


5 


466 


5761329 


gi[5761329|dbj|BAA83473.1| (AB004819) cysteine 
endopeptidase [Oryza sativa] 


5.70E+00 



453 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


468 


5306243 


gi|5306243]gb|AAD41976. 1|AC006438_8 (AC006438) 
hypothetical protein [Arabidopsis thaliana] 


4.2 


470 


7304025 


gi|7304025|gb|AAF59067.1| (AE003836) CG8639 gene 
product [Drosophila melanogaster] ! 


3.20E-01 


473 


7303724 


gi|7303724|gb|AAr5o773.1| (AbOU3o2y) iolagene 
Droduct [alt 2] [Drosophila melanogaster] 
gb|AAF58774.1| (AE003829) Iolagene product [alt 3] 
urosopmia meianogasterj gp\j\j\r j o i i o . i \ 
(AE003829) lola gene product [alt 4] [Drosophila 
mf»1nnnoacW] <*hlAAF58776 11 f AE003829^ lola eene 
product [alt 5] [Drosophila melanogaster] 


5.8 i 


476 


1098549 


gi|1098549]gb|AAA82594.1| (U25703) immunoglobulin 
Ught chain F class [Ictalurus punctatus] 


7 


477 


4262231 


gi|4262231|gb|AAD14524.1| (AC006200) hypolhetical 
protein [Arabidopsis thaliana] 


3.8 


478 


5834894 


gi|5834894|reflNP_006964. 1|ND5_10021 NADH 
dehydrogenase subunit 5 [Caenorhabditis elegans] 
sp|P24896|NU5M_CAEEL NADH-UBIQUINONE 

HYTnnP "ETtT TfT A QT7 f^XIATW ^ rMrllQ?6fV37 r\I ADIT 

dehydrogenase (ubiquinone) (EC 1.6.5.3) chain 5 - 
Caenorhabditis elegans mitochondrion 
emb|CAA38162.1| (X54252) ND5 protein (AA 1 - 527) 
[Caenorhabditis elegans] 


0.23 


480 


14776608 


ml 1 4776608 IreflXP 0335 88 11 BAI1 -associated urotein 
3 [Homo sapiens] 


8.6 


482 


7448960 


gi|7448960|pir||D72417 conserved hypothetical protein - 

Thermotoga maritima (strain MSB8) 
gb|AAD35211.1|AE001697_ll (AE001697) conserved 
hypothetical protein [Thermotoga maritima] 


9.1 


484 


7497960 


gi|7497960|pir||T15840 hypothetical protein C54G7.3 - 
Caenornabditis elegans gD|AAAai.>yz.i| luwiu; 
coded for by C. elegans cDNA yk9el0.5; coded for by 
C. elegans cDNA yk9el0.3; multiple regions of 
similarity to EGF-like repeats and cysteine-rich repeats 
[Caenorhabditis elegans] 


5 


487 


14720884 


gi|14720884|refIXP_032180.1| hypothetical protein 
DKFZp434A171 [Homo sapiens] 


7.7 


490 


3201900 


gi|3201900|gb|AAC19365.1| (AF067420) SNC73 
protein [Homo sapiens] 


0.28 



454 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


493 


12018149 


gi|12018149|gb|AAG45421.1|AF309495_l (AF309495) 

^cLiiit/tt/ o^Jt'v/ixiv-/ ii y vn yj^y ±JivJ iui t» ni/ii ^lyvAJL/iisLwiii cui 

Chlamydomonas reinhardtii] 


3.7 


494 


467634 


oil467634lemhlCAA82856 11 (Z29969^ HLY 4 
[Entamoeba histolytica] 


8.9 


498 


14726914 


^liH-zZO^iT-ireipvir / ivj.ij JtsjL/\/\UDZZ protein, 

Drosophila 'multiple .asters' (Mast)-like homolog 1 
Homo sapiens] 


6.8 


499 


11692583 


gi|11692583|gb|AAG39888.1|AF282303_l (AF282303) 
odorant receptor M34 [Mus musculus] 


9.5 


502 


7303064 


gi|7303064|gb|AAF58 132.1| (AE003810) CG12960 
crene rvrfidiiet rOm^onhila irielafioffasterl 


4.9 


504 


13701254 


gi|13701254|dbj|BAB42549.1| (AP003133) 
ORFID:SA1289~hypothetical protein, similar to 
bifiinctional biotin ligase/biotin operon repressor 
[Staphylococcus aureus subsp. aureus N3 15] 
dbj|BAB57618.1| (AP003362) hypothetical protein 
rStanhvlococcus aureus subsn aureus Mu501 


5.9 


505 




gi|137073|splP17086|UREl_PROMI UREASE ALPHA 
SUBUNTT (UREA AMIDOHYDROLASE) 

mirabilis gb|AAA25669.1| (M31834) urease subunit C 

{^rivjiwUo 11111 di/iiijj 


8.5 


512 


14548038 


gi|14548038|sp|Q9D952|EVPL_MOUSE 
ENVOPLAKIN (P210) (210 KDA CORNIFIED 
ENVELOPE PRECURSOR) emb|CAC38864.2| 

(AJ309317) envoplakin [Mus musculus] 


4.2 


516 


14031018 


gi|14031018|gb|AAK50523.1| (AY029684) NADH 
dehydrogenase F [Stenotaphrum secundatum] 


4.6 


518 


14600836 


gi|14600836|ref|NP_147359.1| hypothetical protein 
[Aeropyrumpernix] pir||D72647 hypothetical protein 
APE0610 - Aeropyrum peroix (strain Kl) 
dbj|BAA79580.1| (AP000060) 376aa long hypothetical 
protein [Aeropyrum pernix] 


2.8 
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SEQff) 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


519 


4930062 


gi|4930062|pdb|lMAA|D Chain D, Mouse 
Acetylcholinesterase Catalytic Domain, Glycosylated 

rotein pQD|iivjjfv/\|/\ villain i\ 9 jyiousc 
Acetylcholinesterase Catalytic Domain, Glycosylated 

Prntpiti nHKIIlV/TA A1P Chain f* Mnn«2P 

Acetylcholinesterase Catalytic Domain, Glycosylated 
Protein pdb| 1MAA|B Chain B, Mouse 
Acetylcholinesterase Catalytic Domain, Glycosylated 
rotein 


8.8 


522 


9757603 


gi|9757603|dbj|BAB08 147.1) (AB030877) maturase 
lilium mackliniae] 


0.62 


526 


7445887 


ml744S887lnirIIT07052 nrohable ootassium channel 

till / I f — ' \J \J / IL/ULII XV// \J+J£* LS X VJlr (Aw JLW |/wl*IOiJ*'**< * 

protein SKT2 - potato emb|CAA7087(U| (Y09699) 
putative inward rectifying potassium channel [Solatium 
tuberosum] 


2.1 


528 


2897832 


gi|2897832|dbj|BAA24910.1| (AB000134) cytochrome 
oxidase subunit I [Prorocentrum micans] 


9.8 


537 


7446379 


gi|7446379|pir||T14039 protein kinase (EC 2.7.137), 
mvntnnic dvstxonhv-associated - rat sblAAC02941.ll 
(AF021935) mytonic dystrophy kinase-related Cdc42- 
binding kinase [Rattus norvegicus] 


8E-52 


544 


14779648 


gi|14779648|ref|XPJ)27 140. 1| hypothetical protein 
FLJ23239 [Homo sapiens] 


6.3 


546 


7292503 


gi|7292503|gb|AAF47906.1| (AE003481) CG15023 
gene product [Drosophila melanogaster] 


6.3 


548 


4758758 


gi|4758758|ref|NP_004529.1| nucleosome assembly 
protein i iikc j [riomu cxipiciibj 
sp|Q99457|NPL3JHUMAN NUCLEOSOME 
ASSEMBLY PROTEIN 1-LIKE 3 dbj|BAA08904.1| 
(D50370) nucleosome assembly protein [Homo sapiens] 


1.6 


554 


14530412 


gi|14530412|emb|CAC42291.1| (Z69360) cDNAEST 
EMBL:U52071 comes from this gene [Caenorhabditis 
elegans] 


7 


559 


6680788 


gi|6680788|ref|NP_03 1576.1] Bloom syndrome protein 
homolog (human) [Mus musculus] 
sp|O88700|BLM_MOUSE BLOOM'S SYNDROME 
PROTEIN HOMOLOG emb|CAB10933.1| (Z98263) 
BLM protein [Mus musculus] 


4.8 
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SEQED 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


560 


3294495 


gi|3294495|gb|AAC25822.1| (AF038608) contains 
similarity to Mus musculus tumor susceptibility protein 
TSG101 (GB:U52945) [Caenorhabditis elegans] 


0.78 


561 


6319273 


gi|63 19273|ref]NP_009356. 1| pre-tRNA processing; 
Ptalp [Saccharomyces cerevisiae] 
sp|Q01329|PTAl_YEASTPTAl PROTEIN 
pir||S3 1299 pre-tRNA processing protein PTA1 - yeast 
v^accnarornyces cerevisiae ) go\i\j\j\D i t y 1 y . 1 1 (iMyj of 3) 
ptal [Saccharomyces cerevisiae] gb]AAC04988.1| 
(U12980) Ptalp: Pre-tRNA processing involved protein 
[Saccharomyces cerevisiae] 


9.9 


567 


Jul / / D 


gi|5817732|gb|AAD52875.1|AF142703_l (AF142703) 

utaJLtu oacm^c pi uicjlli ^v>/jjiii cbLid l dUidJoctj 




569 


1 76796 


gi|126296|sp|P08548|LINl_NYCCO LINE-1 
REVERSE TRANSCRIPTASE HOMOLOG 
prfl|1207289B reverse transcriptase related protein 


0 00000002 


572 


13816099 


gi|13816099|gb|AAK42875.1| (AE006870) Hypothetical 
protein [Sulfolobus solfataricus] 


2.1 


574 


12513361 


gi|12513361|gb|AAG54834.1|AE005228J. (AE005228) 
putative glutaminase [Escherichia coli 0157:H7 
FDT Q^l HhilRAFtt^Q61 11 f APOO?^?^ rnitative 
glutaminase [Escherichia coh 0157:H7] 


3.3 


575 


1 1 "35045"? 


gi| 1 1350453|pir||B82965 hypothetical protein PA5456 
[imported] - Pseudomonas aeruginosa (strain PAOl) 
gb|AAG08841.1|AE004958_ll (AE004958) 


Q 


584 


1174945 


gi|1174945|sp|P43109|VEXB_SALTI VI 
POLYSACCHARIDE EXPORT INNER- 

JNdJ^JVLJDKAJN li rKU 1 JbUN VriAli pir||vjJooyz Vl 

polysaccharide capsule transporter VexB - Salmonella 
typhi dbj|BAA03 197.1| (D14156) Wzm protein 
[Salmonella typhi] 


4.5 


586 


4566616 


gi|45666 16|gb|AAD23408. 1|AF1 1353 1_1 (AF1 1353 1) 
folhcle-stimulating hormone receptor precursor 
[Coturnix coturnix] 


2 


589 


14746756 


gi|14746756|ref|XPJ)39 102. 1| similar to hypothetical 
protein FLJ20378 (H. sapiens) [Homo sapiens] 


0.007 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


590 


7429823 


OT^^QW^lrviTllDf^QQQI rnn<;erved Vivnnthetical nrotein 
yteU - Bacillus subtilis gb|AAC00275.1| (AF008220) 
YteU [Bacillus subtilis] emb|CAB14987.1| (Z99119) 
siinilar to hypothetical proteins from B . subtilis 
[Bacillus subtiUs] 


0.49 


592 


11877309 


gi|11877309|emblCAC19023.1| (AJ278707) ORF 4 
[Neisseria meningitidis phage 2120] 


9.2 


594 


14193715 


gi|14193715|gb|AAK56102.1|AF332074_l(AF332074) 
peroxisome proliferator-activated receptor binding 
protein [Mus musculus] 


0.000002 


595 


14625344 


gi|14625344|gb|AAK71419.1|U80842_9(U80842) 
Hypothetical protein ZC239. 19 [Caenorhabditis 
elegans] 


1.9 


596 


1107614 


gi|1107614|emb|CAA62696.1| (X91351) viral infectivity 
factor protein [Human immunodeficiency virus type 1] 


0.008 


599 


6324561 


gi|6324561|ref]NP_014630.1| required to degrade 
misfolded ER lumenal and integral membrane proteins; 
Hrdlp [Saccharomyces cerevisiae] pir||S66695 probable 
membrane nrotein YOLO 13c - veast (Saccharomvces 
cerevisiae) emb|CAA99012.1| (Z74755) ORF YOL013c 
[Saccharomyces cerevisiae] 


6.7 


600 


7494878 


gi|7494878|pir||T33047 hypothetical protein B0344.2 - 
Caenorhabditis elegans gb|AAC16982.1| (AF067209) 
B0344.2 gene product [Caenorhabditis elegans] 


0.94 


601 


11347199 


gi|i ij4/iyy|pir]|v^oijji prouaoie uur-rv- 
acetylmuramoylalanyl-D-glutamyl-2, 6-diaminophnelate- 
-D-alanyl-D-alanine Kgase (EC 6.3.2.15) Cj0795c 
[imported] - Campylobacter jejuni (strain NCTC 11168) 
emb|CAB73060.1| (AL139076) putative UDP-N- 
acetylmuramoylalanyl-D-glutamyl-2, 6-diaminopimelate- 
- Ugase [Campylobacter jejuni] 


0.55 


604 


1293067 


gi|1293067|gb|AAB06014.1| (U50958) drosophila seven- 
in-absentia gene product homolog [Mus musculus] 


7.9 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


■ 

606 


5031975 


gi|5031975|refJNP_005875.1| p21-activated kinase 4; 
protein kinase related to S. cerevisiae STE20, effector 
for Cdc42Hs [Homo sapiens] ref]XP_041095.1| protein 
kinase related to S. cerevisiae STE20, effector for 
Cdc42Hs [Homo sapiens] ref]XP_041093.1| protein 
kinase related to S. cerevisiae STE20, effector for 
Cdc42Hs [Homo sapiens] sp|O96013|PAK4_HUMAN 

CCDTXTt? P rLTD "P /"VXTTVTC? TVn ATT7TXT TVTXT A OT? Tl A V A 

SbKINh/ lriKLUNlNb-P^ KINASE PAK 4 
(P2 1 -ACTIVATED KINASE 4) (PAK-4) 
gb|AAD01210.1| (AF005046) serine/threonine kinase 
[xiorrio sapiens j eniD|^A/\uyozu.i| \/vjui iodd) rj\]SJt 
protein [Homo sapiens] 


1.1 


608 


9280285 


gi|9280285|dbj|BAB01686.1| (AB046104) unnamed 
protein product [Macaca fascicularis] 


8.7 


609 


5869819 


gi]5869819|emb|CAB55576. 1| (AJ249395) NADH- 
ubiquinone oxidoreductase subunit 1 [Globodera 
pallida] 


0.84 


611 




gi|12045283|ref(NP__073094.1| conserved hypothetical 
protein [Mycoplasma genitalium] 
sp|P47662|Y423 JVLYCGE HYPOTHETICAL 
PROTEIN MG423 pir[|G64246 conserved hypothetical 
protein MG423 - Mycoplasma genitalium 
gb|AAC71647.1| (U39724) conserved hypothetical 
protein [Mycoplasma genitalium] 




612 


11289932 


gi|l 1289932|pir||T50247 probable hehcase [imported] - 
fission yeast (Schizosaccharomyces pombe) 
emb|CAB7184(U| (AL138666) putative helicase 
[Schizosaccharomyces pombe] 


4.5 


613 


6119709 


gi|ot iy /uy|emo|CAo!>y3oo.i| (AjZHyyoo) Lovj protein 
[Rattus norvegicus] 


1.6 


620 


7144507 


gi|7144507|gb|AAA58585.2| (U12823) hemolysin 
[Acanthamoeba polyphaga] 


8.1 


622 


15011757 


ffill5011757!ffblAAB04582 31 (U64603, Hvnothetical 
protein C09B7.1 [Caenorhabditis elegans] 


7 


625 


8923324 


gi|8923324|ref|NP_060247.1| hypothetical protein 
FLJ20345 [Homo sapiens] ref)XP_008161.3| 
hypothetical protein FLJ20345 [Homo sapiens] 
dbj|BAA91105.1| (AK000352) unnamed protein product 
[Homo sapiens] 


0.004 
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P VALUE 


J 

626 


6960319 


adaptor-related protein complex AP-4 epsilon subunit 
[Homo sapiens] 


0.00008 


628 


5739387 


gi|5739387|gb|AAD50450.1)AF169388_l (AF169388) 
alpha 4 collagen IV [Mus musculus] 


5.5 


631 


35 1UZ34 


gi|3510234|gb|AAC33487.1| (AC005581) R31237_l, 
jamai [nomo sapiens j 


8 6 




802150 


gi|802150|gb|AAB32775.1| (S75037) pancreatic 
peptidylglycine alpha-amidating monooxygenase, 
PAM=membrane-bouna isororm { alternatively spnceo, 
clone PAM-3, transmembrane domain (Ba region)} 
[numan, islet ceu tumor ceix line v^vjjt -i, rcpuuc rdrudi 5 
971 aa] (Homo sapiens] 


1E-13 


636 


2105238 


gi|2lU523o|gb|AA±>c> /504.ij ^Uooyioj similar xo vanoia 
and vaccinia E9L [Molluscum contagiosum virus 
subtype 1] 


1.5 


641 


75 7 5303 


gi|7576305|emb|CAB88003.1| (AJ277410) NADH \ 
dehydrogenase subunit 1 [Taenia hydatigena] 




643 


12231395 


gi|12231395|gb|AAG49078.1|AF216121JL (AF216121) 
phytochrome B [Gyminda tonduzii] 


7 


647 


281178 


gi|28 1 178|pir||JQ1580 major surface antigen - hepatitis 
B virus (subtype adw4q- 3 strains CNTS-38 and Fou) 
emb|CAA53344.1| (X75658) surface antigen [Hepatitis 
B virus] 


6.7 


652 


1 AC 


gi|7503 145|pir||T163 15 hypothetical protein F41C3.6 - 
Caenorhabditis elegans gblAAC46813.1| (U23521) 
asked finisher to look for frameshift because of 
discrepancy between prediction and EST CEESW76F; 
looks like there should be a splice irom zi/Mtozi/uo. 
But no problem was found in the area [Caenorhabditis 
elegans] 


6.9 


653 


14773502 


gi|14773502|ref|XPJ)43252.1| PCAF associated fector 
65 alpha [Homo sapiens] gb|AAH08785.1|AAH08785 
fRC008785^ PCAF associated factor 65 alpha [Homo 
sapiens] 


3E-10 


658 


14732840 


gi|14732840|ref|XP_034110.1| hypothetical protein 
XP 034110 [Homo sapiens] 


1.8 


659 


13442965 


gi|13442965|gb|AAK26242.11AF247132_l(AF247132) 
putative chromatin remodeling factor [Mus musculus] 


4 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


660 


4512671 


gi|4512671|gb|AAD21725.1| (AC006931) En/Spm-like 
transposon protein [Arabidopsis thaliana] 

factor 1 [Arabidopsis thaliana] 

ahlAAD33R69 1IAF141376 1 fAF141376i nrotodermal 
factor 1 [Arabidopsis thaliana] 


6.6 


663 


13385330 


glj X jOO JJ V|IC'X|xN.x \JO\J 1Z. / , 1| IXAJlSJljIN WJL/l^l^V. 

1700010114 gene [Mus musculus] dbj|BAB24262.1| 
fAK005830i nutative FMus musculusl 


0.33 


665 


14325583 


gi|14325583|dbj|BAB60486.1| (AP000996) unknown 
nroduct 1 1 hermoDlasma volcaniuml 


5.4 


666 


14779941 


gi|14779941|ref|XPJ)07847.3| hypothetical protein from 
clone 24796 [Homo sapiens] 


0.00000002 


670 


14330407 


iriil4330407lemb1CAC41079 11 fAJ308518i P2X5 
receptor [Gallus gallus] 


3.3 


672 


13QQ1595 


gi|13991595|gb|AAK51427.1| (AF355796) BRCA1 j 

1 Krhvm i "np.r?j IcalnHiil 

1 JJvUYiillUwl W JCkCM U 


6.9 


681 


14728540 


gi|14728540|refpO > _04826Ll| similar to ring finger 
protein 23; RING-B box-coiled coil-B30.2 (M. 
musculus) [Homo sapiens] 


0.026 


683 


1474.0??? 


gi|14740222|ref|XPJ)49918.1| similar to hypothetical 

r»rrvtf»m IV/fOC' 1 0Q40 CW oonipn^ n-Trvmo ^arvien^i 1 

pi twill 1V1VJ V/ 1 v7*tv \X A. i3u.Uyl.VXXo ) |X XVXlXlVJ OCLL/XwXAOJ 


1E-86 


684 


11359451 


gi|l 135945 l|pir||T5 1040 hypothetical protein 
B15I20.100 [imported] - Neurospora crassa 
emb]CAB97464.1| (AL389900) conserved hypothetical 
protein [Neurospora crassa] 


9.2 


685 


14735526 


gi|14735526|ref|XP_00 1804.4| guanylate binding protein 
? interferon -inducible PHomo saoiensl 

IXll^wX Ivl V/ll AXXUlXlrXl/XlS ^XXUAilV/ i3C*JLyXtsAXOJ 


2.3 


687 


7510353 


gi|7510353|pir|iT27275 hypothetical protein Y63D3A.9 • 
Caenorhabditis elegans emb|CAA21711.1| (AL032652) 
cunxaiiis siiiiiaciaily tu a x«xux uuiivuu. rruuvj^u \x uua. 
domain.), Score=43.2, E-value=1.9e-09, N=l 
[Caenorhabditis elegans] 


6.8 


697 


14773502 


gi|14773502|refpa>_043252.1| PCAF associated fector 
65 alpha [Homo sapiens] gb|AAH08785.1|AAH08785 
(BC008785) PCAF associated factor 65 alpha [Homo 
sapiens] 


3E-10 


700 


7498998 


gi|7498998|pir||T16057 hypothetical protein F13D1 1.2 - 
Caenorhabditis elegans 


0.27 
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SEQID 
NO 


ACCESS 
N 

J. ^ 


DESCRIPTION 


P VALUE 


703 




gi|4493893|emb|CAB39002.1| (AL034558) predicted 
using hexExon; MAL3P2.15 (PFC0230c), Hypothetical 
nrntpin Iftn* 3979 aa rPl a <*rnndiiirn falcioaiTiiTil 


9.2 


706 


7243081 


gi|7243081|dbj|BAA92588.1| (AB037771) KIAA1350 
protein [Homo sapiens] 


1E-79 


707 


7496384 


gi|7496384|pir||T33266 hypothetical protein C24B9.10 - 
Caenorhabditis elegans gb|AAC19256.1| (AF068709) 
Hvnnthetical nrotein C24B9 10 ICaenorhabditis 
elegans] 


5.4 


708 


7492007 


gi|7492007|pir||T4 1 680 hypothetical protein j 
SPCP1E1 1.01c - fission yeast (Schizosaccharomyces 
nombei emblCAB54860 11 <AL 117183) hvDOthetical 
protein [Schizosaccharomyces pombe] 


4.1 


709 


147SRS07 

it / JOJU / 


gi|14758507|reflXP_044768.1| similar to death receptor 

£ fi-T «3anif*TiQ^ ITTniTio sarviensl 


0.031 


710 


6513925 


gi|65 13925|gb|AAF14829. 1|AC01 1664_1 1 (AC01 1664) 
unknown protein [Arabidopsis thaliana] 


0.83 


713 


6434714 


mlfid^4714lf»mhlCATl61 161 11 fAl\132973) mutative 
membrane protein. [Streptomyces coeHcolor A3 (2)] 


1.2 


714 


RR 1 0476 


gi|8810476|gb|AAF80137.1|AC024174 - 19(AC024174) 
Contains similarity to a hypothetical protein F24K9.13 
gi|6006885 from Arabidopsis thaliana gblAC008153 
and contains multinle PPR PFI01535 reoeats 


0.22 


719 


9626839 


gi|9626839|ref|NP_041 109.1| ORF 18 [ictalurid 
herpesvirus 1] sp|Q00120|VG18_HSVIl 

trVT>r»THTTTfr , AT rtTTNTP 1R PPOTPTMnirllA'3fi7RR 

hypothetical protein ORF 18 - ictalurid herpesvirus 1 
(strain auburn l)gb|AAA88121.1| (M75136) ORF 18 
[ictalurid herpesvirus 1] 


9.2 


723 


15004903 


gi|15004903|gb|AAK77203. 1|AC006622_3 
(AC006622) Hypothetical protein C52D10.12 
[Caenorhabditis elegans] 


2.4 


724 


6677663 


gi|6677663|reflNP_033047.1| retinoic acid induced 1 
[Mus musculus] pir||T30250 GT1 protein - mouse 
dbj|BAA06184.1| (D29801) Unknown [Mus musculus] 
prf)|2123391A GT1 gene [Mus musculus] 


0.12 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


729 


2343183 


gi|2343183|gb|AAB67715.1| (AF013243) zinc finger 

LlCllloV-/lipLlUll LaxsLKJl |_JUy IvldJJIlUo Veil ICgd. LUo J 




730 


7465334 


gi|7465334|pir||B71980 proline dehydrogenase (EC 
1.5.99.8) / l-pyrroline-5-carboxylate dehydrogenase 
(EC 1.5. 1.12) putA [similarity] - Helicobacter pylori 
(strain J99) gb|AAD05632. 1| (AE001444) 
Proline/pyrroline-5 -carboxylate dehydrogenase 
[Helicobacter pylori J99] 


4.9 


733 


063 1 436 


gi|9631436|ref|NP_048287.1| ORFMSV216 SCGgene 
family protein [Melanoplus sanguinipes 
entomopoxvirusj pir||l2oo/7 UKr MJSvzlo JSCLrgene 
family protein - Melanoplus sanguinipes 
enioinopoxvirus gu|r\/iA^-7 / / . 1 1 \J\r uo.3 ©oo ^ v/Jvr 
MSV216 SCG gene family protein [Melanoplus 

Q^ncnnniripc pntnrnfYrvYvvinicl 


V/. JZ, 


734 


13813168 


gi|13813168|gb|AAK40402.1| (AE006645) Hypothetical 
protein [Sulfolobus solfataricus] 


2.7 


736 


14776978 


gill4776978lreffSP_033086.il hypothetical protein 
XP_033086 [Homo sapiens] 


5.4 


737 


7904.X63 


gi|7294863|gb|AAF50194.1| (AE003550) CG6718 gene 
prvMiucv ^jL^rosopiuia ineiaiiogasierj 


1 Q 


745 


7465334 


gi|74653341pir||B71980 proline dehydrogenase (EC 
1.5.99.8) / 1 -pyrroline-5 -carboxylate dehydrogenase 
(EC 1.5.1.12) putA [similarity] - Helicobacter pylori 
(strain J99) gb(AAD05632. 1| (AE001444) 
x^roijxie/pyiToiiiic-3 -ca.ru oxyiaxe aenyarogenase 
[Helicobacter pylori J99] 


3.9 


750 


5759216 


gi|5759216|gb|AAD51030.1|AF172399_l (AF172399) 
p75 neurotrophin receptor a-1 [Xenopus laevis] 
gb|AAD51031.1|AF172400_l (AF172400)p75 
neurotrophin receptor a-2 [Xenopus laevis] 


3.9 


751 


6518511 


gil6518511|dbjlBAA87907.1| (AB021222) ketosynthase 
[Streptomyces rochei] 


6.6 


754 


6679777 


gi|6679777|ref[NP_032029. 1| fibroblast growth factor 
15 [Mus musculus] sp|035622|FGFFJV10USE 
FIBROBLAST GROWTH FACTOR-15 
PRECURSOR (FGF-15) gb|AAB63197.1| (AF007268) 
fibroblast growth factor [Mus musculus] 
dbj|BAB30961.1| (AK017829) putative [Mus musculus] 


7.7 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


755 


7498881 


gi|7498881|pir||T20748 hypothetical protein F11A5.2 - 
Caenorhabditis elegans emb|CAB07355.1| (Z92830) 

V/UXlldJXlb oxlLlJxcUxiy IU x lolll UlUlldXXX. X \J X WT yt x ivx 

chemoreceptor), Score=428.7, E-value==1.7e-125,N==l 
Caenorhabditis elegans] 


6.4 


756 


1378135 


<nl1 ^7R1 ^lahlA 11 rtJ58466 > i nutative 
fibroblast growth factor receptor 1-like protein; one of 
three nossible onen reading "frames in the seouence 
Hattus norvegicus] 


5.1 


758 


7444901 


gi|7444901|pir||B71630 hypothetical protein RP702 - 
Rickettsia prowazekii emb|CAA15138.1| (AJ235273) 
unknown [Rickettsia prowazekii] 


2.2 


759 


12854427 


gi|12854427|dbj|BAB30025.1| (AK0 15906) putative 
Mus musculus] 


6.6 


763 


4185892 


gi|4185892|emb|CAA21831.1| (AL033125) 1- 
evidence=predicted by content~l- 
method=genefinder; 084- 1 -evidence_end~2- 
evidence=predicted by match~2- 
maicn accession — d w ioo"rivv/ 1 .rjozuj~z.- 
match_description=HYPOTHETICAL 77.9 KD 
PROTFTN TN RRN1 0-MCM2 INTERGEN1 C 

x IVv/ x xJxxS XX > jajvli lv xvxv^ivxx* x± ^ iuxvujjiiiv/ 

REGI0N.~2-match species=SACCHA> 


9.1 


766 


14725855 


cHI147'7S8S^lreflXP 038865 1 1 similar to hvDOthetical 
protein PR02822 (H. sapiens) [Homo sapiens] 


9.2 


770 




gi|7582302|gb|AAF64271.1|AF208857_l (AF208857) 
RM-015 rHomo saniensl 


8E-13 


772 


6594283 


ml^SO^R^IHhilRAAXXdlQ 11 fAB016615i hvdronhobic 
transmembrane protein [Staphylococcus aureus] 


1.7 


783 


12597803 


oil12507803lffblAAG60115 1IAC073178 26 

till X ^ I O \J ' l&U riiivlvu X X ♦ A 1* k^yw f A. 1 w A#VS 

(AC073 178) hypothetical protein [Arabidopsis thaliana] 


0.62 


784 


8777303 


mlR777^0^IHhilBAA96893 11 <AB018112i serine 
carboxypeptidase [Arabidopsis thaliana] 


1.2 


785 


10045258 


gi|10045258|emb|CAC07930.1| (AJ228487) cytochrome 
c oxidase subiinit 3 [Acanthogammarus godlewskii] 


1.5 


786 


1794167 


gi|1794167|dbj|BAAl 1217.1| (D78137) unnamed 
protein product [Vibrio parahaemolyticus] 


9.7 


790 


12835698 


gi|12835698|dbj|BAB23329.1| (AK004489) putative 
[Mus musculus] 


8.1 
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SEQ ED 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


791 


131761 


1 /ol|Sp|F 1 io3o|yAY_JNJ}UCK V?UJUNAlJi 
PERMEASE (QUINATE TRANSPORTER) 
pir||G3 1277 quinate transport protein - Neurospora 

vrdbbd. ^ICHuiUVC SC(^UCI1L/C^ CIIlU|^xVr\-j-£. / JZ, 1| ^AltUUJ ) 

quinate transporter [Neurospora crassa] 


1.7 ! 


794 


11640576 


gi| 1 1640576|gb|AAG39285. 1|AF 1 13214_1 (API 13214) 
MSTP034 [Homo sapiens] 


6.3 


797 


5306243 


gi|5306243|gb|AAD41976. 1|AC006438_8 (AC006438) 
hypothetical protein [Arabidopsis thaliana] 


3.4 


798 


7511787 


gi|7511787|pir||T29089 alpha-mannosidase (EC 3.2.1.-), 
class II - fell armyworm gb|AAB62719.1| (AF005034) 
alpha-mannosidase II [Spodoptera frugiperda] 


7 


800 


3170793 


gi|3170793|gb|AAC18201.1| (AF062165) 
immunoglobulin heavy chain variable region [Homo 
sapiens] 


8.7 


802 


11466458 


gi|11466458|ref]NP_038161.1| NADH dehydrogenase 
subunit 4L [Chrysodidymus synuroideus] 
gb|AAB95105.1[ (U54633) N ADxi dehydrogenase 
subunit 4L [Chrysodidymus synuroideus] 
gb|AAF36927.1|AF222718_l (AF222718) NADH 
dehydrogenase subunit 4L [Chrysodidymus 
synuroideus] 


4.2 


803 


14779941 


glj i*f / / yyH' l |re±|-<vr / o*t / . .> | nypoineucdi piousm xrum 
clone 24796 [Homo sapiens] 


0.00000002 


804 


13430868 


gi| 134JUooo|rel]JN.r_u / /OUo. 1| hypothetical protein 
MGC2615 [Homo sapiens] refpOPJ)15759.1| 
nypotneuLcai protein ivivjvyZoiD |oomo sapiens j 
gb|AAH01656.1|AAH01656 (BC001656) Unknown 
(protein for MGC:2615) [Homo sapiens] 


4E-21 j 


808 




gi|3293234|gb|AAC25913.1| (U96413) T-DNA 
oncoprotein i Agio uac icii uin iuineiacieiii>j 


7 9 


810 


7299169 


gi|7299169|gb|AAF54367.1| (AE003682) Fps85D gene 
product [alt 1] [Drosophila melanogaster] 


5.6 


814 


11361627 


oill 13Mf>?7lnirllH8?7'}6 hvnnthetieal nrntein XF0QQ2 

[imported] - Xylella festidiosa (strain 9a5c) 
gb|AAF83802.1|AE003937_6 (AE003937) hypothetical 
protein [Xylella festidiosa 9a5c] 


0.43 


816 


1 10803160 


gi|10803160|emb|CAC13088.1| (AL445503) putative 
two component system response regulator [Streptomyces 
coelicolor] 


6.4 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


819 


2613090 


gi|2613090|gb|AAB84279.1| (AF030317) OmpR 
Proteus vulearisl 


1.7 


821 


2494232 


gi|2494232|sp|Q628 14|E2F5_RAT TRANSCRIPTION 
'ACTOR E2F5 <E2F-51 eblAAB00180 11 (1131668') 
E2F-5 [Rattus norvegicus] 


7.8 


822 


14729752 


gi|14729752|ref|XPJ)36388.1| hypothetical protein 
VIGC5297 fHomo saniensl 


6.7 


825 


7024427 


gi|7024427|emb|CAA87594.2| (Z47547) 

Pwi=or£234 .l;Mpo=or£244;Angio=orfe~no ATG start 

codon [Chondrus crispus] 


8 


833 


11691811 


gi|11691811|emb|CAC11114.1|(AL121893) 
bA189K21.6 (Sec23 (S. cerevisiae) homolog B) [Homo 
saniensl 


5.6 


834 


117704 


gi|117704|sp|P18246|CXAl_BOVIN GAP JUNCTION 

ALPHA- 1 PROTEIN (CONNEXIN 43) (CX43) 
(VASCULAR SMOOTH MUSCLE CONNEXIN 43) 
pir|[A36623 gap junction protein Cx43 - bovine 
gb|AAA30459. 1| (J05535) vascular smooth muscle 
connexin43 [Bos taurus] 


9.2 


835 


1170606 

Ail \JV-JVj\J 


gi| 1 170606|sp|P43 1 88|KADC_MAIZE ADENYLATE 
KINASE, CHLOROPLAST (ATP -AMP 
TRANSPHOSPHORYLASE) pir||S45634 adenylate 
kinase ycA^ 2. 7.4.3), cnioropiast - maize pauj iz^viv|/y 
Chain A, Adenylate Kinase From Maize In Complex 
With The Inhibitor Pl,P5-Bis(Adenosine-5'- 
jpeniapQuspiiux© \j\pja,) pu.ujiz_»r\j>w|A> v/iuuii jj, 
Adenylate Kinase From Maize In Complex With The 
Inhibitor P l,P5-Bis(Adenosine-5 , -)pentaphosphate 


0.0000001 


836 


7661750 


gi|7661750|refINP_054866.1| HSPC047 protein [Homo 
sapiens] gb|AAF29019.1]AF161532_l (AF161532) 
HSPC047 [Homo sapiens] 


5E-28 


838 


13815530 


gi|13815530|gb|AAK42398.1| (AE006828) Transport 
protein, hypothetical [Sulfolobus solfataricus] 


5 


839 


14771844 


gi|14771844|ref|XP_045585.1| hypothetical protein 
FLJ11085 [Homo sapiens] emb|CAB91047.2| 
(AL109935) dJ1022P6.2 (KIAA1434) [Homo sapiens] 


0.089 


843 


4007786 


gi|4007786|emb|CAA51374.1| (X72850) acetyl CoA 
\ acetyltransferase [Sphingomonas sp.] 


4.6 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


847 


9968471 


gi|9968471|emb|CAC06698.1| (AJ275988) Kruppel like 
factor [Mus musculus] 


6.2 


848 


7499805 


gi|7499805|pir||T21337 hypothetical protein F25D7.1 - 
Caenornabaitis elegans emo| wvtsu 1 wo. 1 1 (jl / 54 1 oj , 
cDNA EST yk60g6.3 comes from this gene-cDNA EST 
yk60g6.5 comes from this gene-cDNA EST 

yKlJOOlv.J COIIlCb JJUm LOla ^CllC^V^J-/l>-r\. JZfO X 

ykl56bl0.5 comes from this gene-cDNA EST 
yk399f2.3 comes from this gene-cDNA EST yk399£2.5 
comes from this gene~cDN> 


7.1 


849 


6581093 


gi|6581093|gb|AAF18453J|AF205599_l (AF205599) 
transposase-like protein (Mus musculus] 


4.4 


853 


13651256 


gi|13651256|ref|XP_015436.1| chloride channel 4 

fT T , . . . , _ -.1 -.—XI'XTT^ f\ A 1| /.'Linn J A s>T«*>vk*«£*1 A 

[rlomo sapiensj reijXr^iHD /Do. i| cnionae cnarmei 4 
[Homo sapiens] refjXP_045756.1| chloride channel 4 

[riomo sapiens j tg±\aj: vhd i d i . 1 1 ciiioriuc wn<inTici *t 

[Homo sapiens] dbj|BAA77327.1| (AB019432) chloride 
channel protein 4 [Homo sapiens] 
c*hlAAD50981 1IAF170492 1 <AF 170492} chloride 
channel CLC4 [Homo sapiens] 


2.3 


854 


336159 


gi|336159|gb|AAA46774.1| (M90520) polymerase 
protein [Woodchuck hepatitis B virus] 


3.2 


861 


8979124 


gi|8979124|dbj|BAA98959.1| (AP002547) CT651 
hypothetical protein [Chlamydophila pneumoniae J138] 


0.76 


863 


4758712 


gi|4758712|ref|NP_004659.1| alpha-glucosidase; brush 
border hydrolase [Homo sapiens] 
sp|043451|MGA_HUMAN MALTASE- 
GLUCOAMYLASE, INTESTINAL [INCLUDES: 

TV 4" AT TACT? /'AT PTTA OX TTfnQTTl AQT?V 

GLUCOAMYLASE (GLUCAN 1,4-ALPHA- 
GLUCOSIDASE)] gb|AAC39568.1| (AF016833) 
maltase-glucoamylase [Homo sapiens] 


0.1 


864 


13812050 


gi|13812050|re£lNP_113184.1| hypothetical protein 
[GuiUardia theta] gb|AAK39752. 1|AF083031_109 
(AF08303 1) hypothetical protein [Guillardia theta] 


4.3 


867 


10726408 


gi|10726408|gb|AAF54394.2| (AE003683) CG9381 
gene product [Drosophila melanogaster] 


5.5 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


870 


7299899 


cril77QQRQQIffhlA AF55074 11 fAFOO^f^ CG7987 cene 
product [alt 1] [Drosophila melanogaster] 
gb|AAF55075.1| (AE003705) CG7987 gene product [alt 
2] [Drosophila melanogaster] 


9.7 


872 


6979325 


m!6979325leblAAF34418 1IAF172282 7 (AF172282) 
putative phosphatidylinositol 4-kinase [Oryza sativa] 


4.8 


873 


13699930 


g^l I joyy"^u|aDj}ij/\r>T'i^z5'. i| ^atwji^^ cuiiocivcu. 
hypothetical protein [Staphylococcus aureus subsp. 
ahi-piks T>ni51 dhilBAB56175 11 CAP003358} conserved 
hypothetical protein [Staphylococcus aureus subsp. 
aureus Mu50] 


3.7 ' 


877 


6841256 


gil68412561gb|AAF28981.1|AF161421 - l (AF161421) 
HSPC303 [Homo sapiens] 


7E-64 


879 


2459878 


gi|2459878|gb|AAC40459.1| (AF005734) glycoprotein 
precursor [Marburg virus] 


2.5 


880 


14759292 


gi|14759292|ref]XP_006700.2| checkpoint with forkhead 

?»nrl rintj "FincTf»r Hnirui'i'nQ rHntTin CJiniRTI^l 

CUlvl J. 111k liUgvl vllJxllxTllxD L A acl£S£wxloJ 

gb|AAF91084.1|AF170724_l (AF170724) cell cycle 
checknoint nrotein CHFR IHomo saoiensl 


0.000000002 


886 


141164 


gi|141164|sp|P19297(YORM_TTVl HYPOTHETICAL 
38 6 KD PROTEIN emblCAA32993 11 (X14855) URF 
(352 AA) [Thermoproteus tenax virus 1] 


2.2 


889 


1755061 


gi|1755061|gb|AAB50394. 1| (U62737) photosystem I 
subunitXI [Synechococcus sp. PCC 7942] 


3.6 


890 


7464739 


gi|7464739|pir||G71920 hypothetical protein jhp05 18 - 
Helicobacter pylori (strain J99) gb|AAD06099.1| 
(AE001485) putative [Helicobacter pylori J99] 


7.6 


891 


14141161 


gi|14141161|reflNP_004492.2| heterogeneous nuclear 
ribonucleoprotein U, isoform b; hnRNP U protein; 
scaffold attachment factor A, pizu nuclear protein 
[Homo sapiens] gb|AAH03367.1|AAH03367 
(BC003367) heterogeneous nuclear ribonucleoprotein U 
(scaffold attachment factor A) [Homo sapiens] 
gb|AAH03621.1|AAH03621 (BC003621) heterogeneous 
nuclear ribonucleoprotein U (scaffold attachment factor 
A) [Homo sapiens] 


6.9 


892 


7649887 


gi|7649887|dbj|BAA94 165.1| (AP000422) tail fiber 
protein [Escherichia coli 0157:H7] 


0.63 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE i 


901 


1 


gi|7496871|pir||T19650 hypothetical protein C32H11.7 - 

cDNA EST yk576f6.3 comes from this gene 
[v^aenornauuiiio oicgdnoj 


4.6 


905 


13400109 


gi|13400109|gb|AAK21974.1| (U77931) rRNA 
promoter binding protein [Rattus norvegicus] 


0.003 


918 1 


13235458 


gi|13235458|emb|CAC3375U| (AJ293322) ProP2 
protein [Rickettsia typhi] 


4.1 


919 


7292165 


ml79Q916SlohlAAF47S77 11 fAF003472 , l CG12023 
gene product [Drosophila melanogaster] 


0.25 


920 


12831207 


gi|12831207|refINP_075579.1| gamma-aminobutyric 
acid (LrAoA-Aj receptor, suounii epsiion [jvauus 
norvegicus] gb|AAP70383.1|AF189262_l (AF189262) 

norvegicus] 


4.5 


926 


10765285 


<*ilia7^9RSlchlA A099Q71 HAF1M4.31 1 <AF18343r* 
inhibitor of apoptosis protein 2 [Rattus norvegicus] 


4.2 


927 


2832268 


gi|2832268|gb|AAC15477. 1| (AF043233) Caco-2 
oligopeptide transporter [Homo sapiens] 


5.1 


928 


6179898 


gi|6179898|gb|AAF05702.1|AF190129_l (AF190129) 
Na+/K+/2C1- cotransporter [Callinectes sapidus] 


6.1 


931 


12644035 


gi| 12644035|sp|Q9Z8Ml|IF2_CHLPN 
TRANSLATION INITIATION FACTOR IF-2 
pir||E81576 translation initiation factor 2 CP0440 
[imported] - Chlamydophila pneumoniae (strain AR39) 
gb|AAF38279,l| (AE002205) translation initiation 
factor 2 [Chlamydophila pneumoniae AR39] 
dbj|BAA98527.1| (AP002546) initiation foctor-2 
[Chlamydophila pneumoniae J138] 


0.91 


932 


10956362 


gi|10956362|ref|NP_052811.1| pXOl-115 [BaciUus 
anthracis] pir||C59105 hypothetical protein pXOl-1 15 - 
Raeillns anthracis virulence nlasmid oXOl 
gb|AAD32419.1|AAD32419 (AF065404) pXOl-115 
[Bacillus anthracis] gblAAD41347. 1|AF150965_1 
(AF 150965) resolvase X [Bacillus anthracis] 


1.9 


933 


13814214 


gi|13814214|gb|AAK41294.1| (AE006722) Conserved 
hypothetical protein [Sulfolobus solfetaricus] 


0.99 
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SEQID 
NO 


ACCESS 
N 


- DESCRIPTION 


P VALUE 


940 


6320195 


gi|6320195|re£|NP_010275.1| Ydl009cp 
[Saccharomyces cerevisiae] pir||S525 10 probable 
membrane protein YDL009c - yeast (Saccharomyces 
cerevisiae) emb|CAA88350.1| (Z48432) unknown 
[Saccharomyces cerevisiae] emb|CAA98568.1| 
(Z74059) ORF YDL009c [Saccharomyces cerevisiae] 


7 


942 


10181130 


gi|10181130|reflNP_065586.1| acid sphingomyelinase- 
like phosphodiesterase 3a [Mus musculus] 
emb|CAA69329.1| (Y08135) acid sphmgomyelinase-like 
phosphodiesterase [Mus musculus] 


6E-84 


943 


13124718 


gi|13 124718|sp|P54358|DPOD_DROME DNA 
POLYMERASE DELTA CATALYTIC SUBUNIT 
gb|AAF49555. 1| (AE003529) DNApol-delta gene j 
product [Drosophila melanogaster] 


2.8 


946 


2258300 


gi|2258300|gb|AAB63269.1| (AF001783) AgrC 
[Staphylococcus aureus] 


1.9 


947 


7460037 


eil7460037ioirHT13431 hvDOtheticaloroteinT17A13 20 
Arabidopsis thaliana emb|CAB79678. 1| (AL161574) 
hypothetical protein [Arabidopsis thaliana] 


3.4 


948 


7494291 


gi|7494291|pir||E71616 hypothetical protein PFB0365w 
malaria parasite (Plasmodium falciparum) 
gb|AAC71862.1| (AE001390) hypothetical protein 
[Plasmodium falciparum] 


1.9 


950 


1707627 


gi|1707627|emb|CAA96381.1| (Z71701) cytochrome 
oxidase subumt III [Euhadra herklotsi] 


9.1 


952 


6739602 


gi|6739602|gb|AAF27330.1| (AF178534) talin [Homo 
sapiens] 


1E-14 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


954 


7498956 


gi|7498956|pir||T20804 hypothetical protein F12F6.1 - 
Caenorhabditis elegans emb|CAA97791.1| (Z73425) 
cDNA EST yk30a8.3 comes from this gene-cDNA EST 
ykl2dll.5 comes from this gene-cDNA EST yk30a8.5 

comes from this gene-cDNA EST ykl 17g9.3 comes 
from this gene-cDNA EST ykl 13hl 1 .5 comes from this 

gene-cDNA EST ykl 17g9.5 comes from this 
gene^cuiN/Y^ emu j \~>i\i\y / /yo. i| yiu i d^zsj) cj-/in/\ i 
yk30a8.3 comes from this gene-cDNA EST ykl2dl L5 
comes from this gene-cDNA EST yk30a8.5 comes from 
this gene-cDNA EST ykl 17g9.3 comes from this 
gene-cDNA EST ykl 13hl 1.5 comes from this 
gene-cDNA EST ykl 17g9.5 comes from this 
gene~cDNA> 


6.6 


955 


9631417 


gil9631417jrefINP_048325.il ORF MSV254 leucine 
rich repeat gene family protein, similar to Amsacta 
moorei entomopoxvirus Q3 ORF SW:P28854 
[Melanoplus sanguimpes entomopoxvirusj pir||izo4i:> 
ORF MSV254 leucine rich repeat gene family protein - 

Melanoplus sanguimpes entomopoxvirus 
<*VJA APQ77^n 11 fAF0<v3ttfif>l ORF MSV254 leucine 
rich repeat gene family protein, similar to Amsacta 
moorei entomoooxvirus 03 ORF SW*P28854 
(Melanoplus sanguinipes entomopoxvirus] 


2.1 


956 


7160126 


gil7 160 126|emb|CAB763 12.1| (AL158060) putative 
membrane nrotein rStreDtomvces coelicolor A3 (2)1 


0.25 


959 


1203965 


gi|1203965|gb|AAA89173.1| (L42379) bone-derived 
growth factor [Homo sapiens] 


1.4 


960 


585147 


gi|585147|sp|P38094|FLUG_EMENI FLUG PROTEIN 

pir|(A53186 fluG protein - Emericella nidulans 
gb|AAC37414.1| (L27817) FluG [Emericella nidulans] 


5.4 


963 


11352695 


gj|l 1352695|pir||D83270 transcription-repair coupling 
protein Mfd PA3002 [imported] - Pseudomonas 
aeruginosa (strain PAOl) 
gb|AAG06390.1|AE004725_3 (AE004725) 
transcription-repair coupling protein Mfd [Pseudomonas 
aeruginosa] 


1.3 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


964 


119777 


gi|119777|sp|P16291|FA9_SHEEP COAGULATION 
FACTOR IX (CHRISTMAS FACTOR) pir||I47078 

coagulation fector IXa (EC 3.4.21.22) - sheep 
ffbiOTTienrt eblAAA31520 11 flvl262331 factor EX TOvis 

aries] 


8.2 


965 


7473507 


gij l** f i |pir|jr < / jjo / proDauie giytxj5yiucuioACia.»c 
Deinococcus radiodurans (strain Rl) 
^hlAAF12451 1IAE001863 76 (AE001863) 
glycosyltransferase, putative [Deinococcus radiodurans] 


0.24 


967 


2494130 


gi|2494130|gb|AAB80639.1| (AC002376) Contains 
similarity to Glycine SRC2 (gb|AB000130). 
[Arabidopsis thaliana] 


5.2 


972 


10728660 


ml10728660lffblAAF52603 21 fAE003620) CG8683 
gene product [Drosophila melanogaster] 


6E-12 


980 




gi|7500037|pir||T34063 chromosome segregation protein 
smcl F28B3.7 [similarity] - Caenorhabditis elegans 
gb|AAK21378.1| (AF003136) contains similarity to 
ATP wnthase subunit B rCaenorhabditis eleeansl 


0.68 


981 


7298917 


gi|7298917|gb|AAF54122.1| (AE003675) CG10272 
gene product [Drosophila melanogaster] 


5.9 


983 


1170115 


GLUTATHIONE S -TRANSFERASE 1 (GST CLASS- 
THETA) gblAAA92880.1| (L32091) glutathione S- 
transferase [Manduca sexta] 


7.1 


985 


10864490 


<»in08fi4490lffblAAG24203 11 (AF022981'> Hypothetical 
protein W03F9.6 [Caenorhabditis elegans] 


5.3 


987 


JLUJ*fUU 


gi|105400lpir||A35648 B-cell adhesion protein CD22 
alpha splice form precursor - human emb|CAA36988. 1| 
(X52785) CD22 antigen [Homo sapiens] prf||1608208A 

T5 neW n n+i crf*n T^C^OO n-Tnmr» Q5ir»i<*n^l 

JD dCJJ. dllUgvIl JLf\^£m£t \t- X\Jl.\.\\J OCLjJ A V^I-LO J 


6 


989 


14750376 


gi|14750376lreflXP_012394.3| myosin IXA [Homo 
sapiens] 


3.9 


990 


6324560 


gi|6324560|reflNP_014629.1|Yol013w-ap 
[Saccharomyces cerevisiae] pir||S78736 protein 
YOL013w-a - yeast (Saccharomyces cerevisiae) delta 
remnant 


9.3 


991 


7290503 


gi|7290503|gb|AAF45956.1| (AE003431) CG3527 gene 
product [Drosophila melanogaster] 


7.6 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


994 


9964623 


gi|9964623|reflNP_064753.1| RP DNApol [Roseophage 
SIOl] gb|AAG02598.1|AF189021_17 (AF189021) 
Roseophage SIOl complete genome 


3.5 


996 


27695-38 


gi|2769538|emb|CAAl 1217.1| (AJ223279) voltage- 

qptiqitivp ^nHinnri pTismrtpl rP1ntf»11a va/1 nctfO 1^1 


7 ft 


998 


9628506 


g^|yo2o5Uo|rei|JNJ J jU433o4.1| regulatory protein bz 
(Human papillomavirus type 29] 
sp|P50772|VE2_HPV29 REGULATORY PROTEIN 
E2 gb|AAA79432.1| (U31784) regulatory protein E2 
[Human papillomavirus type 29] 


6.2 


999 


6566147 


protein [Drosophila melanogaster] ; 


0.085 


1002 


11346920 


gi|11346920|pir||H81390 probable integral membrane 
protein Cj0461c [imported] - Campylobacter jejuni 
(strain NCTC 11168) emb|CAB75099.1| (AL139075) 
putative integral membrane protein [Campylobacter 
jejuni] 


1.5 


1005 


9294038 


gi|9294038|dbj|BAB01995.1| (AB020746) 
gene_id:MOB24.1~unknown protein [Arabidopsis 
thaliana] 


9.8 


1006 


4505067 


gi|4505067|reflNPj002349.1| MAD2-like 1; mitotic 
arrest deficient, yeast, homolog-like 1 (Homo sapiens] 
sp|Q13257|MD21_HUMAN MITOTIC SPINDLE 
ASSEMBLY CHECKPOINT PROTEIN MAD2A 
(MAD2-LIKE 1) (HSMAD2) pir||G01942 mitotic 
feedback control protein Madp2 homolog - human 
gb|AAC52060.1| (U31278) mitotic feedback control 
protein Madp2 homolog [Homo sapiens] 
gb]AAC5078Ll| (U65410) Mad2 [Homo sapiens] 
emD|CAAUjy4o.i| (ajuiiuioo; jyi/ydz [riomo sapiens j 
gb|AAH00356. 1 |AAH00356 (BC000356) MAD2 
(mitotic arrest deficient, yeast, homolog)-like 1 [Homo 
sapiens] gb|AAH05945.1|AAH05945 (BC005945) 
MAD2 (mitotic arrest deficient, yeast, homolog)-like 1 
[Homo sapiens] gb|AAK38174.1| (AF202273) MAD2- 
like protein 1 {Homo sapiens] 


0.000005 



473 



.021 4500A2_I_> 



WO 02/14500 



PCT/US01/25840 





Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 1 


SEQBD 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1010 


13994140 


>i| i J y y*+ i*f ujrei|rN r _y)j zo . x \ antigen py / ^iiicicuiuiiici 
associated) identified by monoclonal antibodies 133.2 
and 96.5 [Mus musculus] dbj|BAA86655.1| 
(AB024336) membrane-bound transferrin-like protein 
n07 nUn<: mii<?ciilii<;l dbilBAB41139 11 (AB047799) 
membrane-bound transferrin-like protein [Mus 

nriiQdiliisl 


4.1 


1011 


11465559 


gi|11465559|re£INP_045049.1| unknown [Cyanidium 
caldarium] gb|AAF12997.1|AF022186JL69 
(AF022186) unknown [Cyanidium caldarium] 


1.8 


1012 


13421288 


gi|13421288|gb|AAK22158.1| (AE005691) TonB- 
dependent receptor [Caulobacter crescentus] 


0.65 ' 


1013 


6978481 


gi|6978481|ref|NP_036905.1l a-kinase anchoring protein 
Kattus norvegicusj spivoz^zh-kvjvi A _^ rt - JL ^ j\jj.n/voji 
ANCHOR PROTEIN 1 1 (PROTEIN KINASE A 

AMr'WYRTTdri PPOTPTM 1 W fPPKAl W (A KINASE 

ANCHOR PROTEIN 220 KDA) (AKAP 220) 

nirllT497^9 A -kinase anchnrinff nrotein AKAP 220 - rat 
gb[AAB06559.1| (U48288) AKAP 220 [Rattus 
norvegicus] 


3 


1015 


7498137 


gi|7498137|pir||T31694 hypothetical protein D1065.2 - 
Caenorhabditis elegans 


7.4 


1017 


9944230 


ffi!9944230lemblCAC05416 11 (AJ400866) membrane 
tyrosine phosphatase [Bos indicus] 


7 


1023 


14520862 


gi|14520862)ref{NP_126337.1] hypothetical protein 
[Pyrococcus abyssi] pir||G75106 hypothetical protein 
PAB0444 - Pyrococcus abyssi (strain Orsay) 
emblCAB49568 11 (AJ248285) hypothetical protein 
[Pyrococcus abyssi] 


| 9.7 


1025 


11290247 


rrtll 1 9Q0747lriirimR9fl'3Q mnQPrvpH Vivnothetical nrotein 

gill IZ^vZH / ipir|jA-'0^\J->-' WJIloOl VGU JLi.yj^/WUJ.W'Uivcw. jjxvlwxjj. 

VC2740 [imported] - Vibrio cholerae (group Ol strain 
N16961) gb|AAF95879.1| (AE004339) conserved 
hypothetical protein [Vibrio cholerae] 


8.2 


1027 


12484269 


gi|12484269|gb|AAG54048. 1|AF224607_4 (AF224607) 
NADH dehydrogenase subunit 4 [Propithecus verreauxi 
deckeni] 


7 


1028 


14329703 


gi|14329703|emb|CAC40662.1| (AJ292926) anaerobic 
(class ID) ribonucleotide reductase large subunit chain 
[Staphylococcus aureus] 


1.7 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1037 


2636683 


gi|2636683|gb|AAC06264.1| (U66333) pol 
[Schistosoma mansoni] 


5.2 


1039 


6016240 


gi|6016240|sp|O02100|HOPl_CAEEL INTEGRAL 
MEMBRANE PROTEIN HOP-1 pir||T15184 presenilin 
beta homolog - Caenorhabditis elegans pir||T42237 
presenilin-beta homolog - Caenorhabditis elegans 
gb|AAB52948.1| (AF000265) Hypothetical protein 
C18E3.8 [Caenorhabditis elegans] gb|AAB84394.1| 
(AF021905) presenilin [Caenorhabditis elegans] 


8.5 


1 HAH 


10047173 


gi|l0047l73|dbj|BABl3380.l| (AB046774) KIAA1554 
protein [Homo sapiens] 


8.4 




14089928 


gi|14089928|emb|CAC13687.1[ (AL445564) 
HEMOLYSIN C [Mycoplasma pulmonis] 


4.8 


1045 


7469269 


gi|7469269jpir||S77245 bioY protein - Synechocystis sp. 
(strain PCC 6803) dbj|RAA17579.1| (D90907) BioY 
protein [Synechocystis sp. PCC 6803] 


5.6 


! 1046 


10140780 


gi|10140780|gb|AAG13610.1|AC078840_l 
(AC078840) hypothetical protein [Oryza sativa] 


2.2 


1050 


7496756 


gi|7496756|pir||T19593 hypothetical protein C31A11.3 - 

Caenorhabditis elegans emb|CAB05685.1| (Z83218) 
contains similarity to Pfam domain: PF01838 (Domain 
of unknown function), Score=506.2, E-value=5.4e-154, 
N=2 [Caenorhabditis elegans] 


4.1 


1052 


n S /\<"v Ar\.e\. 

7500420 


gi|7500420|pir||T32834 hypothetical protein F33H12. 1 - 
Caenorhabditis elegans gb|AAB95002.1| (AF040649) 
Hypothetical protein F33H12. 1 [Caenorhabditis 
elegans] 


4 


1054 


14736828 


gi| 14736828|ref|XP _03248 1 . 1 1 hypothetical protein 
XP_032481 [Homo sapiens] 


5.5 


1056 


7290674 


gi|7290674|gb|AAF46122.1| (AE003436) CG4320 gene 
product [Drosophila melanogaster] 


4.7 




14970757 


gl|14y70757|emo|UAL-444o4. 1| {AJ313d{)o) CtxA 

protein [Salmonella typhimurium] 


5.5 


1062 


4115497 


gi|4115497|dbj|BAA36391.1| (AB010426) AL1 like 
protein [Phytoplasma sp.] 


3.4 


1064 


111816 


gi|l 1 18 16|pir]|S21348 probable pol polyprotein-related 
protein 4 - rat emb|CAA37647.1| (X53581) ORF4 
[Rattus norvegicus] 


6.4 
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SEQ ID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1067 


14149940 


gi|14149940|ref|NP_l 15610. 1| hypothetical protein 
FLI23059 [Homo sapiens] dbj|BAB15536.1| 
(AK026712) unnamed protein product [Homo sapiens] 


8E-34 


1072 


6321761 


gi|6321761|ref|NP_pil837.1| YnlOzocp 
Saccharomyces cerevisiae] sp|P38740|YHC6_YEAST 
HYPOTHETICAL 30.7 KD PROTEIN IN RUtfl- 

CXTT7/C T\TTt7D /T.T?XTT/^ 'D'C/'^TrYrVT "DUTh/"" 1 ! TT? QfYR 

SJNro JJN 1 xilvLrxiJN 1L, KxivjlUIN rKJi^UJvoClJv 
pir||S48942 hypothetical protein YHL026c - yeast 

/Co™*Vifa*v\r»w/wi nf*T&nciA(A oh\ A A"R6S0fi? 11 fill 1583^ 

YHL026c gene product [Saccharomyces cerevisiae] 


0.26 


1076 


5932366 


mKO^O^Inkl A Ark^£Q1Q 11 AF1 RftlilS 11 fAF1R014^ 
^pyjZOOO|gD|/VrtlJDOyi^.l|/\J7 J. OV/l'rJ A A V*** lOUlHJ; 

lypothetical protein; zml2orf5 [Zymomonas mobihs] 


7.2 


1084 


1 1Uooo4 


gi|7106864|gb|AAF36157.1|AF151071_l(AF151071) 
xil> rv,z,j / [xioino sapiens j 


8.3 


1087 


1173846 


gi|1173846|gb|AAA86616.1| (U39455) envelope 
glycoprotein precursor [Crimean-Congo hemorrhagic 
fever virus] 


6.1 


1092 


87765 


gi|o / /o5|pur|[JUUUi3 nypotneucai J-fi protein ^imra 
intron of gene TS) - human prf||1510254A LI repetitive 
element ORF [Homo sapiens] 


2 


1096 


9506813 


gi|9506813|refINP_062184.1| Inositol polyphosphate-5- 
phosphatase [Rattus norvegicus] gb|AAB40610.1| 
(U55192) inositol polyphosphate 5' phosphatase Ship 
[Rattus norvegicus] 


3.3 


1101 


11496734 


gi|11496734|refJNP_045511.1| B. burgdorferi predicted 
coding region BBH18 [Borrelia burgdorferi] pir||C70237 
hypothetical protein BBH18 - Lyme disease spirochete 
plasmid H71p28-3 gblAAC66022.1| (AE000784) B. 
burgdorferi predicted coding region BBH18 [Borrelia 
burgdorferi] 


7 


1102 


9630723 


gi|9630723|reflNP_047269.1| putative virulence 
detenninant Vir [Mycoplasma arthritidis bacteriophage 
MAV1] gb|AAC33779.1| (AF074945) putative 
virulence determinant Vir [Mycoplasma arthritidis 
bacteriophage MAV1] 


9.9 
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SEQID 
NO 


ACCESS 

N 
in 




P VAT TTP 


1107 


3638957 


gi|3638957|gb|AAC36301.1| (AC004877) sco-spondin- 
mucin-iiKe, similar to Jryoio/ (iriu.gi / 1 idhoJ, details 
of intron/exon structure uncertain [Homo sapiens] 


3.3 


1110 


If / Z.D JOZ 


gi|14725582|ref|XPJ)02477.3| hypothetical protein 
x*j-rj 1U0Z7 [nonio sapiens j 


V.oD 


1112 


7298126 


gi|7298126|gb|AAF53364.1|(AE003642) 
BG.DS00180.10 gene product [Drosophila 
melanogaster] 


8.6 


1120 


15011489 


gi|1501 1489|gb|AAK77584.1|AF396436_24 
(AF396436) heme maturase [Tetrahymena thermophila] 


0.077 


1121 


/jUU^UO 


gi| /!>0UiUo|pir|| 12163 0 hypothetical protem r 32B4.1 - 
Caenorhabditis elegans emb|CAB04238.1| (Z81522) 
preaiciea using Lrenetinaer [^aenornao cutis eiegansj 


J.O 


1123 


170156 


gi|170156|gb|AAA73078.1| (M73688) [Sorghum bicolor 
endosperm tissue mRNA, complete CDS.], gene product 
prfj|l 80833 1 A gamma kafirin [Sorghum bicolor] 


0.41 


1125 




gi|348951|gb|AAC78248.1| (M77194) gag [Rat 
icuKenua virus j 




1127 


14583262 


gi|14583262|ref|NP_127506.1| replicase ORFlab 
polyprotein [Equine arteritis virus] emb|CAA69187.2| 
(Y07862) replicase ORFlb polyprotein [Cloning vector 
pEAV030] emb|CAC42775.2| (X53459) replicase 
ORFlb polyprotein [Equine arteritis virus] 


0.79 


1128 


6322140 


gil6322140lreflNP_012215.ll involved in filamentous 
growth; DfglOp [Saccharomyces cerevisiae] 
sp|P40526|YIE9_YEAST HYPOTHETICAL 30.3 KD 

X>t? OTTiTM TNT 1>T>T O/TD CVm TTVTTTTDnTTMT/^ 

rssAJ 1 HUN UN JKxJj jh-xs-o i vj! UN 1 iiK\jJbJN 1L/ 

REGION pir||S48430 probable membrane protein 
YIL049w - yeast (Saccharomyces cerevisiae) 
emb|CAA86173.1| (Z38060) or£ len: 253, CAI: 0.11 
[Saccharomyces cerevisiae] 


7.6 


1129 


14779404 


gi|14779404|ref|XP_008099.4| integrin alpha L 
precursor (Homo sapiens] 


0.004 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1136 1 


13928898 


gi|13928898|ref|NP_l 13837. 1| killer cell lectin-like 
receptor subfamily G, member 1 [Rattus norvegicus] 
pir||I59421 mast cell function associated antigen - rat 
emb|CAA56208.1| (X79812) mast cell function 
associated antigen [Rattus norvegicus] 
emb|CAA65829.1| (X97191) MAFA protein [Rattus 
norvegicus] 


5 


1138 


11467057 


gi|l 1467057|ref|NP_042533. 1| NADH dehydrogenase, 
subunit 4 [Acanthamoeba castellanii] 
sp|Q37375|NU4M_ACACA NADH-UBIQU1NONE 
OXIDOREDUCTASE CHAIN 4 pir||S53834 NADH 
dehydrogenase (ubiquinone) (EC 1.6.5.3) chain 4 - 
Acanthamoeba castellanii mitochondrion 
gb|AAD 11826.11 (U12386) NADH dehydrogenase, 
subunit 4 [Acanthamoeba castellanii] 


0.63 


1140 


1 O A O 

733l54o 


gi|7331848|gb|AAF60536.1| (AC024772) contains 
similarity to Pfam family PF00569 (Zinc finger present 
in dystrophin, CBP/p300), score=30.4, E=4.3e-05, N==l 
[Caenorhabditis elegans] 


A S 


1144 


12725042 


gi| 12725042|gb|AAK06095 . 1 jAE006429_13 
(AE006429) UNKNOWN PROTEIN [Lactococcus 
lactis subsp. lactis] 


U.5Z 


1 1A A 
1 140 


13122173 


gi|13122173|emb|CAC32349.1| (AL583945) putative 
bifunctional protein (histidine kinase and regulator) 
[Streptomyces coehcolor] 


1.3 


1 1 A1 
1 14 / 


13925661 


gi|13925661|gb|AAK49407J|AF261233_l (Ar 261233) 
sodium/calcium exchanger protein [Mus musculus] 


5.3 


1149 


6552484 


gi|6552484|gb|AAF1641 L1|AF038572_1 (AF038572) 
jagged2 [Mus muscnlus] 


0.83 


1154 


13489284 


gi|13489284|gb|AAF16898.2lAF168614_l (AF168614) 
HMG-box transcription factor Soxl7 [Danio rerio] 


9.8 


1155 


1170115 


gi|l 1701 15|sp|P46430|GTTl JrfANSE 

THETA) gb|AAA92880.1| (L32091) glutathione S- 
transferase [Manduca sexta] 


7.8 


1159 


13377412 


gi|13377412|gb|AAK20674. 1|AF3 16639_9 (AF3 16639) 
Wzy [Streptococcus pneumoniae] gb|AAK74527.1| 
(AE007347) hypothetical protein [Streptococcus 
pneumoniae] 


4.1 
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SEQED 
NO 


N 


DESCRIPTION 


P VALUE 


1163 


7298868 


gi|7298868|gb|AAF54075.1| (AE003673) CGI 105 gene 
product [Drosophila melanogaster] 


5.5 


1164 


10178202 


gi|10178202|dbj|BABl 1626. 1| (AB016875) 

fc^CliC L\l.S\.yLJ / . X J~UI1M1UW11 piULCJH L-rYI tAUlUUpblb 

thaliana] 


9.3 


1166 


7494379 


gi|7494379|pir||C71610 probable membrane associated 
protein PFB0615c - malaria parasite (Plasmodium 
falciparum) gb|AAC71912.1| (AE001406) predicted 
membrane associated protein [Plasmodium falciparum] 


6.5 


1170 




gi|12408633|ref|NP_074924.1| cytochrome c oxidase 
subunit 1 [Podospora anserina] 

sp|P20681|COXl_PODAN CYTOCHROME C j 
OXIDASE POLYPEPTIDE I pir||A48327 cytochrome-c 
oxidase (EC 1.9.3.1) chain I - Podospora anserina 
mitochondrion emb|CAA38777.1| (X55026) cytochrome 
OAiuase c jjt ouospora onsennaj 




1173 


2144233 


gi|2144233|pirl|JC5010 nucleotide-binding protein F - 
Methanosarcina mazei emb|CAA62802.1| (X91502) 
ABC transporter [Methanosarcina mazei] 


6.9 


1174 


1 (VSR 1460 


gi|10581460|gb|AAG20195.1| (AE005096) Vng2034h 

ri-T*alr*hs»r*tf 3k rmm en "MUf^-ll 
| ricti^jUciv^Lci luiii ap. x> rvv^-i j 




1176 


13652647 


gi|13652647|ref|XPJ)07000.3| solute carrier femily 6 
(neurotransmitter transporter, betaine/GABA), member 

12 [Homo sapiens] refpCP_029976.ll solute carrier 
femily 6 (neurotransmitter transporter, betaine/GABA), 
member 12 [Homo sapiens] 


9 


1181 


\ 

9631326 


gi|9631326|ref|NP_048159.1| ORF MSV088 
hypothetical protein [Melanoplus sanguinipes 
entomopoxvLTUsj pir||T28249 ORF MbV088 
hypothetical protein - Melanoplus sanguinipes 
entomopoxvirus go| AAty /o J y . 1 1 (Ar uo.5 ooo) kjkt 
MSV088 hypothetical protein [Melanoplus sanguinipes 
fintomnnnwi n i <j1 

vmvlllUU vA V U Uul 


3.2 


1184 


13897920 


gi|13897920|gb|AAK48502.1|AF260966_l (AF260966) 
IL-8 receptor [Oncorhynchus mykiss] 


2.6 


1185 


12860694 


gi|12860694|dbj|BAB32022.1| (AK020187) putative 
[Mus musculus] 


0.86 


1187 


6671484 


gi|6671484|gb|AAC49301.2| (U32444) phytochrome F 
[Lycopersicon esculentum] 


3.4 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ ID 
NO 


ACCESS 
"NT 


DESCRIPTION 


P VALUE 


1191 


11358531 


gi| 1 135853 l|pir||T5 1640 myb-related transcription 
factor MYB19 [imported] - Arabidopsis thaliana 
(fragment) gb|AAC83590.1| (AF062868) putative 
transcription factor [Arabidopsis thaliana] 


0.95 


1192 


88462 


gi|88462|pir||A27307 proline-rich phosphoprotein (gene 
PRH1, Db allele) - human 


7.9 


1193 


7484Q0Q 


gi|7484909|pir||T06608 disease resistance protein 
homolog F16J13.80 - Arabidopsis thaliana 
emb|CAB40942.1| (AL049638) putative disease 
resistance protein (TMV N-like) [Arabidopsis thaliana] 
emb|CAB78244.1| (AL161533) putative disease 
rp«:i<!tanrp nrotein f 1 1VTV N-lilcei TArabidonsis thaliana! 


3.4 


1195 


7492269 


gi|7492269|pir||T39663 paired amphipathic helix, 
probable transcription regulator protein - fission yeast 

(Schizosaccharomyces pombe) emb|CAA21310.1| 
(AL03 1856) putative transcriptional rgulatory protein 
[Schizosaccharomyces pombe] 


4.1 


1197 




gi|134437|sp|P13823|SERA_PLAFG SERINE- 
REPEAT ANTIGEN PROTEIN PRECURSOR (P126) 

Sill T7"T% A ATLTTTOTTXTX . * -II A C A CAC - - 

(Ill KDA ANTIGEN) pir||A545U5 senne-repeat 
antigen precursor - malaria parasite (Plasmodium 
ialciparurn) (strain r^KJj gDjAAAzy fo3.±\ ^jujyy^j 

serine repeat protein [Plasmodium falciparum] 
gb|AAA16791.1| (J04000) serine-repeat antigen protein 
[Plasmodium felciparum] gblAAA74911.1| (U08113) 

c^ririf* Tpr*f*jrt' ntitiopti rPljiQiTioHiinn "fslc.iTlfl'riiTnl 


4.6 


1202 


12841678 


gi|12841678|dbj|BAB25308.1| (AK007856) putative 
[Mus musculus] 


2E-31 


1211 


586120 


gi|586 120|sp|Q07283|TRHY_HUMAN 

XPTPWrHTVAT TNT nidi A 4S 07^ tri rfinVivsiiin - human 

gb|AAA65582.1| (L09190)trichohyalin [Homo sapiens] 


9.8 • 


1214 


3122611 


gi|3 12261 l|sp|O18417|A70A_DROSE ACCESSORY 
GLAND-SPECIFIC PEPTIDE 70A PRECURSOR 
(PARAGONIAL PEPTIDE B) emb|CAA67791. 1| 
j (X99414) sex-peptide [Drosophila sechellia] 


6.1 


1215 


2190464 


gi|2190464|emb|CAB09537.1| (Z96107) Uncx4.1 [Mus 
musculus] 


7.6 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1216 


153930 


gi|153930|gb|AAA27047.1| (M23007) NADPH-sulfite 

rpjii i pt"3 Qf* liPiTirYtirrvt'fMTi rT»Tnrinnpn+ rQ^lmrvriplli} 

typhimurium] 


1.3 


1217 


5091519 


protein [Oryza sativa] 


0.95 


1219 


9759203 


gi[9759203|dbj|BAB09740.1| (AB015476) heat shock 
transcription factor HSF30-like protein [Arabidopsis 
thaliana] 


9.2 


1224 


1732073 


gi|1732073lgblAAC50901.1| (U75308) TBP-associated 
factor FHomo saniensl 


0 13 


1225 


7505421 


gi|7505421tpir||T23399 hypothetical protein K07C10.1 - 
Caenorhabditis elegans emb|CAA87375.1| (Z47074) 
similarity to transmembranous domains of the 

UlUoUpjJlXci piULCill pdLL/llCU. ^OWlaa xIUL dwCCbblOIl 

number PI 85 02) [Caenorhabditis elegans] 


3.4 


1228 


140791 


gi|140791|sp|P17369[YHR3_VACCV 
HYPOTHETICAL HOST RANGE 27.4 KDA 
PROTEIN pir||WZVZA3 27.4K HindTTT-C protein - 
vaccinia virus (strain WR) gb|AAA69594. 1| (M228 12) 


4 fi 


1231 


6831569 


gi|683 1 569|sp|O84098|IF2_CHLTR TRANSLATION 
INITIATION FACTOR IF-2 pir||H71558 probable 
translation initiation factor IF-2 - Chlamydia 
trachomatis (serotype D, strain UW3/Cx) 

ah!AAP676R7 11 fAF00195tt^ Initiation Vartnr-') 

[Chlamydia trachomatis] 


7.8 


1243 


12725042 


gi|12725042|gb|AAK06095.1|AE006429_13 
(AE006429) UNKNOWN PROTEIN [Lactococcus 

lapti^i <2iih^n lartiQl 

luvlid JllUjp. UlvUQJ 


0 91 


1246 




gi| 10 1 78036|dbj|BAB 1 1 5 1 9. 1| (AB005245) 




1249 


7494298 


gi|7494298|pir||A71613 hypothetical protein PFB0530c ■ 
malaria parasite (Plasmodium falciparum) 
sblAAC71895 11 (AE001400) hvoothetical nrotein 
[Plasmodium falciparum] 


2.6 


1253 


14721018 


gi|14721018|ref|XP_051562.1| similar to agrin (H. 
sapiens) (Homo sapiens] 


3.5 


1255 


6175163 


gi|6175163|gb|AAF04889.1|AC01 1437_4 (AC011437) 
unknown protein [Arabidopsis thaliana] 


8.7 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1264 


1 9^90097 


(AC083891) ABC transporter, putative [Arabidopsis 

LJx&JLLcilxciJ 


9.5 


1265 


13375919 


gi|13375919|ref|NP_078940.1| hypothetical protein 

17T Tl 1 rTTr»TYir» ccitvi*»nci HfiiKR ARl ^93 1 T 1 

7 LjJ x 1 ojo LFioino sapienbj cjdj j j>/vd i -? x . x j 
(AK021900) unnamed protein product [Homo sapiens] 


4.2 


1274 


9369395 


gi|9369395|gb|AAF87143J)AC002423_8 (AC002423) 
T23E23.16 [Arabidopsis thaliana] 


7.2 


1275 


7435789 


i^l7^^^7QOlrvii*IITn£97^ Kf»n«7 , ntliiaHiJ! , 7n1p-inHiiCftH 
gl| / ^tjD I 07[pir|| 1 VJOZ. / O U CIxZaJ UXlcMJlaxAJiG ixuxux<t.vx 

protein (clone WCI-4) - wheat gb|AAC49287.1| 
(U32430) thiol protease [Triticum aestivum] 


5.4 


1280 


7497192 


gi|7497192|pir||T19833 hypothetical protein C38D9.3 - 
Caenorhabditis elegans emb|CAB03949.1| (Z81481) 
C38D9.3 [Caenorhabditis elegans] 


0.15 


1284 


14739967 


gi| 14739967|rexiXP_035107.1| v-raf murine sarcoma 
viral oncogene homolog Bl [Homo sapiens] 


1.9 




7463772 


gj|7463772|pir||r7Uxlo signal peptidase i \iepo-i; 
homolog - Lyme disease spirochete gb|AAC66422.1| 
^/Viiuui i x / ) signal pepuaase i ^icpx>-j.^ ^i3uiicxi<» 
burgdorferi] 


0.22 


1292 


4580388 


hypothetical protein [Arabidopsis thaliana] 


5.5 


1293 


10437002 


gi|10437002|dbj|BAB14954.1| (AK024666) unnamed 
protein product [Homo sapiens] 


0.000000002 


1296 


4493974 


gi|4493974|emb|CAB39033.1| (AL034559) hypothetical 
protein, PFC0930c [Plasmodium falciparum] 


3.2 


1301 


482030 


gi|482030|pir||S40544 hypothetical protein - Escherichia 

r1Ktl*RA A019QO 11 /TilOzlft^i T^l fivnnthptiral 
COU QDjIxj/VrVUlZ"". 1| \U IVJ^tOD ) lol xiypiJUJ.OU.OaJ. 

protein E-96(PIR:A04462) [Escherichia coli] 


8.3 


1306 


4836719 


gi|4836719|gb[AAD30537.1|AF133256_2(AF133256) 
envelope protein precursor [Friend mink cell focus- 
forming virus] 


8.6 


1307 


12843929 


gi|12843929|dbj|BAB26168.1| (AK009250) putative 
[Mus musculus] 


2.1 


1308 


10175999 


gi|10175999|dbj|BAB07095. 1| (AP0015 18) 
lipopolysaccharide biosynthesis [Bacillus halodurans] 


2.8 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1310 


14767733 


gi|14767733|ref|XP_030859.1| similar to TGF(beta)- 
induced transcription factor 2 (H. sapiens) [Homo 
sapiens] 


1.5 


1 O 1 "I 
I J I 1 


1351721 


gi|1351721|sp|Q10105|YAQ5_SCHPO PUTATIVE 
TRANSLATIONAL ACTIVATOR C18G6.05C 
(GCN1 HOMOLOG) pir||T37919 GCN1 homolog - 
fission yeast (Schizosaccharomyces pombe) 
emb|CAA92Jo5.l| (Zooiys) (jUJNi nomologue 
Schizosaccharomyces pombe] 


0.48 


1312 


13376747 


gi|13376747|refINP_079428.1| hypothetical protein 
FLJ12660 [Homo sapiens] ref|XPJ)17923.1| 
hypothetical protein FLJ 12660 [Homo sapiens] 
dbj|B AB 14203. 1| (AK022722) unnamed protein product 
[Homo sapiens] 


U.UUUUUUUUo 


1315 


9453886 


gi|9453886|dbj|BAB03287.1| (AB045975) pro-alpha 1 
type V7XI collagen [Pagrus major] 


0.021 


1320 


11034630 


gi|11034630|dbj|BAB17154.1| (AP002868) hypothetical 
protem [Oryza sabva] dbj|BAB5549l.l| (AP002541) 
hypothetical protein [Oryza sativa] 


0.49 


1322 


14727261 


gi|l472726l(ref|XP_0273l3.l| hypothetical protein 
FLJ22351 [Homo sapiens] refpff_0273l4.ll 
hypothetical protein FLJ22351 [Homo sapiens] 


0.077 


1327 


2789430 


gii2789430|dbj|BAA24380.l| (D30612) repressor 
protein [Homo sapiens] 


u.y / 


1328 


7497494 


gi|7497494|pir||Tl9963 hypothetical protein C46C2.4 - 
Caenorhabditis elegans emb|CAA92590.l| (Z68296) 
contains similarity to Pfam domain: PF0056 1 
(alpha/beta hydrolase fold), Score=30.8, E-value=le-05, 
N=l [Caenorhabditis elegans] 


0.2 


1329 


13129018 


gi|l3l290l8|refINP_076956.l| hypothetical protein 
MGC3077 [Homo sapiens] ref|XPJ)30ll6.l| 
hypothetical protein MGC3077 [Homo sapiens] 
gb|AAC23790.1| (AC005154) similar to protem 
U28928 (PID:g861306) [Homo sapiens] 
gb|AAH00625.1|AAH00625 (BC000625) Unknown 
(protein for MGC3077) (Homo sapiens] 


2E-89 


1330 


14647539 


gi|14647539|gb|AAK71916.1|AF332040_l (AJF332040) 
growth hormone receptor [Jaculus jaculus] 


4.8 
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OCA ¥T\ 

NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1331 


1584488 


gi|1584488|prf||2123261AB chemosensory receptor 
Caenorhabditis elegans] 


0.68 


1333 


2326324 


gi|2326324|emb|CAB10936.1| (Z98264) cytochrome c 
oxidase subunit in [Anabaena sp.] 


A 1 
*r.l 


1334 


7434912 


gi|7434912jpir|]H71934phosphatidylglycerophosphate 
synthase - Helicobacter pylori (strain J99) 
gblAAD05990.1| (AE001475) 
PHOSPHATIDYLGLYCEROPHOSPHATE 
SYNTHASE [Helicobacter pylon Jyyj 




1337 


1335199 


gi|1335199|emb|CAA26919.1| (X03145) pot. ORF V 
Homo sapiens] 


5.2 


134U 


7440285 


gi|7440285|pir||T07994 ribosomal protein S9 - 
Chlamydomonas reinhardtii chloroplast 
emb|CAA74006.1| (Y13655) 30S nbosomal protein b9 
[Chlamydomonas reinhardtii] 


3 


1 1 A 1 

1341 


8923088 


gi|8923088|ref]NP_060 127. 1| hypothetical protein 
FLJ20080 [Homo sapiens] ref|XPJ)46636.1| 
hypothetical protein FLJ20080 [Homo sapiens] 
dbj|BAA90936.1| (AK000087) unnamed protem product 
{Homo sapiens] 


0.000000008 


1 1 AG. 

1345 


6958206 


gi|695 8206lgb[AAF32493 . 1|AF093 132_1 (AF093 132) 
kexm-like protease KEX1 [Pneumocystis cannn f. sp. 
muris] 


2.3 


1 1A 

1 J4o 


7496774 


gi|7496774|pir||T32166 hypothetical protem C31B8.8 - 
Caenorhabditis elegans 


2 


1350 


9366789 


gi|9366789|emb|CAB95551.1| (AL359782) hypothetical 
protem, CHR1. 313. 1 Trypanosoma bruceij 


O A 


1352 


10175488 


gi|10175488|dbj|BAB06586.1| (AP001516) cation 
antiporter (Na-*7Ca2+) [Bacillus naloauransj 


U. /Z 


1354 


1119230 


gijlll9230|dbj|BAA06595.1| (D31786) secretion 
protein Y [ Acyrthosiphon kondoi endosymbiont] 


8.5 


1355 


2897812 


gi|2897812|dbj|BAA24894.1l (AB010996) G2-G1 
polyprotein precursor [tomato spotted wilt virus] 


8.9 


1357 


I'll 1 1 CO/f 

131115H6 


gi|13 1 1 1586[gb|AAK12388. 1(AF296094_1 (AF296094) 
poiyprotein Urorcine tescnovirusj 


9.2 


1359 


13881823 


gi|13881823|gb|AAK46426.1| (AE007064) hypothetical 
protein [Mycobacterium tuberculosis CDC 1551] 


1.3 


1368 


1362185 


gi|1362185|pir||S56686 histone H2B123 - wheat 
dbj|BAA07158.1| (D37944) protein H2B 123 [Triticum 
aestivum] 


9.7 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1369 


13812308 


gi| i o o i jld uo |rei|iN Jr _i i o4z o . 1 1 iiypotneticai protein 
[Guillardia theta] emb|CAC26995 . 1 1 (AJO 1 0592) 
hypothetical protein [Guillardia theta] 


4.7 


1370 




gi|13541057|ref|NP_110745.1| Permease (major 
iaciiitaior superraiiiiiy ) [ 1 nerrnopiasiua voicaniunij 
dbj|BAB59369.1| (AP000991) uiiknown product 

I X ildllUjplciblllci VUlWOlUUIIlJ 


7 6 


1371 


9964353 


gi|9964353|refTNP_064821.1| AMV039 [Amsacta 
moorei entomopoxvirus] gb|AAG02745.1|AF250284_39 

\J\ir ZJuZo'fJ AiVlVU-jy |/\IHSaCia mOOiCl 

entomopoxvirus] 


5.3 


1373 


902377 


gi|902377|gb|AAA82981.1| (U18059) polyprotein 
tpestivirus type 1] 


0.41 


1374 


oyJD fro 


gljoVDj /4o(u.Dj|oAAyoUO/.l| ^AJrUUUJOoj 

gene_id:F6B6 J~piii|C71410-similar to unknown 
protein i/Axauiciopsis raaiianaj 




1375 


11291752 


gi|11291752|pir||T47971 seven in absentia-like protein - 
AraDiaopsis tnanana emD|wvD/xJ.uy.x| \a\uY5j&jzj) 
seven in absentia-like protein [Arabidopsis thaliana] 


10 


1379 


10177211 


gi|10177211|dbj|BAB10286.1| (AB026650) protein 
kinase [Arabidopsis thaliana] 


5 


1384 


7332073 


gi|7332073|gb|AAF60760.1| (AC024810) Hypothetical 
protein Y54E1 OA. 1 [Caenorhabditis elegans] 


5.7 


1388 




gi|11358814|pir||T46130 RNA polymerase HI subunit- 
like protein - Araoiaopsis tnanana emD|v^AJ50ZUiu.i| 
(AL 132967) RNA polymerase HI subunit-like protein 
[/\rauiQopsis uiaiianaj 


9 9 


1389 


4506569 


gi|4506569|reflNPJ)02932.1| roundabout (axon 
guidance receptor, Drosophila) homolog 1 [Homo 
sapiens] gb)AAC39575.1| (AF040990) roundabout 1 
[Homo sapiens] 


4E-17 


1393 


11761072 

x x / vy x \j 9 


gi|11761072|dbj|BAB19062.1| (AP002744) hypothetical 
nrntein FOrvza satival 


1.9 


1398 


461649 


gi|461649|sp|Q05004|BB61_RABIT BRUSH BORDER 
61.9 KD PROTEIN PRECURSOR pir||B45665 adult- 
specific 61.9K brush border protein precursor - rabbit 
emb|CAA78302.1| (Z12840) protein of unknown 
function [Oryctolagus cuniculus] 


8E-52 
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CEO ID 

,11-/ 

NO 


ACCESS 

IN 


T)F<irRTPTTON 


P VALUE 


1402 


478809 


gi|478809|pir||S29851 protein kinase 6 (EC 2.7.1.-) - 
soybean gb|AAA34002.1| (M67449) protein kinase 
Glycine max] prf]|1908223A protein kinase [Glycine 
max] 


9.3 


1406 


1517936 


gi|1517936|gb|AAB07000.1| (U52347) tachykinin-like 
receptor [Stomoxys calcitrans] 


1.1 


1407 


9977929 


gi|9977929|sp|Q05013|LIPA_NEIMB CAPSULE 
POLYSACCHARIDE MODIFICATION PROTEIN 
JPA pir||D81240 capsule polysaccharide modification 
protein LipA NMB0082 [imported] - Neisseria 
meningitidis (group ±> strain jyllj joj gD|/\i\rHujH-o.i| 
(AE002367) capsule polysaccharide modification 
protein LipA [Neisseria meningitidis MC5 8] 


9.3 


1412 


14773348 


gi|14773348|reflXP 038450. 1| 20849 [Homo sapiens] 


1E-48 


1413 


13537363 


gi|13537363|dbj|BAB40663.1| (AB051851) death 
receptor 3 [Homo sapiens] 


3.9 


141Q 


7477083 


gi|7477083|pir||A70577 hypothetical protein Rv2133c - 
Mycobacterium tuberculosis (strain H37RV) 
emb|CAB08660.1| (Z95388) hypothetical protein 
Rv2133c [Mycobacterium tuberculosis] 
gb|AAJ\>4-o4 / D. 1| (AliUU /uo / J conserved xiypumciiL/cu 
protein [Mycobacterium tuberculosis CDC 1551] 


1.9 


1422 




gi|749y021|pir|[ lzUo4o nypotneticai protem r iDEiy.y - 
Caenorhabditis elegans emb|CAA93411.1| (Z69383) 
r LDS2i2.zr i^aenoriiaDuiiis eicg<m^j 


4.1 


1424 


14723696 


gi|14723696|reffXP_035744.1| hypothetical protein 
XP 035744 [Homo sapiens] 


1.5 


14?^ 


9961349 


gi|yyoi j*ty|rei(iNir uujjuu.zi i^mx-iii^c i [xiuiuu 

sapiens] 


0.94 


1426 


A TOO 

478302 


gi|478302|pir||JN0835 carbonate dehydratase (EC 
4.2.1.1) I - chimpanzee 


1.6 


1427 


6649942 


gi|6649942|gb|AAF21641.1|AF032379_l (AF032379) 
gonadotrophin releasing hormone receptor; GnRH-R 
[Trichosurus vulpecula] 


9.6 ! 


1432 


6636500 


gi|6636500|gb|AAF2020 1 . 1| AF20579 1_1 (AF20579 1) 
squalene synthase [Botryococcus braunii] 


1.8 


1434 


7504070 


gi|7504070(pir||T22586 hypothetical protein F53F4.14 - 
Caenorbabditis elegans 


0.05 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1437 


102177 


gij 1 02 1 77|pir||S 13141 hypothetical protein (ribosomal 

ixi ^ xi i r ^ \jt£pi\j±x j vj xcu met Lcu.ii\jLxa. 


1.2 


1439 


13357869 


gi|13357869|ref]NPJ)78143.1| unique hypothetical 
[Ureaplasma urealyticum] pir||D82907 hypothetical 
protein UU309 [imported] - Ureaplasma urealyticum 
gb|AAF30718.1|AE002128_6 (AE002128) unique 
hypothetical [Ureaplasma urealyticum] 


5.4 


1451 


7468435 


gi|7468435|pir||B72015 metalloproteinase, insulinase 
family CP0903 [imported] - Chlamydophila pneumoniae 
(strains CWJL029 and AR39) go|AAD 19093. 1| 
(AE001675) Insulinase fomily/Protease DDL 
[Chlamydophila pneumoniae CWL029] 
go j/Y/\r j o o o zf . i j \j\i2Aj\}/.z. t t y ) meiauoproiease, 
insulinase family [Chlamydophila pneumoniae AR39] 
dbj|BAA99165.1| (AP002548) insulinase 

■fe"mi1v/riTr»1"fyicp 111 rf^Viljimvflri'nliil?! rvnPTrmfYnijift Tl^Rl 
j_cu.iu.iy / ljx uiwoov .ill y\^XLiaj.Lt.y \j.kjljijj.1(X piivummuov j ajoj 


7 1 


1452 


9857712 


gi|9857712[gb|AAG00902.1(AF176776_l(AF176776) 
xyloglucan endotransglycosylase LeXET2 
[Lycopersicon esculentum] 


1.9 


1453 


4493974 


gi|4493974|emb|CAB39033.1| (AL034559) hypothetical 
protein, PFC0930c [Plasmodium falciparum] 


3.4 


1454 


4493900 


gij^^y^i'uuiemDj^Ao^iyuuy.ij {j\L/V^hddo) preuicieu 
using hexExon; MAL3P2.22 (PFC0265c), Hypothetical 
protein, len: 637 aa [Plasmodium falciparum] 


6.1 


1456 


7503603 


gi| /3UjoU3|pir|| l io3 /D nyp otneucai protein r4oixi i.i - 
Caenorhabditis elegans gb|AAA81397.1| (U40412) 
Hypothetical protein F46G1 1 . 1 [Caenorhabditis 
elegans] 


2.1 


1462 


9437954 


nucleoprotein [Influenza A virus (A/Duck/Hong 
Kong/P54/97(Hl 1N9))] 


6.7 


1464 


2129239 


gi|2129239|pir||G64488 reverse gyrase (intein- 
containing - IMethariricnccu^ iantiascHii 
gb|AAB9953 1 . 1| (U67592) reverse gyrase, intein 
containing (rgy) [Methanococcus jannaschii] 


9.2 


1466 


12232439 


gi|12232439|ref|NPJ)73602.1| hypothetical protein 
FLJ11937 (Homo sapiens] dbj|BAB 15124.1) 
(AK025392) unnamed protein product [Homo sapiens] 


6.4 
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SEQID 
NO 


N 


DESCRIPTION 


P VALUE 


1468 


7243247 


gi|7243247|dbj|BAA92671.1| (AB037854) KIAA1433 
protein [Homo sapiens] 


2E-87 


1472 


12382242 


gi|12382242|gb|AAG53080. 1|AF263824_1 (AF263824) 
5'A2rel-related protein [Leishmania donovani] \ 


8.7 


1473 


13812383 


gi|13812383|ref]NP_113501.1| seryl-tRNA synthetase 
(serin-tRNA ligase) [Guillardia theta] 
emb|CAC27070.1| (AJ910592) seryl-tRNA synthetase 
(serin-tRNA ligase) [Guillardia theta] 


6.5 


1476 


6014998 


gi|6014998|sp|O80164|DPA5_BPR69 DNA 
POLYMERASE PROCESS IVITY COMPONENT 
(DNA POLYMERASE ACCESSORY PROTEIN 45) 
(GP45) pdb|lB77|A Chain A Building A Replisome 
Structure From Interacting Pieces: A Sliding Clamp 
Complexed With An Interaction Peptide From Dna 
Polymerase pdb|lB77|B Chain B, Building A Replisome 
Structure From Interacting Pieces: A Sliding Clamp 
Complexed With An Interaction Peptide From Dna 
Polymerase pdbjlB77|C Chain C, Building A Replisome 
Structure From Interacting Pieces: A Sliding Clamp 
Complexed With An Interaction Peptide r rom Una 
Polymerase pdb|lB8H|A Chain A, Sliding Clamp, Dna 
Polymerase pdb|lB8H|B Chain B, Sliding Clamp, Dna 
Polymerase pdb|lB8H|C Chain C, Sliding Clamp, Dna 
Polymerase gb|AAC393 10.1| (AF039565) DNA 
polymerase processivity component [Bacteriophage 
RB69] 


6.9 


1479 


7706747 


gi|7706747|ref|NP_057263.1| transient receptor potential 
4 [Homo sapiens] sp|Q9UBN4|TRP4_rlUMAJN 
SHORT TRANSIENT RECEPTOR POTENTIAL 
CHANNEL 4 (TRPL4) (IKr-KbLAlcJJ rKU 1 tJJN 4 J 
(HTRP-4) (HTRP4) gb|AAD51736.11AF175406_l 
( AF 1 75406^ transient receotor potential 4 fHomo 
sapiens] gb|AAF22927.1(AF063822_l (AF063822) trp- 
related protein 4 [Homo sapiens] 


• 

1.7 


1480 


9757550 


gi|9757550|dbj|BAB08163.1j (AB030831) SrtT 
[Streptococcus pyogenes] 


0.47 


1483 


13516917 


gi|13516917|dbj|BAB40338.1| (AB044076) hybrid 
sensor [Myxococcus xanthus] 


5 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1487 


5790208 


gi|5790208|dbj|BAA83536.1| (AB031285) NADH 
dehydrogenase subunit 2 [Taenia saginata] 


2.7 


1489 


14754352 


gi|14754352|reflXP_032294.1| hypothetical protein 
FLJ10775 [Homo sapiens] 


0.0008 


1490 


3552028 


gi|3552028|gb|AAC64946.1| (AF087130) siderophore 
regulation protein [Neurospora crassa] 


7.2 


1493 


3913936 


gi|39 1 3936|sp|Q43652|D?27_SOLTU PROTEINASE 
INHIBTrOR TYPE H CM7 PRECURSOR pir||S43 105 
proteinase inhibitor II - notato emblCAA55082 11 
(X78275) proteinase inhibitor II [Solanum tuberosum] 


9.9 


1494 


6573777 


gi|6573777|gb|AAF17697. 11AC009243_24 (AC009243) 
F28K19.17 [Arabidopsis thaliana] 


2.2 


1496 


13622425 


gi|13622425|gb|AAK34148.1| (AE006569) maltodextrin 
transDort svstem nermease rStrentococcus nvo&enes M"l 
GAS] 


3.7 


1498 


9910266 


gi|9910266|ref|NP_064627. 1| kinesin-like protein 2 
FHomo saoiensl dbilBAB03309 11 fAB035898^ kinesin- 
like protein 2 [Homo sapiens] 


5E-19 1 


1503 


7294128 


eil7294128leblAAF49482 11 <AE003527^ CG4925 eene 
product [Drosophila melanogaster] 


3.6 


1506 


14089610 


gi|14089610|emb|CAC13370.1| (AL445563) unknown; 
predicted coding region [Mycoplasma pulmonis] 


1.5 


1507 


2565196 


gi|2565196|gb|AAB8 1938.1| (AF000381) non-functional 
folate binding protein [Homo sapiens] 


0.0000004 


1509 


14753935 


gi|14753935|refp0P_040892.1| hypothetical protein 
XP_040892 [Homo sapiens] 


3.3 


1510 


12545425 


gi|12545425|ref|NPJ)74975.1| hypothetical protein 

TArt-acin Inner*! cnrP^A77^lVPV9 AQTT C% 

HYPOTHETICAL 34.5 KDA PROTEIN IN RPS12- 
TRNP INTERGENIC REGION (ORF288) 
emb|CAC24586.1| (AJ294725) hypothetical protein 
rAstasia loneal 


7 


1511 


420215 


gi|420215|pir||B45878 hypothetical protein 2 - mouse 


7.1 


1512 


7512874 


gi|7512874|pir||T08792 hypothetical protein 
DKFZp586E1422.1 - human (fragment) 
emb|CAB43306.1| (AL050170) hypothetical protein 
[Homo sapiens] 


5.5 


1514 


221758 


gi|221758|dbj|BAA01683.1| (D10879) UL37 [human 
herpesvirus 1] 


0.18 
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SEQID 
NO 


ACCFSS 
N 


DESCRIPTION 


P VALUE 


1517 


11466232 


gi|11466232|ref|NP_062855.1| ORF9, contains 8 trans 
membrane regions, putative [Physarum polycepbalum] 
rlhinRAR0R0R9 11 fAB027295"l ORF9 contains 8 trans 
membrane regions, putative [Physarum polycepbalum] 


0.43 


1518 


182710 


gi|182710|gb|AAA52467.1| (M13918) fibronectin 
receptor alpba-subunit precursor [Homo sapiens] 


1.3 


1519 


1142976 


gi|1142976Igb|AAC52392.1| (U28769) odorant receptor 
r\/fnc mitQriiln^l nrfl 12207403 C odorant receotor FMus 
musculus] 


5.6 


1520 


6912446 


gi|6912446|ref|NPJ)36417.1| potassium voltage-gated 
channel, subfamily H (eag-related), member 4; ether-a- 
crrk-crri JC(-±\ r\yz\nt\p\ familv member FTTomo saoiensl 
dbj|BAA83592.1| (AB022698) BEC2 [Homo sapiens] 


8.4 


1522 


14601483 


gi|14601483|ref(NP__14802Ll| hypothetical protein 
[Aeropyrum pernix] pir||G72637 hypothetical protein 
APE1558 - Aeropyrum pernix (strain Kl) 
rfVnHRA AR05S7 ll fAP00006U 279aa lone hvoothetical 
protein [Aeropyrum pernix] 


5.7 


1526 


12515306 


gi|12515306|gb|AAG56369. 1|AE005365_3 (AE005365) 
orf 5 hypothetical protein [Escherichia coli 0157:H7 
pni Q^qi HhifRAR^S4^5 1 1 f AP002557^ hvnothetical 
protein [Escherichia coli 0157:H7] 


7.4 


1530 


2765672 


cnl976SfV7?lpmhlCAR06819 1! ^786115^ ArbX 
[Lactobacillus delbrueckii] 


7.9 


1532 


13472514 


gi| 134725 14|reflNP__10408 1 . 1 1 unknown protein 
[Mesorhizobium loti] dbj|BAB49867.1| (AP003000) 
unknown protein [Mesorhizobium loti] 


7.9 


1533 


3309522 


gi|3309522|gb|AAC26098.1| (U18292) unknown 
[Borrelia burgdorferi] 


2.1 


! 1535 


14520328 


gi|14520328|ref[NP_125803.1| activator 1, replication 
■fertnr O ^mall ^iihiinit PPvrococcus abvssil DirllC75198 
activator 1, replication factor c, small chain PAB0068 - 
Pyrococcus abyssi (strain Orsay) emb|CAB49034.1| 
(AJ248283) activator 1, replication factor C, small 
subunit [Pyrococcus abyssi] 


0.41 


1536 


7229605 


gi|7229605|gb|AAF42902.1| (AF229961) NADH 
dehydrogenase subunit 1 [Taygetis andromede] 


6.7 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1537 


6681 961 

UUO l^Ql 


gi|6681261|reflNP 031926.1| ect2 oncogene [Mus 
musculus] sp|Q07139|ECT2_MOUSE ECT2 PROTEIN 
,xiLx iJJN^UOJilNJi; pir||j3jZo I l xransionxiing protein 
(ect2) - mouse gb|AAA37536.1| (LI 13 16) ect2 [Mus 
musculus] prfi|1911407A oncogene ect2 [Mus 


6E-10 


1538 


1?"}81848 


gi|12381848|emb|CAC24715.1| (AJ297319) glucose-6- 
phosphate dehydrogenase-6-phosphogluconolactonase 

"PlocinriflinTTi V>f k rO"ViPil 


7 


1540 


7688657 


oil76Rftfi57lohl-AAFfi74fiQ 1IAF146760 1 fAF146760i 

Jill / VJOOVJ-J / I^UJxVtVJ. \J /*TVl^. X LfiJL X**T\J / UV X ^/ll ITU f \J\J J 

septin 2-like cell division control protein [Homo sapiens] 


9.7 


1541 


13021853 


aill ^0?18^lahlAA*Kl 1*564 1IAF318500 1 fAF318500i 
ent-kaurenoic acid hydroxylase [Arabidopsis thaliana] 


0.091 


J543 


6900006 


OTl6Q00006lprnhlf*AR71?Q4 11 fAr?S1917^ chorion 

protein si 8 [Ceratitis capitata] 


2.1 


• 1545 


1076445 


rrill n7AA4^rnirllQ^7nft4 mitriCic-orKV-J-fir* r>w1in r v YT l 7 - 

gi|iu /o^fH- jjpirj|oj juu^f lxxxxosis-speciiiw cycxin i 
rape gb|AAA51660.1| (L25406) cyciin [Brassica napus] 


4.3 


1548 




gi|7293274!gb|AAF48655.1| (AE003503) CG9644 gene 

t-»-mHnr»+ 1 rirocoTiViilii TYi^laYirfcO'Sicf'pri 
jJlUU.UL'L [^1^1 IJoUJJIXixd lllCixXIXt/gCloLC'l J 


0.46 


1549 


1 13S8Q61 


gi|11358961|pir||T51243 Sell protein [imported] - rice 
(fragment) gb|AAC98091.1| (AF067401) Sell protein 

\\J\y ZiGL odlXVdJ 


3 9 


1550 


8978966 


gi|8978966|dbj|BAA98801.1| (AP002547) phenylalanyl 
tRNA synthetase beta [Chlamydophila pneumoniae 
J138] 


4.1 


1551 


14010341 


gi|14010341|gblAAK51958.1|AF362013_l (AF362013) 

/\i Jr syimictoC ru auuuxxxi, u |_xxdxiwiivjixiji id. z>yj. xvx vv 

2001] 


8.2 


1553 


8923094 


OTlR09^n04lrpfrMP 0601^0 1 1 nvnotVietifial nrotein 

FLJ2O085 [Homo sapiens] ref|XP_009383.2| 
hypothetical protein FLJ20085 [Homo sapiens] 
dbj|BAA90939.1| (AK000092) unnamed protein product 
[Homo sapiens] 


1.6 


1557 


14724725 


. gi|14724725|ref]XP_037600.1| hypothetical protein 
XP _037600 [Homo sapiens] 


4.2 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1561 


14758684 


gi|14758684|ref|XP_007211.4| retinoblastoma 1 i 
(including osteosarcoma) [Homo sapiens] 
ref|XPJ)33825.1| retinoblastoma 1 (including 
osteosarcoma) [Homo sapiens] ref)XP_033826,l| 
retmoDiasioma i ^mciuuing osteosarcoiud^ ^xauiiiu 
sapiens] ref|XPJ)33827.1| retinoblastoma 1 (including 
osteosarcoma) [Homo sapiens] 


0.91 


1565 


9635387 


gi|9635387|ref|NP_059285.1| ORF137 [Xestia c-nigrum 
granulovirus] gb|AAF0525 1 . 1|AF162221_137 
(AF162221) ORF137 [Xestia c-nigrum granulovirus] 


5.9 


1572 


1730077 


gi|1730077|sp|P18160|KYKl_DICDI NON- 
RECEPTOR TYROSINE KINASE SPORE LYSIS A 

/•PVT) /"\OTTvTC7 T>"D r\TC TXT XTTXT A CT7 1 % r»irl IT 1 SO 7 (\ nnn. 

ili KUolJNii-rKU I JbJJN JsJJN A^ii i ) pn]| iioz/o non- 
receptor tyrosine kinase - slime mold (Dictyostelium 
discoideum) gb|AAB41 125 . 1| (U32174) non-receptor 
tyrosine kinase [Dictyostelium discoideum] 


6 


1574 


9622133 


^IO^OOI^IiyVJA AT7RQA3^ 1IAF1£771Q 1 ( API 6771 9^ 

transmembrane leptin receptor [Sus scrofa] 


2.6 




139809 


gi|l39809|sp|P27571|XIST_MOUSE X UN AC 11 Vfc, 
SPECIFIC TRANSCRIPT PROTEIN pir||S 15433 
hypothetical protein - mouse emb|CAA41978.1| 

(AOyZoyj vJlvT [ivlUS mUSCUlUSJ pn]|l / 1 ihh-ua ajsl 

gene [Mus musculus] 


7.7 


1579 


8567792 


gi|8567792|gb|AAF76364.1| (AC013428) I-box binding 
factor, putative [Arabidopsis thaliana] 


5.7 


1581 


7638161 


gi|7638 161|gb|AAF65408. 1 |AF2383 12_1 (AF2383 12) 

PUlaUVC oCniltr ULlXCUlllilw UJ.VJlwJJ-1 -ISI 1 inOv lvxx*\sXJ 

[Dictyosteli\im discoideum] 


9.2 


1583 


12644495 


gi|12644495|sp|Q9Z7G7|EX5B_CHLPN 
EXODEOXYREBONUCLEASE V BETA CHAIN 
dbj|BAA98945.1| (AP002547) exodeoxyribonuclease V, 
beta [Chlamydophila pneumoniae J 13 8] 


7.2 


1588 


11350667 


gi|11350667|pir||D83160 nitrite extrusion protein 1 
PA3877 [imported] - Pseudomonas aeruginosa (strain 
PAOl) emb|CAA75538.1| (Y15252) nitrate extrusion 
protein [Pseudomonas aeruginosa] 
gb|AAG07264.1|AE004805_2 (AE004805) nitrite 
extrusion protein 1 [Pseudomonas aeruginosa] 


i 9.7 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1589 


12644495 


gl| 1 ZO*f n-^f 7 D \sp\ KJzJjL /Or / |JiyL jlJ^l^xlljJrlN 
EXODEOXYRIBONUCLEASE V BETA CHAIN 
dbj|BAA98945.1| (AP002547) exodeoxyribonuclease V, 
beta [Chlamydophila pneumoniae J138] 


5.2 


1590 




gi|305479|gb|AAC37807.1| (LI 1473) envelope 


4 3 


1591 


9055244 


gipU3 jz^f'f[rcijrNJr uoiz3o.i| nimungion yeast partner 

[Mus musculus] gb|AAD39464.1|AF135440_l 
(AF135440) huntington yeast partner C [Mus musculus] 


0.8 


1593 


12666210 


gi|12666210jemb|CAC28083.1| (AL138875) 
bA103J18.2 (novel protein) [Homo sapiens] 


4 


1596 


12855510 


[Mus musculus] 


0.067 


1597 


133747 


gl|133747|sp|rUy©99|KJS12_MlClAJ 30b KiBUaUMAL 
PROTEIN S12 pir|jA26956 ribosomal protein S12 - 

.lYUCrOCOCCUS lUieUS gDj/VAAZDol / \JY11 / foo) 

ribosomal protein S12 (gtg start codon) [Micrococcus 
luteus] 


9.9 


1600 


79Q94.SS 


gi|7292455|gb|AAF47859.1| (AE003480) CG15005 

fre>r>f> r\iwiiir r f" 1 1 li*nc/^T%riilo TYif*lciTi/**cr£icrt"£»T*I 
gCIJC piUUUC/L |JLSI VJoVypiUlct IIlClallUgcloLCrj 


V/.tJ 


1601 


1 

1*+ / T-OO / *T 


gi| 14748674|refpO»_038 133. 1| Prader-Willi/Angelman 


ft Q 


1602 


1 197809ft 


gi|l 1278020)pir||H82215 serine transporter VC1301 
[imported] - Vibrio cholerae (group Ol strain N 16961) 
gb|AAF94460.1| (AE004210) serine transporter [Vibrio 

r* ri pro pi 
CrUUiClclvJ 


7 8 


1607 


9757538 


gi|9757538|dbj[BAB08122.1| (AB030852) maturase 
[Lilium rubellum] 


6.6 


1618 


12249143 


gi|12249143|ref|NP_066194.2| NADH dehydrogenase 
subunit 5 [Schistosoma japonicum] gb|AAG13134.2| 
(AF215860) NADH dehydrogenase subunit 5 
[Schistosoma japonicum] 


5.9 


1620 


7495508 


gi|7495508|pir||T18993 hypothetical protein C06B8.1 - 
Caenorhabditis elegans emb|CAB03850,l| (Z81463) 
Similarity to C. elegans zinc finger proteins, contains 
similarity to Pfam domain: PF00104 (Ligand-binding 
domain of nuclear hormone receptor), Score=-13.7, E- 
value=0.051,N=l [Caenorhabditis elegans] 


2.9 
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1621 


7288570 


gi|7288570|gb|AAF45170.11AF218963_l (AF218963) 
period [Drosophila miranda] 


4.3 


1622 


7707666 


gi|7707666|dbj|BAA95343.1| (AB027560) ATPase 
subunit 6 [Echinococcus vogeli] 


4.1 


1625 


4761646 


gi|476 1646|gb|AAD29428 . 1 1 AF 1 39060_1 (AF 1 39060) 
transmembrane cell adhesion receptor MUA-3 precursor 
[Caenorhabditis elegans] emb|CAA83226.2| (Z30974) 
contains similarity to Pfam domain: PF00057 (Low- 
density lipoprotein receptor domain class A), 
Score=52.9, E-value=2.2e-12., N=3, second half (former 
T20G5.3) contains similarity to Pfam domain: PF00008 
(EGF-like domain), Score=32o.3, b-value— 1> 
emb|CAC42345.1| (Z30423) contains similarity to Pfem 
domain: PF00057 (Low-density lipoprotein receptor 

A \ Qs*m-a ^1 Q T? -.roll 1^=9 9^-1 9 XT—^ 

domain class A) 9 &core— 3z.y, xi-vaiue— ^.zo-i^j i> 

second half (former T20G5.3) contains similarity to 

Pfbm rWnnirv PFOOftOR fROF-liVe Hnmain i 
irxam Quiiicim. x r \j\j\j\jo ^dvji xu\.v uvuixeixixi 5 

Score=326,3, E-value=l> 


1.7 


1626 


3582424 


gi|3582424|dbj|BAA33057.1| (AB017255) arginine 
kinase two-domain chain [Pseudocardium sachalinensis] 


4.5 


1628 


9758077 


mlO7^XO771HhilRAR08S?l 11 <AB009052i sucrose 

Jill f / JOu/ / jUUJ 1 JJ-fAJLJv-'O^ ^ x . x | ^njjuvyw^ ouvivjw 

cleavage protein-like [Arabidopsis thahana] 


6.2 


1629 


11610630 


gi|11610630|gb|AAG37436.1| (AY013711) cyclinE 
[Mustela vison] 


10 


1631 


4838093 


gi|4838093|gb|AAD30838.1| (AF103278) 
immunoglobulin heavy chain variable region [Homo 
sapiens] 


3.3 


1635 


12045265 


gi|12045265|rei|NF — 073U7o.l| Air syntnase ru, 
subunit B (atpF) [Mycoplasma genitalium] 

Sp|F4 / o4j |A 1 rxWVl I CLrii Air ax IN 1 ri/vaii o 

CHAIN PRECURSOR pir||F64244 ATP synthase B 
cnain \aipr^ - Mycoplasma geniTauum gu|-rw-vv> / ioj 1 . ij 
(U39722) ATP synthase F0, subunit B (atpF) 
[Mycoplasma genitalium] 


8.5 


1636 


627406 


gi|627406|pir||A54849 collagen alpha 1(VB) chain 
precursor - human 


0.66 


1643 


2246540 


gi|2246540|gb|AAB62665.1| (U93872) glycoprotein M 
[Human herpesvirus 8] 


1.5 


1644 


2388576 


gi|2388576|gb|AAB71457.1| (AC000098) YUP8H12.17 
[Arabidopsis tbaliana] 


4.7 
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N 
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1645 




gi|7490184|pir||T37997 carboxypeptidase y - fission 
yeast (Schizosaccharomyces pombe) pir||T43236 
carboxypeptidase C (EC 3.4.16.5) precursor [validated] 
Mission yeast (Schizosaccharomyces pombe) 
emb|CAB10121.1| (Z97209) carboxypeptidase y 
Schizosaccharomyces pombe] dbj|BAA25568.1| 
(D86560) carboxypeptidase Y [Schizosaccharomyces 

L/vMIXUVj 


0.009 


1646 


7415597 


gi|7415597|dbj|BAA93452.1| (AB026494) 
acyltransferase homolog [Gentiana triflora] 


6.1 i 


164S 


13541404 


gi|13541404|ref|NP_111092.1| DNAhelicase 
(superfamily II) [Thermoplasma volcanium] 
HhilRARS0714 11 fAP000QQ?i DTsIA fielieace 

U.UJ |DA_DJ y / 1*T. 1| ^JTTLi \J\J\JZ7*7JZi ) Uir%r\. llvllVdaG 

[Thermoplasma volcanium] 


7.5 


1649 


6174902 


gi|6174902|sp|Q24767|PER_DROYA PERIOD 
CIRCADIAN PROTEIN pir||S 17286 period clock 
protein - fruit fly (Drosophila yakuba) 
emb|CAA43439.1| (X61127) period [Drosophila 
yakuba] 


0.2 


1650 


L*tJ iojuo 


gi|14318508|ref|NP_116641.1| Ieslp [Saccharomyces 
cerevisiae] sp|P43579|YFB3_YEAST 
HYPOTHETICAL 78.8 KD PROTEIN IN HSP12- 
riXl 1U UN IJbKAjxilNlv^ KxSCjIUJN plT||34oJlO prouauie 
membrane protein YFL013c - yeast (Saccharomyces 
cerevisiae ) emoji^A/vooo'f/.i[ {j^hojldd) on, ion. oyz., 
CAI: 0.14 [Saccharomyces cerevisiae] dbj|BAA09225.1| 

rnSO^l 7^ VFT 01*3f* r^JarrliarrrtTivr.pQ rerpviqiael 


8 8 


1655 




gi|5052950|gb|AAD38784.1|AF149422_l (AF149422) 

ntiLmrwx;n rHnmn csityi**yic1 
U.I1KJHJWI1 [XXIJ.III.IJ od^/lCUujj 


0 000000007 


1660 


2988422 


gi|2988422|gb|AAC39776.1| (AF016903) agrin 
precursor [Homo sapiens] 


3.4 


1661 


14756108 


gi|14756108|ref|XP_029883.1| EGF-like-domain, 
multiple 4 [Homo sapiens] 


5 


1664 


10047313 


cill0047313|dbi|BAB13444 11 (AB046838) KIAA1618 
protein [Homo sapiens] 


0.00001 


1669 


5902048 


gi|5902048|reflNPJ)08974.1|HW-l rev binding protein 
2; Rev interacting protein [Homo sapiens] pir||G02629 
Rev interacting protein Rip-1 - human gb|AAB00557.1| 
(U55766) Rev interacting protein Rip-1 [Homo sapiens] 


0.009 
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1672 




gi|13 122465|gb|AAK12639.1|AF3 17654_1 (AF317654) 


2.4 


1673 


7296752 


gi|7296752|gb|AAF52030.1| (AE003603) CG12147 
gene product [Drosophila melanogaster] 


2.1 


1681 




gi|14767950|re£lXP_040494.1| nuclear RNA export 
factor 5 [Homo sapiens] emb|CAC20428.1| (AJ277654) 


6.9 


1684 


14740021 


gill4740021|reflXP_033953.1| 32575 porno sapiens] 

rp flYP fTC^QSA 11 *39S76 fHnmn Qjmien<?l 

rCll-AX^ LOO " J 1. 11 O^JtXJ \xTXj1lI\J o<*|Jldloj 

ref|XP _ 033955.1| 32577 [Homo sapiens] 


2.5 


1685 


1JJ00700 


gi|13568988|gb|AAK30843.1|AF254571_l (AF254571) 

m-r»\T7+V» /Hi ■f-fv i rp»nti S»"H rvn -fiartor 7 rMll^ mil^Clllu^l 


5.8 


1694 


106323 


gi|106323|pir||A34087 hypothetical protein (L1H 5' 
region) - human 


0.005 


1695 




gi|744444z|pir|| 1 U/Uo / gag/poi poiyprotein - maize 
retrotransposon Hopscotch gb|AAA57005.1| (U12626) 
copia-like retrotransposon Hopscotch polyprotein [Zea 
maysj 


0.49 


1696 


13569915 


gi|13569915|ref]NP_l 12205. 1| amnionless protein 
[Homo sapiens] gb|AAK28532.1|AF328788_l 
(AF328788) amnionless [Homo sapiens] 


4.4 


1697 


13812078 


gi|13812078|refINP_113215.1| hypothetical protein 

rrrtiillarrliji tliRtal ahIA AF9401 1 1IAF083031 8 

(AF083031) hypothetical protein [Guillardia theta] 


5.2 


1699 


462193 


gi|462193|sp|Q06003|GOLI_DROME GOLIATH 
PROTEIN (Gl PROTEIN) pirl|JC1495 regulatory 
protein \ji - rruii ny ^urosopniia meidiiug<iau?ij 
gb|AAA28582.1| (M97204) goliath protein [Drosophila 
melanogaster] 


9.3 


170? 


13376638 


gi|13376638|ref|NPJ)79359.1| hypothetical protein 

T7T TO 1 1 09 rUnmn c-inioncl HViilR AR1 ^001 1 1 

fijjzll/o [Homo sapiensj uO}\dj\oij<jkii.i\ 
(AK024781) unnamed protein product [Homo sapiens] 


5E-49 


1705 


7445803 


gi|7445803lpir||JC5348 cdd4 protein - Clostridium ' 
difficile emb|CAA63569.1| (X92982) cdd4 [Clostridium 
difficile] 


0.28 


1706 


10175016 


gi|10175016|dbj|BAB06115.1|(AP001515) 
BH2396~unknown [Bacillus halodurans] 


9 


1709 


3953531 


gi|3953531|dbj|BAA34722.1| (AB015856) ATF6 
[Homo sapiens] 


9.1 
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1712 




gi|14329676|emb|CAC40651.1| (AI242782) centaurin 

JClix \ITXjIII\j odJJlCil&J 


0.00003 


1714 


10432382 


gi|10432382|emb|CAC10340.1| (AL139421) dJ717I23.1 
(novel protein similar to Xenopus laevis Sojo protein) 
Homo sapiens] 


0.00006 


1717 


8247357 


rrtlfcOA7^7UmViir i A'RQ?QS7 11 fAT401026^ hvnothetical 
protein [Thermoanaerobacter thermohydrosulfuricus] 


1.4 


1719 


7487043 


gi|7487043|pir||T08926 hypothetical protein T15N24.50 
- Arabidopsis thaliana embICAB77061.1| (AL078465) 
putative protein [Arabidopsis thaliana] 
emb|CAB79515.1| (AL161565) putative protein 
[Arabidopsis thaliana] 


0.39 


1721 


13507765 


gi|13507765|refjNP_l 09714. 1| similar to CrlPases 
[Mycoplasma pneumoniae] sp|P75088IY024_MYCPN 

PROBABLE GTP-BIND1JNG rKU 1 JbJJN MuUZ4 
HOMOLOG (B01_ORF362) pir||S73454 probable GTP 
binding protein yyaF - Mycoplasma pneumoniae (strain 
ATCC 29342) gb|AAB95776.1| (AE000015) similar to 
GTPases [Mycoplasma pneumoniae] 


6 


1725 


11071788 


gi|110717881emb|CAC14632.1| (AL449144) 
hypothetical protein P2 14.26 [Leisbmania major] 


9 


1727 


14325595 


gi|14325595|dbj|BAB60498.1| (AP000996) hypothetical 
protein [Thermoplasma volcanium] 


2.5 
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SEQ n> 
NO 


ACCESS 
N 
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P VALUE 


1728 


6498442 


gi|6498442|dbj|BAA87845.1| (AP000815) ESTs 
AU081301(E20138),C99280(E10593) correspond to a 
region of the predicted gene.~Similar to Arabidopsis 
thaliana chromosome H BAC T26B15 genomic 
sequence,unknown protein. (AC004681) [Oryza sativa] 
dbj|BAB00648.2| (AP002804) ESTs 
AU081301(E20138),C99280(E10593) correspond to a 
region of the predicted gene.~Similar to Arabidopsis 
thaliana chromosome II BAC T26B15 genomic 
sequence; unknown protein (AC004681) [Oryza sativa] 
dbj|BAB17744.1| (AP002862) contains ESTs 
AU08 1301 (B20 1 3 yzou^iii UDy3/^sinuiar to 
Arabidopsis thaliana chromosome 2 9 
F12C20. 1 1 -unknown protein [Oryza sativa] 
dbj|BAB44118.1| (AP003103) contains ESTs 
AU081301(E20138),C99280(E10593)~similarto 
Arabidopsis thaliana chromosome 2, 
F12C20.1 l~iraknown protein [Oryza sativa] 


1.1 


1741 


11467083 


gi|11467083|reflNP_042559.1| NADH dehydrogenase, 
subumt 3 [Acanthamoeba castellanuj 
sp|Q37382|NU3M_ACACA NADH-UBIQUINONE 

dehydrogenase (ubiquinone) (EC 1.6.5.3) chain 3 - 

gbjAADl 1852.1] (U12386) NADH dehydrogenase, 
subunit 3 [Acanthamoeba castellanii] 


5 


1742 


12843826 


i mil 2843 R26ldhilBAB26 128 11 (AK009187) nutative 
[Mus musculus] 


0.00000002 


1743 




gi|4928550|gb|AAD33637.1| (AF133876) DBL alpha 
rvrntein rPlasmodium falciparum! 


6.2 


1744 


6503033 


OTl^SO^O^llcrhlA AF14SS7 1IAF176666 1 fAF176666i 
F-box leucine-rich repeat protein 5 [Xenopus laevis] 


3.8 


1745 


6562750 


gi|6562750|emb|CAB62889.1| (AL035475) hypothetical 
protein, MAL4P2.48 [Plasmodium falciparum] 


8.9 


1750 


7498998 


gi|7498998|pir||T16057 hypothetical protein F13D11.2 - 
Caenorhabditis elegans 


i 54 


1 1751 


6939792 


gi|6939792|dbj|BAA90658!l| (AB037920) HA-17 
[Clostridium botulinum] 


7.3 
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SEQID 
NO 


ACCESS 

N 


DESCRIPTION 


P VALUE 


1754 


14195008 


gi|14195008|sp|Q9JI55|PLEl_CRIGR PLECI1N 1 
(PLTN) (PCN) (300-KDA INTERMEDIATE 
FILAMENT-ASSOCIATED PROTEIN) (IFAP300) 
gb|AAF70372.1| (AF260753) plectin [Cricetulus 
griseus] 


1.6 


1756 


74519 


gi|74519|pir|lGNWVR4 structural polyprotein - rubella 
virus (strain Therien) 


4.4 


1760 


6324076 


gi|6324076|refINP_014146.1| Ynl253wp 
Saccharomyces cerevisiae] sp|P53851|YNZ3_YEAST 
HYPOTHETICAL 47.2 KD PROTEIN IN SIP3- 
MRPL30 INTERGENIC REGION pir||S63226 
hypothetical protein YNL253w - yeast (Saccbaromyces 
cerevisiae) emb|CAA65491.1| (X96722) ORF N0860 
[Saccbaromyces cerevisiae] emb|CAA96160.1| 
\Zj l iDzy) xINJ_iZj^w [oaccnaroinyccs cerevisiaej 




L /OZ 


1 /UoUoZ 


gi|1708082|sp|P50900|GUX2_CLOSR 
EXOGLUCANASE D PRECURSOR 
(EXOCELLOBIOHYDROLASE H) (1,4-BETA- 
CELLOBIOHYDROLASE II) (AVICELASE II) 
emb|CAA932©u.l| (Zo93j9) aviceiase 11 [Uiostndium 
stercorarium] 


1.9 


1 

1 /O-J 




gl|2224o3o|emb|CAA4D3oo.l| (AWy/4; putative 
protein-tyrosine phosphatase [Rhodobaeter capsulatus] 


5.2 


1768 


6513773 


gi|6513773|gb|AAF14748.1|AF197756_l (AF197756) 
maturase [rieuy osmum aroorescensj 


ft tn 


1775 


14732870 


gi|14732870|ref|XP_029018.1| hypothetical protein 
FLJ10647 [Homo sapiens] ref|XP_001969.2| 
hypothetical protein FLJ 10647 [Homo sapiens] 


1.3 


1779 


7521942 


gi|7521942|pir||T29u96 gag polyprotem - murine 
endogenous retrovirus ERV-L emb|CAA73250.1| 
(Y12713) Gag polyprotein [Mus musculus] 


0.071 


1785 


7508408 


gi| / 3Uo'fUojpir|| l z 3 zz> l nypouicticai protein i ^txxiv/.H' - 
Caenorhabditis elegans emb|CAA90944.1| (Z54216) 
T24H10.4 [Caenorhabditis elegans] 


2.9 


1788 


7460247 


gi|7460247|pir|lB71612 hypothetical protein PFB0555c - 
malaria parasite (Plasmodium falciparum) 
gb|AAC71900.1| (AE001402) hypothetical protein 
[Plasmodium falciparum] 


6.4 
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1789 


4337102 


gi|4337102|gb|AAD18078. 1|AAD18078 (AF129756) 
G6f [Homo sapiens] 


2E-24 


1792 


1723494 


gi|1723494|sp|Q10413|YD88_SCHPO VERY 
HYPOTHETICAL 13.3 KD PROTEIN C1F3.08C IN 
CHROMOSOME I pir||T38079 very hypothetical 
protein SPAClF3.08c - fission yeast 

Schizosaccharomyces pombe) emb|CAA94626.1| 
(Z70690) very hypothetical protein 

Schizosaccharomyces pombe] 


7.2 


1793 


4589921 


j 5i|45 8 992 1 |dbj |B AA7692 7 . 1 [ (ABO 1/1 92) 
molybdopterin biosynthesis protein [Clostridium 
perfringens] 


4.2 


1799 


7494302 


gi|7494302|pir||E7161 1 hypothetical protein PFB0580w 
malaria parasite (Plasmodium falciparum) 
gb|AAC71905.1| (AE001404) hypothetical protein 
Plasmodium falciparum] 


3.3 


1800 


14318508 


gi|14318508|ref|NP_l 16641. 1| Ieslp [Saccharomyces 
cerevisiae] sp|P43579|YFB3_YEAST 
HYPOTHETICAL 78.8 KD PROTEIN IN HSP12- 
HXT10 INTERGENIC REGION pir||S48316 probable 
membrane protein YFL013c - yeast (Saccharomyces 
cerevisiae) emb|CAA86347.1| (Z46255) orf, len: 692, 
CAI: 0.14 [Saccharomyces cerevisiae] dbj|BAA09225.1| 
(D50617) YFL013C [Saccharomyces cerevisiae] 


6.6 


XoUZ 


14325595 


gi|14325D95|abj|oABoU4yo.i| (Aruuuyyo; nypOLtteucai 
protein [Thermoplasma volcanium] 


2 


1809 


7520399 


gi|7520399|pir||Tl 1689 NADH dehydrogenase 
(ubiquinone) (EC 1.6.5.3) chain 5 - Graphium sarpedon 
mitochondrion (fragment) abj | B AAz o 1 87 . 1 1 
(ABO 13 147) NADH dehydrogenase subunit 5 
[Graphium sarpedon] 


4.9 


1816 




gi|423981|pir||A46193 88K E-26-specific domain 
protein Pok - fruit fly (Drosophila melanogaster) 
dbj|BAA01080.1| (D10228) Ets domain protein 


0.23 


1818 


7463036 


gj|7463036|pir||C70177 beta-glucosidase homolog - 
Lyme disease spirochete gb|AAC66976.1| (AE001 163) 
beta-glucosidase, putative [Borrelia burgdorferi] 


8.4 


1819 


3851471 


gi|3851471|gb|AAC72292.1| (AF037295) sulfonylurea 
receptor- 1 [Mus musculus] 


3.4 
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1820 


9845030 


oxoprolinase precursor [Alcaligenes faecalis] 


5.8 


1821 


6599044 


protein [Asterias rubens] 


6.7 


1822 


7715984 


gi|7715984|gb|AAF68235. 1|AF206244_1 (AF206244) 
seroreactive antigen BMN1-2 [Babesia microti] 


0.067 


1823 


7293329 


gi| /Z7J jzi/jgu|jf\Ar'fo /uy. i| ^Aiiuuj3u*f i ; iajoioz gene 
product [Drosophila melanogaster] 


2.8 


1824 


2136095 


gi|2136095|pir||I58381 receptor tyrosine kinase isoform 
FLT4 long - human (fragment) gb|AAB28539.1| 
(S66407) receptor tyrosine kinase isoform FLT4 long, 
FLT41 {C-terminal} [human, Peptide Partial, 71 aa] 
[Homo sapiens] 


0.0007 | 


1826 


7522093 


gi| /3Zzoy3[pir|| I JUoUy plasminogen related growth 
factor receptor 3 - Fugu rubripes emb|CAA09086.1| 
(AJ010317) plasminogen related growth factor receptor 
3 [Takifugu rubripes] 


4.7 


1832 


13622999 


gi|13622999|gb|AAK34670.1| (AE006621) hypothetical 
protein [Streptococcus pyogenes Ml GAS] 


3.7 


1834 


6319504 


gi|6319504|ref[NP_009586.1| Ybr030wp 
[Saccharomyces cerevisiae] sp|P38222[YBO0_YEAST 
HYPOTHETICAL 62.6 KD PROTEIN IN CDS1- 
RPL4A ESfTERGENIC REGION pir||S45886 
hypothetical protein YBR030w - yeast (Saccharomyces 
cerevisiae) emb|CAA53686.1| (X76078) YBR0314 
[Saccharomyces cerevisiae] emb|CAA84972.1| 
(Z35899) ORF YBR030w [Saccharomyces cerevisiae] 
orfll2206497K ORF YBR0314 rSaccharomvce<? 
cerevisiae] 


0.65 


1835 


7290986 


gj|7290986|gb|AAF46425.1| (AE003445) CG3898 gene 
product [Drosophila melanogaster] 


3.1 


1840 


14725330 


gi|14725330|reflXP_002254.2[ mitochondrial 
translational initiation factor 2 precursor [Homo 
sapiens] 


4.2 ! 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1842 




gi|7509362|pir||T3 1492 hypothetical protein 
YlloAoC.20 - Caenornabaitis elegans 
emb|CAB55 126. 1| (AL1 17204) predicted using 
Genefinder-contains similarity to Pfam domain: 
PF00642 (Zinc finger C-x8-C-x5-C-x3-H type (and 

eimitor^ ^ Qr>nrf*=71 ft F-vn1iiP=3 1e-19 "N=2~cDNA 

S 111111 3X ) •/} OV^UIC — / 1 .Vj !_/ VdlLlW J. iv 1 -7 S Vl/iirx 

EST yk247a8.5 comes from this gene [Caenorhabditis 


8.4 


1846 


6093793 


gi|6093793|sp|Q64 181 |PROP_CAVPO PROPERDIN 
PRECURSOR gb|AAB35918.1| (S81116) properdin 
[guinea pigs, spleen, Peptide, 470 aa] [Cavia] 


4.1 


1848 


7300538 


gi|7300538|gb|AAF55691.1| (AE003727) CG7411 gene 
product [Drosophila melanogaster] 


2.7 


1849 


1 730077 


gi|1730077|sp|P18160|KYKl_DICDI NON- 

K KC ,hr 1 LJJx. 1 I XSAJoliNli JVlTN Aoli JiUiul JLr I olJ f\. 

(TYROSINE-PROTEIN KINASE 1) pir||T18276 non- 
receptor tyrosine kinase - slime mold (Dictyostelium 
discoideum) gblAAB41125.1| (U32 174) non-receptor 
■K/mciriF* lrinacft IT)i p.tvn «rte1 111 m discoideuml 


8.4 


1851 


5326919 


gi|5326919|emb|CAB46239.1| (AJ133488) SCO- 
spondin [Bos taurus] 


2.7 


1853 


13811938 


gi| 138 1 1938|emb|CAC03433.2| (AL1 18505) 
dJ1056H1.2.1 (novel protein similar to mitogen 
inducible protein MIG-2 (isoform 1)) [Homo sapiens] 


1 2E-21 


1859 


8954377 


gi|8954377|ref|NP_059365. 1| haem lyase 
[Cyanidioschyzon merolae] pir||A58932 cytochrome C- 
type biogenesis protein CCMF - Cyanidioschyzon 
merolae mitocnononon q\j\\di\i\j\}jj.i .i\ ^uo^owi; 
cytochrome C-type biogenesis protein CCMF 
[Cyanidioschyzon merolae] 


2.3 


1860 


13376638 


gi|13376638|reflNP_079359.1| hypothetical protein 
FLJ21128 [Homo sapiens] dbjlBAB15001.1| 
(AK024781) unnamed protein product [Homo sapiens] 


2E-47 


1862 


14602664 


gi| 14602664|gb|AAH09855. 1|AAH09855 (BC009855) 
Similar to nucleolin [Homo sapiens] 


4E-45 
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Table 3B Nearest Neighbor (BkstX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 




P VAT TJF 


1864 


7512078 


gi|75 12078|pir||T30879 dynein heavy chain isotype 5A - 
sea urchin (Tripneustes gratilla) (fragment) 
gb|AAA6359Ll| (U03977) dynein heavy chain isotype 
5 A [Tripneustes gratilla] 


1.7 


1865 


7688347 


gi)7688347|emb|CAB89836.1| (AJ242516) AmpE 
protein [Salmonella typhimiirium] 


2.8 


1867 


4758568 


gi|4758568|refINP_004497.1| heat shock transcription 
fector 2 [Homo sapiens] ref|XP_004466.2| heat shock 
transcnption fector 2 [Homo sapiens] ref|XPJ)51573.1| 
heat shock transcription factor 2 [Homo sapiens] 
sp|Q03933|HSF2JriUMAN HEAT SHOCK FACTOR 
PROTEIN 2 (HSF 2) (HEAT SHOCK 
TRANSCRIPTION FACTOR 2) (HSTF 2) pir||A41 138 
heat shock transcription fector HSF2 - human 
gb|AAA36017.1| (M65217) HSF2 [Homo sapiens] 


0.33 


1869 


13358509 


gi|13358509|ref|NPJ)78688.1| orfl07 Qymphocystis 
disease virus 1] 


8.1 


1870 

A O / \J 


6580323 


gi|6580323|emb|CAB63392.1| (AL132864) cDNA EST 
EMBL:T00048 comes from this gene-cDNA EST 
EMBL:T00047 comes from this gene-cDNA EST 
yk390e6.3 comes from this gene-cDNA EST yk5 12a3.3 
comes from this gene-cDNA EST yk5 12a3.5 comes 
ironi tnis gene~cjL/iN i\ jdo 1 yioozaz. ^ comes xrom this 
ge> 


0.012 


1871 


1 1 <l^O'7'7£ 

i ijoy / /o 


• a n iof aii/i * linn a /\ c r\ 1— -- . ii * • _ _ i . ±. _ * . _ 

gi|l 1359776|pir||T45059 hypothetical protem 
Y39B6B.gg [imported] - Caenorhabditis elegans 
emb[CAB6093o.l| (AJL132o96) predicted usmg 
Genefinder; preliminary prediction [Caenorhabditis 
elegans] 




1876 


5453171 


gi|5453171|gb|AAD43464.1| (AF113915) pre-mRNA 
splicing fector [Heterodera glycines] 


2.1 


1877 


5453171 


gi|5453171|gb|AAD43464.1| (AF113915) pre-raRNA 
sphcing factor [Heterodera glycines] 


2.1 


1880 


14752353 


cnl147S71^lreflXP 050519 11 annextn A2 IHomo 
sapiens] 


0.27 


1881 


7494821 


gi|7494821|pir||T31996 hypothetical protein B0281.5 - 
Caenorhabditis elegans gb|AAB66084.1| (AF016666) 
similar to human tumor necrosis fector-alpha-induced 
protein B12 (NID:gl79304) [Caenorhabditis elegans] 


3.8 
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NO 


N 


DESCRIPTION 


P VALUE 


1883 


480482 


gi|480482|pir|jS36953 cytochrome-c oxidase (EC 
.9.3.1) chain III - Herpetomonas samuelpessoai 
mitochondrion gb|AAD09166.1| (L10852) cytochrome 
oxidase subunit HI [Herpetomonas pessoai] 


0.78 


1885 


5174493 


gi|5174493|ref|NPJ)06050.1| laminin, gamma 3 
precursor [Homo sapiens] 

gOIA-AX/joyyi.iiAru^f io^3_i ^/vjtu*hoj3^ laminin 
gamma 3 chain precursor [Homo sapiens] 


3E-12 


1886 


8922792 


gi|8922792|ref]NP_060753.1| hypothetical protein 
FLJ10956 [Homo sapiens] ref|XP_007214.2| 
hypothetical protein FLJ10956 [Homo sapiens] 
clDj |x5AA9 1 yz 0 . 1 1 1 6 1 o j UDnaiucu protein prouuct 
Homo sapiens] 


5E-39 


1889 


7479561 


gi|7479561|pir||T35135 hypothetical protein aC4rio.U4c 
SC4H8.04c - Streptomyces coelicolor 
emb|C> AAl 5 o / 1 . 1 1 ^ALrUZuy j <s j nyp otae ticai protein j 
SC4H8.04c [Streptomyces coelicolor A3(2)] 


0.5 


1890 


7461128 


gi|7461 128|pir||T03057 hypothetical protein 032R - 
Chilo iridescent virus gD|AAi>9443 1 . l| (Aruu^ 
hypothetical protein 032R [Chilo iridescent virus] 


0.33 


1891 


8778367 


gl|8778367|gblAAr79375.1|ACUU /oo (AA^UU fo&I) 
F1504.28 [Arabidopsis thaliana] 


6.6 


1894 


13475247 


gi| 13475247|ref|NP_1068 1 1. 1| unknown protein 
[Mesorhizobium loti] dbj|BAB52597.1| (AP003008) 
unknown protein [Mesorhizobium loti] 


2.3 


1 OAT 

1895 


228595© 


gi|2285958|emb|CAA70903.1| (Y09763) GABRE 
[Homo sapiens] 


0 7 


1896 


14729939 


gi| 14729939lreffXP_038475.il DKFZP564J102 protein 
(Homo sapiens] 


0.007 


1898 


t\ A A C\C\ 

7504499 


gi|7504499|pir||T32750 hypothetical protein F57B10.1 - 
Caenorhabditis elegans gblAAB96719.1| (AF039713) 
Hypothetical protein to /JtJiu.i [i^aenornaDains eiegans 


4 9 


1900 


10581310 


eill0581310leblAAG20067 11 (AE005086) methionyl 
aminopeptidase; Map [Halobacterium sp. NRC-1] 


8.9 


1902 


111814 


gi|l 1 1814|pir|lS21347 hypothetical protein 3 - rat 
emb|CAA37646.1| (X53581) ORF3 [Rattus norvegicus] 


0.3 


1903 


7290766 


gi|7290766|gb|AAF462 11.1| (AE003439) CG4557 gene 
product [Drosophila melanogaster] 


0.38 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 

IN | 


r»T7 C r^T? TDTT rvM 


P VAT TTF 


1904 


5139521 


gi|5139521|emb|CAB45562.1| (AJ238798) CTRP 
protein [Plasmodium berghei] dbj|BAA82322.1| 
[AB027129) adhesive protein-like molecule 
Plasmodium berghei] gb|AAF73 158. 1|AF 14977 1_1 
/yt 149771) ookinete protein [Plasmodium oergneij 


Q 

o 


1905 




gi|7489900|pir||T18287 protein-tyrosine kinase (EC 
2.7. 1 . 1 12) - slime mold (Dictyostelium discoideum) 
gb|AAB04999.1| (U64830) protein tyrosine kinase 
[Dictyostelium discoideum] 




1908 


6002776 


gi|6002776|gb|AAF00134. 1|AF149806_1 (AF149806) 
hypothetical protein [Oryza saliva] 


n i s 


1913 


14773348 


gill4773348lrefpCP_038450.il 20849 [Homo sapiens] 


2E-50 


1916 


7301187 


gi|73011871gblAAF56319.1| (AE003748) CG5794 gene 
product [Drosopbila melanogaster] 


8 


1 Q1 Q 


3378685 


gi|3378685|emb|CAA76071.1| (YlolU4) replicase 
protein [Physalis mottle tymovirus] 


0.13 


i ni o 
191V 


4501915 


gi|4501915|ref|NPJ)03807.1| a disintegrin and 
metalloproteinase domain 9 preproprotein; meltrin 
gamma [Homo sapiens] gb|AAC50403.1| (U41766) 
metaUoprotease/disintegrin/cysteine-rich protein 
precursor [Homo sapiens] 


0.002 


1922 


14587070 


gi|14587070|gb|AAK70463.1|AF387344_4(AF387344) 
spore germination protein GerLC [Bacillus cereus] 


8 A 

0.*T 


1924 


7291161 


gi|7291161|gb|AAF46595.1| (AE003450) CG2892 gene 
product [Drosopnila melanogaster j 




iyzD 


7446016 


gi|7446016|pir||E70895 hypothetical glycine-rich protein 
Rvl087 - Mycobacterium tuberculosis (strain H37RV) 
emb|CAA17203.1| (AL021897) PE — PCrRb 
[Mycobacterium tuberculosis] 


3.6 


i no/; 


11595522 


gi| 1 1595522|emb|CACl 83 16. 1| (AL45 1022) 
hypothetical protein [Neurospora crassa] 


5.9 


A 7JV/ 


14043326 


Unknown (protein for MGC:747) [Homo sapiens] 


8E-76 


1931 


13810543 


gi|13810543|dbj|BAB43950.1| (AB051633) ookinete 
surface protein Pos28-2 [Plasmodium ovale] 


3.1 


1933 


7206826 


gi|72068261gb|AAF39985.1| (AC006696) contains 
similarity to other proline-rich proteins [Caenorhabditis 
elegans] 


6.6 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ ID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1934 


14749721 


gi|14749721|reflXP_027893.1| similar to ALU 
STTRFAMTT V SRI ^FOTTFTsTfF CONTAMINATION 
WARNING ENTRY (H. sapiens) [Homo sapiens] 


2.1 


1935 


12324211 


™ll 9^*9/19 1 1 UKI A AnS9077 1IAP019fV7Q IS 

(AC012679) putative proline-rich protein precursor; 
93710-91881 [Arabidopsisthaliana] 


2.1 


1940 




gi|4204305|gb|AAD10686.1| (AC003027) Hypothetical 

r\Twfr**ifi r A T**aT"\i/"1/'Yt"\C1 G "t" rid 11 51 11 51 1 
jrOLCUl L-rVl aUl\X\J\JQMB lUdllcLUclJ 


5.4 


1955 


X*f /OJ 1 1 o 


gi|14783118|reflXP_043478.1| hypothetical protein 

YP 04^4751 rHnmn <55iniPTiQl 


4.7 


1959 


Ijj /joDU 


gi|13375860|reflNP_078907.1| hypothetical protein 
FLJ23342 [Homo sapiens] dbj|BAB15618.1| 

( AT^09r*QQS^ unnnmpH nrotein nrnduct FHnmo saoiensl 


0.2 


1962 


' 7304316 


gi|7304316|gb|AAF59348.1| (AE003844) CG2052gene 
product [Drosophila melanogaster] 


8.2 


1963 


7513579 


gi|75 13579|pir||T09064 l-acylglycerol-3-phosphate O- ! 
acyltransierase z.^.i.di^ - mouse gD|AADozuu^.i| 
(AF03000 1) lysophatidic acid acyl transferase-alpha j 
[Mus musculus] 


1.1 


1964 


7512671 


gi|75 12o 1 l|pir|j i 12D4D nypotneucai protein 
DKFZp434N074. 1 - human (fragments) 
emb|CAB46377. 1| (AL096732) hypothetical protein 
[Homo sapiens] 


3.1 


1972 


13249541 


r^lilO/lQ^/l 1 IrVki A AfcTl 1 1 /"AVOI SS07^ 
gl| 1 3/4^34 l|gD|Af\Jvl 34 ^Aiuijj^/; 

dissimilatory sulfite reductase subunit B [vmcultured 

suliute-reducing bacterium] 


2 


1976 


8134766 


gi|8134766|sp|Q9ZES2|TRPE_BUCTC 

AJN 1 iTJtvrVlN JLLtA. 1 JQ o X IN 1 xXrVOJi V^\^lVXX^\»/.rNXvrN x x 

emb|CAA09993. 1| (AJ012333) anthranilate synthase 
large subunit [Buchnera aphidicola] 


5.6 


1977 


7518768 


gi|7518768|piii|A71111 hypothetical protein PH0656 - 
Pyrococcus horikoshii dbj|BAA29747.1| (AP000003) 
107aa long hypothetical protein [Pyrococcus horikoshii] 


9.3 


1982 


14749721 


gi| 14749721|reflXP_027893 . 1| similar to ALU 
SUBFAMILY SB1 SEQUENCE CONTAMINATION 
WARNING ENTRY (H. sapiens) [Homo sapiens] 


1.9 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1983 


2498123 


GROWTH FACTOR BINDING PROTEIN 

(ALS) pir||JC5239 insulin-like growth factor acid-labile 
chain - baboon 


4.4 1 


1990 


2911545 


gi|2911545|emb|CAA75449.1| (Y15173) E6 protein 
[Human papillomavirus type 75] 


4.2 


1992 


1 000764 


gi|1090764|prf||2019432A cyclin-dependent kinase 5 

+1-1/0+07* 1 RrtC fill 11*11 cl 

ctOLi VctLvJl [DUs IxLUluaJ 




1993 


7446170 


gi)7446170|pir||T02529 myb-related protein F13M22.13 
- Arabidopsis thaliana gb|AAC23633.1| (AC004684) 
putative MYB family transcription factor [Arabidopsis 

putative transcription factor [Arabidopsis thaliana] 


5.9 


1994 


4501915 


gi|4501915|ref|NP_003807.1| a dismtegrin and 
metalloproteinase domain 9 preproprotein; meltrin 
gamma [rlomo sapiens J go|AAU^U4U3.i| 
metaDoprotease/disintegrin/cysteine-rich protein 
precursor [Homo sapiens] 


0.002 


1997 


12853260 


gi|12853260|dbj|BAB29697.1j (AK015063) putative 
[Mus musculus] 


6.1 


2000 


6492289 


g*|6492289|gb|AAF14258.1|AF137068_l (AF137068) 
cubilin [Cards famiharis] 


4.6 


2001 


2828501 


gi|2828501|sp|P40899|ISP3JSCHPO SEXUAL 
DIFFERENTIATION PROCESS PROTEIN ISP3 
pir||T38112 sexual differentiation process protein isp3 - 
fission yeast (Schizosaccharomyces pombe) 
emb|CAB03599.1| (Z81312) sexual differentiation 
process protein isp3; meiotic expression upregulated 
[Schizosaccharomyces pombe] 


7.9 


2022 


4206157 


ffil4206157teblAAD 11433 11 <AF109404^ transnosase 
[Streptomyces scabiei] 


3.4 


2026 


11466224 


gi|11466224|ref|NP_062847.1| ORF1, contains 4 trans 
membrane regions, putative [Physarum polycephalum] 
dbj|BAB08081.1| (AB027295) ORF1, contains 4 trans 
membrane regions, putative [Physarum polycephalum] 


1.2 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


2027 


7507536 


»i| / r> u / 3 d ojpirjj i Z4 / ^ y nypotneticai protein i uy _ 1 1 . ** - 
Caenorhabditis elegans emb|CAB03533.1| (Z81147) 
contains similarity to Pfam domain: PF01757 (Domain 
of unknown function), Score=594.0, E-value=3e-175, 
N=l [Caenorhabditis elegans] 


4.3 


2028 


6102749 


ml6102749lemhlCAR59307 11 CAJ236287 , NADH 
dehydrogenase subunit F [Carphalea glaucescens] 


1.9 


2031 




gi|7486224|pir||T08553 hypothetical protein 
F27B13.160 - Arabidopsis thahana emb|CAB43667.1| 

/AT niito+iT/p* nrntpin r AfoVviHrwicic "frlisilisitial 

emb|CAB79750.1| (AL161575) putative protein 

r AraHiHrinciQ thaliarml 


2.7 


2032 


7462619 


gi|7462619|pir||F72210 hypothetical protein TM1801 - 
Thermotoga maritima (strain MSB8) 

hypothetical protein [Thermotoga maritima] ' 


6.2 


2035 


7296128 


gi|7296128|gb|AAF5 1422.1| (AE003587) CG4629 gene 
product [Drosophila melanogaster] 


3.4 


2038 


14760974 


pro-pol polyprotein (H. sapiens) [Homo sapiens] 


1E-13 


2041 


14601134 


gi|14601134|ref|NP_147662.1| hypothetical protein 
[Aeropyrum pernix] pir|p72698 hypothetical protein 
APE1008 - Aeropyrum pernix (strain Kl) 
dbj|BAA79992.1| (AP000060) 123aa long hypothetical 
protein [Aeropyrum pernix] 


2.6 i 


2043 


4507537 


gi[4507537|ref|NPJ)03260.1| nuclear receptor subfemily 
2, group E, member 1; tailless (Drosophila) homolog; 
tailless homolog (Drosophila) [Homo sapiens] 
refjXP_004530.il 54551 [Homo sapiens] 
reiTXP_038737.il nuclear receptor subfemily 2, group 
E, member 1 [Homo sapiens] 
sp|Q9Y466[NR2 INHUMAN ORPHAN NUCLEAR 

IJlnr'TTPTnT* "NTR9F1 fTsJTTr , T FAP RFPFPTOR TT Xi 
IV—V^lilr 1 LI XV INxvZIl 1 Uv^JUlwrVIv. xVE/v^J-ix 1 \JSK x 

(TAILLESS HOMOLOG) (ILL) (HTLL) 
emb|CAA73725.1| (Y13276) Tailless protein [Homo 
sapiens] emb|CAB75626.1| (AL078596) dJ429G5.1 
(nuclear receptor subfemily 2, group E, member 1) 
[Homo sapiens] gb|AAG31945.1|AF220532_l 
(AF220532) orphan nuclear receptor [Homo sapiens] 


9E-10 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


2044 


7300319 


gi|7300319|gb|AAF55480.1| (AE003719) CG7305 gene 
product [Drosophila melanogaster] 


3.3 


2046 


7498832 


gi|7498832|pir||T34212 hypothetical protein F10E7.4 - 
Caenorhabditis elegans gb|AAA82427.1| (U41264) 
coded for by C. elegans cDNA yk99a6.5; coded for by 
C. elegans cDNA yk72g6.5; coded for by C. elegans 
cDNA yk99a6.3; coded for by C. elegans cDNA 
yk72g6.3; coded for by C. elegans cDNA ykl27a2.5; 
coded for by C. elegans cDNA ykl27a2.3; Simila> 


9.7 


2050 


13643847 


gi|13643847|refIXP_0 11044.2| 8-oxoguanine DNA 
glycosylase [Homo sapiens] ref|XPJ)16414.1| 8- 
oxoguanine DNA glycosylase [Homo sapiens] 
ref]XPJ)16415.1| 8-oxoguanine DNA glycosylase 
[Homo sapiens] ref|XP_03 1967.1) 8 -oxoguanine DNA 
glycosylase [Homo sapiens] ref|XP_031959.1| 8- 
oxoguanine DNA glycosylase, isoform 2d [Homo 
sapiens] ref]XP J)3 1 96 1 . 1 1 8-oxoguanine DNA 
glycosylase, isoform la [Homo sapiens] 
ref|XP_052277.1| similar to 8-oxoguanine DNA 
glycosylase (H. sapiens) [Homo sapiens] 


1.2 


2051 


7474551 


gi|7474551|pir||E69792 conserved hypothetical protein 
yeeA - Bacillus subtilis emb|CAB12496.1| (Z99107) 
similar to hypothetical proteins [Bacillus subtilis] 


7 


2060 


3913201 


gi|3913201|sp|Q585 1 1|CCA_METJA TRNA 

T*JT Tr*T Vt nTmVT TT? AXTCPT7P A CT7 /TP XT A 

ADENYLYLTRANSFERASE) (TRNA CCA- 
PYROPHOSPHORYLASE) (CCA-ADDING 
ENZYME) gb|AAB991 14.1| (U67554) tRNA 
nucleotidyltransferase (cca) [Methanococcus jannaschii] 


5.9 


2072 


13186342 


gi|13186342|gb|AAK15384.1| (AF211134) valyl-tRNA 
synthetase [Carsonella ruddii] 


2.8 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


2075 




gi|6225820|sp|O04226|P5CS_ORYSA DELTA 1- 
PYRROLINE-5 -C ARB OXYLATE SYNTHETASE , 
(P5CS) [INCLUDES: GLUTAMATE 5-KINASE 
(GAMMA-GLUTAMYL KINASE) (GK); GAMMA- 

/~»T T PT A TV JTV7T TVT TAOnT T A TT3 "D T7TVT T^T A CC /#TD"D\ 

GLUTAMYL PHUbrriAlb KLJL>UL,lAoJb (vjrK) 
(GLUTAMATE-5-SEMIALDEHYDE 

SEMIALDEHYDE DEHYDROGENASE)] pir||T03695 
delta 1 pyrroline-5 -carboxylate synthetase - rice 
dbj|BAA19916.1| (D49714) deltal-pyrroIine-5- 

rarhnwlatf* cvmtlietTWfi FOrvza ^atival 

LaiuuAviaLv jjjf m.i ivuiov [v/i y *jcl oauvuj 


9.4 


2077 


4028153 


gi|4028153|gbIAAC96117.1| (AF083221) putative 
neurotransmitter receptor [Takifugu rubripes] 


2 


2080 


4902680 


gi|4902680iemb|CAB43550.1| (AL031673) dJ694B14.3 
novel haloacid dehalogenase-like hydrolase family 
protein similar to (archaea) bacterial proteins) [Homo 
sapiens] 


2.9 


2084 


6322760 


gi|6322760|ref|NP_012833.1| Ykl090wp 
[Saccharomyces cerevisiae] sp|P36O75|YKJ0_YEAST 
rl Y FU 1 rib 1 1CAL jU.y JsXi rKiJliiiJN UN JdUJJZ- 
MIF2 INTERGENIC REGION pir||S37915 
hypothetical protein YKL090w - yeast (Saccharomyces 
cerevisiae) emb|CAA81928.1| (Z28090) ORF 
YKL090w [Saccharomyces cerevisiae] 


2.8 


2085 


DOJJUOt 


gi|6635084|emb|CAB64573.1| (AL135930) hypothetical 
nrntpin T4738 02 FLeishmaTiia maiorl 


3.6 


2086 


7487726 


gi|7487726|pirHT05814 hypothetical protein T5K18.90 - 
Arabidopsis thaliana emb|CAA18618.1| (AL022580) 
hypothetical protein [Arabidopsis thaliana] 
pmhirAR78933 11 (AL1 615501 hvoothetical nrotein 

Cllll/|k//uJ / %j^sms+Jm x | irujiuiJ^vy t^i wen. ^/ivwm* 

[Arabidopsis thaliana] 


5.9 


2089 


13385468 


gi|13385468|refINP_080247.1| RDCEN cDNA 
2900001A12 gene [Mus musculus] dbj|BAB28377.1| 
(AK012645) putative [Mus musculus] dbj|BAB28865.1 
(AK0 13457) putative [Mus musculus] 


4E-11 


2090 


14742770 


gi|14742770|ref|XP_039393.1| KIAA1550 protein 
[Homo sapiens] 


3.2 


2092 


2982251 


gi|2982251|gb|AAC32113.1| (AF051208) putative RNA 
binding protein [Picea mariana] 


7.6 
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SEQ ID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


2095 


12718478 


gi|12718478|emb|CAC28807.1|(AL513466) 

liy JJ WULIL/LIV^CWL |J1 VJLwlii. [1 1 Will V/OptU d 01<*ODC*J 


3 1 

•J . X 


2096 


5532964 


gi|5532964|gb|AAD44957.1| (AF156655) MHC class I 
leavy chain [Ambystoma mexicanum] 


9.3 


2097 


1711658 


gl|l / 1 lOJcSjSplJrJH- / V / | 1 IU_JVlUUoli o rAsJ x xlK-lvIL/Jtl 

PROTEIN T10 IN DGCR REGION pir||S37488 gene 
Mus musculus] 


4.3 


2098 


14193393 


gi| 1 4 1 93393 |gb|AAK5595 3 . 1 |AF268062_2 (AF268062) 
RNA polymerase beta-prime subunit [Candidates 
Carsonella ruddii] 


2.1 


2099 


7243081 


gi|7243081|dbjIBAA92588.1| (AB037771) KIAA1350 
protein [Homo sapiens] 


7E-97 


2100 


1 1 J J / 1 O 1 


gi|l 1357181|pir||T49996 AtAGP4 - Arabidopsis 
thaliana gb|AAC77826. 1| (AF082301) arabinogalactan- 
protein [Arabidopsis thaliana] 
gb|AAD38870.1|AF060874_l (AF060874) AtAGP4 
[Arabidopsis thaliana] emb|CAB89400. 1| (AL353995) 
AtAGP4 [Arabidopsis thaliana] 
gb|AAK49601.1|AF372885_l (AF372885) 
Ai z>giU43U/r izjd i / _z/u [Araoiciopsis xnauanaj 
gb|AAK68734.1| (AY042794) AtAGP4 [Arabidopsis 
iiia.ii an aj 




2103 


74Q7^f>Q 


gi|7497369|pir||T32512 hypothetical protein C44B12.4 - 
Caenoihabditis elegans gb|AAB88327.1| (AF036692) 
Hypothetical protein C44B12.4 [Caenorhabditis 


6.7 


2109 


14485227 


gi|14485227|gb|AAK62977.1|AF384372_3 (AF384372) 
surface antigen [Hepatitis B virus] 


5.7 


2111 


071 i Rf/y 


gi|y / 1 looz|d.Dj|i5Ai>u /ioo. i| putative 
extensin-like protein [Oryza sativa] dbj|BAB33013.1| 
(AP003 118) putative extensin-like protein [Oryza 
sativaj 


10 


2114 


9759203 


gi|9759203|dbj|BAB09740.1| (AB015476) heat shock 
transcription factor HSF30-like protein [Arabidopsis 
thaliana] 


8.4 


2115 


204070 


gi|204070|gb|AAA41 130.1| (M22030) electron transfer 
flavoprotein alpha-subunit [Rattus norvegicus] 


0.75 


2117 


3638957 


gi|3638957|gb|AAC36301.1| (AC004877) sco-spondin- 
mucin-like; similar to P98 167 (PID:gl7 11548); details 
of intron/exon structure uncertain [Homo sapiens] 


8.1 
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SEQBD 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


2121 


14250436 


gi| 14250436|gblAAH08653. 1|AAH08653 (BC008653) 
Similar to LRP16 protein [Mus musculus] 


9.3 


2126 


14760522 


gi|14760522|reflXP_038798.1| 7371 [Homo sapiens] 
cblAAK01445 11 CAF334585') NIR3 IHomo saoiensl 


5E-39 


2134 


3024944 


gi|3024944|sp|Q583661Y956_METJA 

iirUl xxti 1 ICAI-r .rivw I HITN 1V1 J U7jO pir| |J-/Of *f 1 7 

hypothetical protein MJ0956 - Methanococcus 
jannaschii gb|AAB98969.1| (U67539) M. jannaschii 
predicted coding region MJ0956 [Methanococcus 
jannaschii] 


8.4 


2135 


6606266 


gi|66062661gb|AAF19148.1|AF158634_l (AF158634) 
Vrgal [Aegilops ventricosa] 


7-1 


2137 


1729878 


72987 5|Sp|F5441U|lCrxl_llil lrl 1 -t^UJYLr 
PROTEIN 1, ETA SUBUNTT (TCP-1-ETA) (CCT- 
til AJ pu]|o / lo^o x-compiex protein i ineia cnain - 
Tetrahymena thermophila (fragment) gb|AAC47007. 1| 
(U46028) CCTeta [Tetrahymena thermophila] 
prf||2209286B chaperonin CCT-eta [Tetrahymena 
thermophila] 


4.4 \ 


2141 


3688193 


m!3688lQ3)emblCAA08995 II (AJ010091) MAP3K 
alpha 1 protein kinase [Brassica napus] 


6.9 


2142 


7491910 


gi|7491910|pir||T41367 hypothetical protein 
SPCC4G3.09c - fission yeast (Schizosaccharomyces 
pombe) emb|CAB09776.1| (Z97052) hypothetical 
protein [Schizosaccharomyces pombe] 


3.3 


2144 


6513832 


gi|6513832|gb|AAF14807J|AF197815_l (AF197815) 
maturase [Alisma plantago-aquatica] 


1.5 


2148 


6323084 


giJ6323084lreflNP_013156.il transcription factor, 
probable member of histone acetyltransferase SAGA 
complex; Spt8p [Saccharomyces cerevisiae] 
QniP^8Q151SPT8 YFAST TRANSCRIPTION 
FACTOR SPT8 pir||S47898 transcription fector SPT8 - 
yeast (Saccharomyces cerevisiae) gb|AAA53585.1| 
(M94955) transcription factor [Saccharomyces 
cerevisiae] emb|CAA64302.1| (X94607) transcription 
fector [Saccharomyces cerevisiae] emblCAA97585.1| 
(Z73227) ORF YLR055c [Saccharomyces cerevisiae] 


5.8 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


2151 


1074219 


Dili 074.9 1 Qlr*irllQ409^0 lv<rrkn+hotir>'i1 rtTV\t<^in 9 

gijiu /H*z,ir'|pir||o £ fv'Zjy nypoineucai protein z 
(capsulation locus) - Haemophilus influenzae (strain 
RM107) 


4 


2153 


10046710 


gi|10946710|ref|NP_067350.1| Rhesus blood group- 
associated B glycoprotein; Rh type B glycoprotein [Mus 
musculus] gb|AAF19371.1| (AF193808) Rhtype B 
giycupioLcui i^xvxus ruubcuiusj 




2159 


9625644 


gi|9625644|reflNP_039895.1| BDLF2 late reading frame 
[Human herpesvirus 4J sp|P03223|BDJL2_JSBV 
PROTEIN BDLF2 pir||QQBE44 BDLF2 protein - 
human herpesvirus 4 (strain B95-8) emb|CAA24836. 1| 
(V01555) BDLF2 late reading frame [Human 
herpesvirus 4] 


3.7 


2160 


220578 


/ ojUDj|i3Air\uu*r*t / . i j ^ijuuj / uj open reading 
frame (25 1 AA) [Mus musculus] 


4.7 


2165 


9633076 


gi|9633076|refINP_050182.1| B4 [Human herpesvirus 
6B] pir||T44148 hypothetical protein B4 [imported] - 
human herpesvirus 6 (strain Z29) 

herpesvirus 6B] 


0.057 


2168 


1 ^SR^K 

A JJOjO 


gi|135838|sp|P01267|THYG_BOVIN 
THYROGLOBULIN PRECURSOR pir||UIBO 
tnyrogioouiin precursor - oovine emo|L/AAzo .>o4. 1 1 
(X02815) thyroglobulin precursor [Bos taurus] 
piL\\i L\j27Z* t T\Jx\. myrugiouuim [JDUb uiuxu&j 


z 


2173 


1 1 JUUlJt 


gi|l 1360.1 54|pir||T46337 hypothetical protein 
DKFZp43402413.1 - human (fragment) 
emb|CAB70664.1| (AL137265) hypothetical protein 

rT-T rvtnr* cunipncl 
^1 XUlllAJ odpiCIlbj 


^ 7 


2177 


181400 


gi|181400|gb|AAA35748.1| (M34225) cytokeratin 8 
[Homo sapiens] 


7E-53 


2181 


11347010 


gi|l 1347010|pir||B81303 probable membrane protein 
Cj 1013c [imported] - Campylobacter jejuni (strain 
NCTC 11168) emb|CAB73269.1| (AL139077) putative 
membrane protein [Campylobacter jejuni] 


o.i ! 


2182 


8745261 


gi|8745261|gb|AAF78857.1|AF134516_l(AF134516) 
VP4 [Banna virus] 


6 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


2183 




gi|285275|pir||A43963 envelope glycoprotein G(envelope 
glycoprotein Gl, envelope glycoprotein G2) - 
Hantavirus sp.=Puumala virus gb|AAB22506.1| 
envelope glycoprotein G(envelope glycoprotein Gl, 
envelope glycoprotein G2) [Hantavirus sp =Puumala 
vim 9 Hallnas strain Peotide 1 148 aal 


0.6 


2186 


7506378 


gi| /dkjod /o|pir|| iz^yoy nypoinencai protein jcvu / a^. j - 
Caenorhabditis elegans emb|CAA91763.1| (Z67756) 

r>TYNTA T7CT i.jlr&'Xt*] C\ ^ /wmf»c "frY-vm tViic cpnfv— oPiN^ A 
CJ-JINA £jl yivOjClv.J tHJIllCb xIUIIl Uiio g&U.C' VJ-/l^l 

EST yk63el0.3 comes from this gene [Caenorhabditis 
elegans] 


8.9 


2187 


4511976 


gi|4511976|gb|AAD21536.1| (AF088896) unknown 
[Zymomonas mobilis] 


0.65 1 


2188 


T 476 1X47 


gi|14761847|ref|XP_017198.2| hypothetical protein 
FT ,T 19085 fHomo saniensl 


3E-13 


2192 


5835478 


gipo3D^ / ojreijlNJr UUo*fUH-. 1|v^ I lJO^JOH- ID cyiociiroinc 

b [Balanoglossus carnosus] pir||T11138 ubiquinol— 
cytochrome-c reductase (EC 1.10.2.2) cytochrome b - 
acorn worm mitochondrion gb| AAD 1 1 95 1 . 1 1 
(AF051097) cytochrome b [Balanoglossus carnosus] 


0.95 


2195 


14906463 


mll4906463!ffb!AAK72690 11 <AY039648i transcriDtion 
factor Rel 1 [Crassostrea gigas] 


9.3 | 


2196 


6680964 


gi|6680964|refiNP_03 1758.1| procollagen, type XVII, 
aipna 1 iivius muscuiusj pir||i\ t tov/j.j duhuui> 
pemphigoid antigen, BPAG2, type XVII collagen alpha 

- mnnQp ahlAAA37443 11 fL08407^ collaeen 

type XVII [Mus musculus] 


6.8 


2197 


T)C\(.(i'3. 1 
/ iUDDj 1 


gi|7206631|gb|AAF39790.1| (AC006631) Hypothetical 

nmtpiti P97R^ 7 rf^apnnrhahdrli^ eleoransl 


4.3 


2201 


2564679 


gi|2564679|gb|AAB81836.1| (AF023484) putative KP78 
protein kinase [Drosophila melanogaster] 


0.83 


2204 


9558143 


gi|9558143|emb|CAC00269.1| (AL160371) possible 
fl6d3.1 protein [Leishmania major] 


5.3 


2205 


13813146 


gi|13813146|gb|AAK40384.1| (AE006643) ATP- 
dependent helicase [Sulfolobus solfataricus] 


4.3 


2206 


7662168 


gi|7662168|reflNP_055497.1| KIAA0535 gene product 
[Homo sapiens] dbj|BAA25461.1| (AB011107) 
KIAA0535 protein [Homo sapiens] 


0.0003 
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SEQJD 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


2214 


9229890 


mlQIOQSQftlrlkirR ARftD£1 8 1 1 ( A Rn"?fii!4 1 "\ nrirVlp 0 
E^|7ZZ707U|UDJ|JDx\BuUOIo. 1| \J\B\J3\jo'* I ) pxTCisJC Z 

Ciona intestinalis] 


4.4 


2217 


3435174 


gi|3435174|gb|AAC32342.1| (AF061251) O antigen 
flippase Wzx [Escherichia coli] I 
gb|AAG57097.1|AE005429_8 (AE005429) O antigen 
flippase Wzx [Escherichia coli 0 157 .H7 EDL933] 


2.5 


2225 


5725923 


gi|5725923|gb|AAD48242. 1|AF089987_1 (AF089987) 
four-loop conotoxin ABVIF [Conus abbreviatus] 
gb|AAD48243.1[AF089988_l (AF089988) four-loop 
conotoxin ABYEF [Conus abbreviatus] 
gb|AAD48244.1|AF089989_l (AF089989) four-loop 
conotoxin ABVIF [Conus abbreviatus] 
gb|AAD48245.11AF089990_l (AF089990) four-loop 
conotoxin ABVIF [Conus abbreviatus] 


3.8 


ZZZo 


1705523 


gill705523|sp|P52650|C24A_PIG CYTOCHROME B- 
245 LIGHT CHAIN (P22 PHAGOCYTE B- 
CYTOCHROME) (NEUTROPHIL CYTOCHROME 
B, 22 KD POLYPEPTIDE) (P22-PHOX) 
(CYTOCHROME B(558) ALPHA CHAIN) 
(SUPEROXIDE-GENERATING NADPH OXIDASE 

T ir^LTT PlIATM CTTOTTVnT\ <*UI A A A f.Af^'X^ 11 fI1(Y)A.Tl\ 

JLlCxrll CHAIN isUrJUlNll ) gD|AAA040^D.l| \\J\JZ i H 1) 

NADPH oxidase light chain subunit [Sus scrofa] 


0.48 


2240 


730885 


gi|730885|sp|P07989|TlM_SALPO TYPE I 
RESTRICTION ENZYME STYSPI M PROTEIN 
(M.STYSPI) gb|AAA27143.1| (L02507) restriction- 
modification enzyme type I M subunit [Salmonella 
enterica] 


4.9 


2241 


6753572 


gi|6753572|ref|NP_034126.1| cytochrome P450, 24 
[Mus musculus] sp|Q64441|CP24_MOUSE 
CYTOCHROME P450-CC24, MITOCHONDRIAL 
PRECURSOR (P450-CC24) (VITAMIN D(3) 24- 

TTtrrvnAVlTT A fir\ /I C T*vTT T\.7T~\"T> t~X^J~\.T*\. TT r T^ A TV ifTNT 

HYDROXYLASE) ( 1 ,25-DlHYDKUX Y VITAMIN 
D(3) 24-HYDROXYLASE) (24-OHASE) pir||S60033 
25-hydroxyvitamin D3 24-hydroxylase precursor - 
mouse dbj|BAA08416.1| (D49438) 25-hyckoxyvfcunin 
D3 24-hydroxylase precursor [Mus musculus] 
dbj|BAA2 1843.1| (D89669) vitamin D-24-hydroxylase 
[Mus musculus] 


9.7 
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SEQID 
NO 


ACCESS 

"NT 

IN 


DESCRIPTION 


P VALUE j 


2243 


14^78900 


gi|14578299|gb|AAF99465.1|(AY003872) 

PVIWIdOQS P rPlpQTYinHinm viv^vl 
j. v ihitu/J x | ± igjiimuiuni vi vcia.j 


3.7 


2245 


1916617 


gi|1916617|gb|AAB5 1194.1| (U66003) ADAM 13 
Xenopus laevis] 


0.45 


2249 


D / JZ7\) ID 


gi|5739073|gb|AAD50327.1| (AF063693) type Xffl 

CUlid^Cll ^IViUo iilLlal>LLiU.oJ 


2.5 


2254 


O ijJOl I 


gi|3 13561 l|gb|AAC29067.1| (AF062485) cellulose 


3.1 


2256 


Zo IDO" 


gi|281689|pir||S27657 hypothetical protein 1 - 

vniZODlUITl meiUOU gDJ/V/V/\ZOZJ^.l| ylyiy'tKJOJ) nui 

aomologous to known sequences as of 2/92; ORF1; 

rMi+5j+4x/p» rQtnrvrVii7i'Yl"irii'm inftlil frill 

J Li Lit LiVC j O lllvjl lll-^w Ul Lil 1 1 lllwliiUUJ 


6 


2257 




gi|10434352|dbj|BAB14232.1| (AK022759) unnamed 

TM*r»+^iti nrnHnrt rHrvtYifi Qarnfin«5l 


2E-23 


2258 




gi|1504022|dbj|BAA13210.1| (D86974) KIAA0220 

JTlAJlllL) is<ipiCliaJ 


4E-21 


2266 


1871176 


gi|1871176|gb|AAB63536.1| (U90439) unknown protein 
[Arabidopsis thaliana] 


7.5 


2270 


12846015 


gi|12846015|dbj|BAB26996.1| (AK010513) putative 
[Mus musculus] 


8 


2273 


12860337 


gi|12860337|dbj|BAB31923.1| (AK019929) putative 
[Mus musculus] 


5 


2276 




gi|994736|gb|AAA75561.1| (M18327) LacOPZ-alpha 
peptide from pUC9; putative [unidentified cloning 
vector] gb(AAA75563.1| (M18328) LacOPZ-alpha 
peptide from pUC9; putative [Cloning vector pBGS9+] 
gb|AAA75565.1| (M18329) LacOPZ-alpha peptide from 

nTTPO* nntativp fflnninjy vector iVRCtS9-1 


0.00002 


2280 


11352313 


gi|l 13523 13|pir||G83376 probable trehalose synthase 
x^azidz [lmporteaj - irseuaomonas aeruginosa ^surain 
PAOl) gb|AAG05540.1|AE004642_7 (AE004642) 
probable trehalose synthase [Pseudomonas aeruginosa] 


2 


2285 


7486992 


gi|7486992|pir||T00831 hypothetical protein T13L16.5 - 
Arabidopsis thaliana gb|AAD20114.2| (AC006201) 
hypothetical protein [Arabidopsis thaliana] 


1.4 


2286 


10173203 


gi|10173203|dbj|BAB04308.1|(AP001509) 

BH05 8 9~unknown conserved protein in others [Bacillus 

halodurans] 


7.3 
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SEQID 

! NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


2289 


7497R57 


gi|7497857|pir||T20180 hypothetical protein C53B4.4a - 
Caenorhabditis elegans emb|CAA92457.1| (Z68215) 
contains sinulanty to Pram domain: FrOOliU (Pnorbol 
esters/diacylglycerol binding domain (CI domain)), 
Score=12.8, E-value=0.0015, N=l; PF00595 (PDZ 
domain (Also known as DHR or GLGF).), Score=34.8, 

F-va1iip=6 4e-07 N=l — cTYNA V<\T PFlVKPfifiF mmp> 

£1 VwlUw u ,*tC u / ? 1>~1 OXJiNrV JuiJ 1 v^JCrlY10V_^tltIJ7 CvUIlC-^ 




2291 




gi|6424831|gb|AAF08166.1| (AF130210) NADH 

UCtiyUl^J^wlUtov oUUUllll JT |_UIipallCIlb U1J.1L/Iclj j 


yj . 1 


2292 


7489002 


gi|7489002|pir||T07021 extensin-like protein DiflO 
precursor - xoniaio ^rragnienij cniD}L/Ar\D jold.i\ 
(X9945 1) extensin-like protein DiflO [Lycopersicon 
esculentum] 


5.6 


2294 


1684828 


gi|1684828|gb|AAB36537.1| (U77681) tyrosine kinase 
receptor [Xenopus laevis] 


2 


2297 


12859724 


gi|12859724|dbj|BAB31753.1| (AK019486) putative 
[Mus musculus] 


0.0003 


2301 


11498284 


gi|11498284|ref]NP_069510.1| adenylate kinase (adk) 
[Archaeoglobus fulgidus] sp|029581|KAD_ARCFU 
ADENYLATE KINASE (ATP-AMP 
iivAiSoFriuoJrJriLrK.x LAoii^ pir||Uoy3j4 adenylate 
kinase (EC 2.7.4.3) - Archaeoglobus fulgidus 

crhIA A ROOSTS 11 ^APnOin^S^ aH^vlatp Lrmscp- /'orfVl 

[Archaeoglobus fulgidus] 


3E-13 


2309 




gi|14725330|ref]XPJ)02254.2| mitochondrial 
translational initiation factor 2 precursor {Homo 

adpidlbj 




2313 


79651 


gi|79651|pir||A30189 iron stress-induced hypothetical 
nrotein nrecursor - Svnechococcus sr> 


2.8 


2314 


7500007 


gi|7500007|pir||T16186 hypothetical protein F27D9.4 - 
Caenorhabditis elegans gb|AAA93383.1| (U49829) 
Hypothetical protein F27D9.4 [Caenorhabditis elegans] 


7.1 


2317 


13517833 


gi|13517833|gb[AAK29011.1| (AF344620) long- 
wavelength rhodopsin [Ceratina calcarata] 


3.4 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQBO 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


2318 


747fc££1 


gi|7478661|pir||F70662 probable plcC protein - 
Mycobacterium tuberculosis (strain H37RV) 
emb|CAB06 146. 1| (Z83860) plcC [Mycobacterium 
tuberculosis] gb|AAK46707.1| (AE007081) 
jhospholipase C [Mycobacterium tuberculosis 

l-> J 1J 


4.4 


2323 


it / oozz / 


gi|14768227|refpCP_012121.3| purinergic receptor P2X, 

io-smd-o-jitpd inn rhnnnpl 7 rHYvmn ^anietiQl 


4.3 


2328 




glj/o^UjZ|pirj|oZZf jo nycuoxyprounc-nt/ii g iy vxjpi ulcih 
perennial teosinte emb|CAA455 14. 1| (X64173) 
ivHmwnrnliTip-rirH olvcnrirnteiri TZea dinlooerennisl 


5.4 


2329 


10241645 


gi|10241645|emb|CAC09484.1| (AL442113) putative 
protein [Oryza sativa] 


9.1 


2330 




gi|14733085|ref|XP_p03575.3| soluble liver antigen/liver 

nunprpa c ^itHctpti n-Tonm QaniRTi*?! 
JcUlwiC<i«3 el migwii |xxisiiix/ acijjA^/XJLoj 


1E-17 


2335 


12188796 


gi|12188796|emb|CAC21494.1| (AJ278866) MchF 
protein [Escherichia coli] 


4.7 


2340 


2492604 


gi|2492604|splP78595|CDR2_CANAL MULTIDRUG 
PFQTQTATsJPF PROTFTN fT>R9 <*hlAAB96797 11 
(U63812) drug resistance protein 2 [Candida albicans] 


6.4 


2345 


1jjOjjD7 


™HQ'3£^£Q!HkilT} AR^QI 14 11 / , AP0n9RQ7^ hvnr»thpHra1 

protein-similar to Oryza sativa chromosome 1, 


1.8 


2346 


1^4619 


gi|1334612|emb|CAA41034.1| (X57968)nadl [Triticum 

dCo UV lull] 


4.8 


2351 


1 ^4£1 9 


gi|1334612|emb|CAA41034.1| (X57968)nadl [Triticum 

dCo U V UillJ 


4.8 


2352 


1334612 


gi|1334612|emb|CAA41034.1| (X57968)nadl [Triticum 
aestivum] 


4.6 


2353 


1334612 


™I1 ^jKI Olomhir* A A41 (YXA 1 1 rV^7Q65tt naHl rTritirnm 

aestivum] 


4.8 


2356 


1334612 


mil ^/1A1 OlomhlP A A41 (1*54 1 1 r5f^7Q65tt naHl fTritinim 

aestivum] 


4.4 


2357 


14250644 


oil 1425 0644leblAAH08786 1IAAH08786 (BC008786) 
integrin, alpha 5 (fibronectin receptor, alpha 
polypeptide) [Homo sapiens] 


2 


2367 


7293054 


gi|7293054|gb|AAF48440.1| (AE003498) Topi gene 
product [Drosophila melanogaster] 


1.2 


2369 


13959344 


gi|13959344|sp|P82957|DM43_DIDMA VENOM 
METALLOPROTEENASE INHIBITOR DM43 


8.2 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


2371 


9082017 


gi|9082017|gb|AAF82689.11AF227196_l (AF227196) 
ORFla polyprotein [gill-associated virus] 


7.8 


2374 


7498863 


gi|7498863|pir||T20730 hypothetical protein F10G8.8 - 
Caenorhabditis elegans emb|CAB02286.2| (Z80216) 
Weak similarity with intermediate filament protein 
(TREMBL id G633240), contains similarity to Pfem 
domain: PF00169 (PH domain), Score=79.3, E- 
value=2.7e-21, N=2~cDNA EST EMBL:T01262 comes 
from this gene~cDNA EST yk23d5.3 comes from this 
gene~> emb|CAA19441.2| (AL023823) Weak similarity 
with intermediate filament protein (TREMBL id 
G633240), contains similarity to Pfem domain: 
Fr uu i by (r ri domain ocore— / y . 3 , n-vaiue— z . / e-z i , 
N=2~cDNA EST EMBL:T01262 comes from this 
gene-cDNA EST yk23d5.3 comes from this gen> 


2.5 


2375 


321514 


gi|321514|pir||S27931 Env/v-mpl fusion protein - 
myeloproliferative leukemia virus gb|AAA77654.1| 
\iyxO\jjD\j) env.v-mpi rusion protein |iviyeioproiueraiive 
leukemia virus] 


2.8 


2376 


11348760 


gi|11348760|pir||C83635 hypothetical protein PA0086 
[imported] - Pseudomonas aeruginosa (strain PAOl) 

rrKI A ACM\W7& 1 1 A T7 C\C\AAA.l 0 ( AT*{\C\AA.A1\ Vix/r»/\th**tif*a1 

gDjA/vouJ'f /o. i\j\j2i\j\J i t £ f t t i y \i\i!aj\j*+'* £ * i ) nypomeucai 

protein [Pseudomonas aeruginosa] 


4.6 


2377 


742234 


gi| /4zzJ4|prt||zuuyj i / a a&oA upstream ukj* 
[Acetobacter (subgen. Acetobacter) aceti] 


8.2 


2379 




gi| 1 1 466 1 o y |rei|!Nr'_0665 1 1 . 1 1 apocytocnrome b 
[Naegleria gruberi] gb|AAG17790.1|AF288092_15 
\j\FZoo\)y/.) apocyrocnrome o LiNaegiena gruuenj 




1 2380 


8250181 


gi|8250181|emb|CAB93524.1| (AJ271740) D-Titin 
[Drosophila melanogaster] 


4.2 


2383 


5870848 


gi|5870848|gb|AAB03857.2| (U31864) stearyl-CoA 
desaturase rCvnrinu^ caroiol 


7.7 


2387 


4063751 


gi|4063751|gb|AAC98459.1| (AC005851) putative ABC 
transporter [Arabidopsis thaliana] 


7.6 


2389 


7504822 


gi|7504822|pir||T33371 hypothetical protein H02F09. 1 - 
Caenorhabditis elegans gb|AAC64621.1| (AF077538) 
unknown [Caenorhabditis elegans] 


3.6 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


2390 


15011889 


mil SO1 1 RRQIrpfTWP 077? S 1 1 1 r»w<rfpro1 hindine nrotein 
2 [Mus musculus] emb|CAC16404.2| (AJ278263) 
oxystyrol-binding protein homologue 1 [Mus musculus 
domesticus] 


1.9 


2393 


450730 


gi)450730|emb|CAA50838.1| (X71982) ORP jl8L; 
potential membrane spanning region; potential 
glcosylation site [African swine fever virus] 


2.5 


2398 


7445990 


gi|7445990|pir||G72290 branched chain amino acid ABC 
transporter, ATP-binding protein - Thermotoga 

maritimn Strain MSRJtt ohlAAD36215 1IAE001771 8 

(AE001771) branched chain amino acid ABC 
transnnrter ATP -binding nrotein IThermotOCa 
maritima] 


7.4 


2401 


11990448 


gi|11990448|dbjlBAB19782.1| (AB052747) vascular 
ppll adhesion molecule- 1 6D variant lackins D7 fBos 
taurus] 


4.6 


2402 


12841678 


gi|12841678|dbj|BAB25308.1| (AK007856) putative 
rMu^ musculusl 


1E-39 


2403 


5052967 


gi|5052967|gb|AAD38786.1|AF151533_l (AF151533) 
polyketide synthase (Nodulisporium sp. ATCC74245] 


6.3 


2407 


8099350 


gi|8099350|gblAAF72105.11AF154847_l (AF154847) 
33 kDa Vamp-associated protein [Homo sapiens] 


5.2 


2410 




gl|267344[Sp|x J 2y /y 1| VLflwr_r>xvo VA r UalUlN 
GLYCOPROTEIN PRECURSOR [CONTAINS: 

"FTTQTfYM clj VP OPT? OTF TTvJ F9- FTTCTON 

GLYCOPROTEIN Fl] piii|VGNZBA cell fusion 
glycoprotein precursor - bovine respiratory syncytial 
virus (strain A5 1908) gb|AAA42804.1| (M82816) 
■fii^inn nrotein F ovine resoiratorv svncvtial virusl 


1.4 


2417 


12856615 


gi|12856615|dbj|BAB30727.1| (AK017396) putative 
[Mus musculus] 


8E-49 


2425 


11347010 


gi|11347010|pir||B81303 probable membrane protein 
Cj 1013c [imported] - Campylobacter jejuni (strain 
NCTC 11168) emb|CAB73269.1| (AL139077) putative 
membrane protein [Campylobacter jejuni] 


0.89 


2426 


7499991 


gi|7499991|pir||T29526 hypothetical protein F27C1.7 - 
Caenorhabditis elegans gb|AAB37654.1| (U80441) 
Hypothetical protein F27C1.7 [Caenorhabditis elegans] 


3.1 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 




2498043 


gi|2498043|sp|Q57568|Y104_METJA 
HYPOTHETICAL ATP-BINDING PROTEIN MJ0104 
pir||H64312 probable DNA helicase MJ0104 - 
Methanococcus jannaschii gb|AAB98084. 1| (U67467) 
DNA-binding protein, probably DNA helicase 
[Methanococcus jannaschii] 


1.5 


2430 


7494263 


gi|7494263|pir||T18488 hypothetical protein C0825c - 
malaria parasite (Plasmodium falciparum) 
emb|CABT 1 127. 1| (Z98551) putative cleavage and 
polyadenylation specificity factor protein [Plasmodium 
falciparum] 


2.3 




493224 


gi|493224|dbj|BAA03434.1| (D14581) fatty-acid 
desaturase [Anabaena variabilis] 


9.1 


2438 


6755468 


gi|6755468|ref(NP__036019.1| septin 3 [Mus musculus] 
sp|Q9Z 1 S5 |SEP3_MOUSE NEURONAL-SPECIFIC 
SEPTIN 3 gb|AAD02884.1| (AF10441 1) neuronal- 
specific septin 3 [Mus musculus] 


0.59 


2440 


9967295 


gi|9967295|dbj|BAB12347.1| (AB047936) hypothetical 
protein [Macaca fascicularis] 


0.063 


2447 


12847975 


gi|12847975|dbj|BAB27780.1| (AK011690) putative 
[Mus musculus] 


7E-65 


2451 


1730946 


gi| 1730946|sp|P50833|YPPE_BACSU 
HYPOTHETICAL 14.5 KDA PROTEIN IN PONA- 
COTD INTERGENIC REGION pir||B69940 
hypothetical protein yppE - Bacillus subtilis 
gb|AAB38463.1| (L47838) putative [Bacillus subtihs] 
emb|CAB14144.1| (Z99115) yppE [Bacillus subtUis] 


9.3 


2452 


401192 


gi|401192|sp|P30975|TLR2_DROME TACHYKININ- 
LIKE PEPTIDES RECEPTOR 99D (DTKR) 
pir||S 17783 tachykinin receptor homolog DTKR - Suit 
fly (Drosophila melanogaster) emb|CAA44595.1| 
CX6271 1) receptor for tachykmin-like peptides 
[Drosophila melanogaster] 


4.8 


2453 


14768202 


gi|14768202|reflXP_018137.2| LI cell adhesion 
molecule precursor (Homo sapiens] 


0.11 



521 



3NSDOCID: <WO 0214500A2_I_> 



WO 02/14500 



PCT/US01/25840 





Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


2457 


141028 


gi|141028|sp|P04540|NU5M_TRYBB NADH- 
UBIQUINONE OXIDOREDUClAbl} CHAHN d 
pir||QQUTC5 NADH dehydrogenase (ubiquinone) (EC 
1 .6.5.3) chain 5 - Trypanosoma brucei mitochondrion 
gb|AAB59225.1| (M14820) NADH dehydrogenase 
subunit 5 [Trypanosoma brucei] emb|CAB57807. 1) 
(X01094) unidentified reading frame 10 [Trypanosoma 
brucei] 


3.6 


2461 




gi|7499039|pir||T20867 hypothetical protein F13H10.5 - 
Caenorhabditis elegans emb|CAA92956.1| (Z68748) 
contains similarity to Pfam domain: PF01663 (Type I 
phosphodiesterase / nucleotide pyrophosphatase), 
Score=512.3, E-value=l.le-150,N=l [Caenorhabditis 
elegans] emb|CAA15977.1| (AL021176) contains 
similarity to Pfam domain: PF01663 (Type I 
phosphodiesterase / nucleotide pyrophosphatase), 
Score=512.3, E-value=l.le-150, N=l [Caenorhabditis 


9.7 


2462 


3023956 


gi|3023956|spJQ00808|HETl_PODAN 

V EtKjE* 1 A 1 1x5 J_/JD UN ^^IVJLt /\ 1 LDLLil 1 I Jr JVU J. JCOTN 

HET-E-1 pir||T18521 beta transducin-like protein - 
Podospora anserina gb|AAA85775.1| (L28125) beta 
transducin-like protein [Podospora anserina] 


8.7 


2463 




gi|14972564|gb)AAK75201.1| (AE007410) glutamine 
omirl/ , *fr!a-nc-fi»r5icp k r»1jiQQ T rSllrpntneoeeii*; nneirmoniael 


0.4 


2470 


13928966 


gi|13928966|ref|NP_113882.1| heat shock factor 2 
[Rattus norvegicus] gb|AAD51329.1|AF172640_l 
(AF 172640) heat shock factor 2 [Rattus norvegicus] 


6E-14 


2473 


4521320 


gi|4521320|dbjlBAA11580.1| (D82816) product is 
unknown [Gallus gallus] 


0.29 


2475 


7507534 


m\7^fi7^d\Wir\\T0dl^R VivnntVipriral nrntein T0QF1 12- 

gll I D\J / J^'rjpiXIJ X Z*rr / JO IlYJJUtXLwUl'CU. yiKJlXsLU. X \J7XsX X 

Caenorhabditis elegans emb|CAB03532.1| (Z81147) 
Similarity to zinc finger proteins, contains similarity to 
Pfam domain: PF00104 (Ligand-binding domain of 
nuclear hormone receptor), Score=14.1, E- 
value=0.00047, N=l; PF00105 (Zinc finger, C4 type 
(two domains)), Score=42.6, E-value=5.5e-12, N> 


6.7 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


2478 


927030 


gi|927030|gb|AAA73871.1| (L13287) kcrB4 gene 
product [Escherichia coli] 


3.5 


2481 


6724309 


gii6724309|gb|AAF26929.1| (AF079967) NADH 
dehydrogenase subunit 4 [Phytomonas serpens] 


7.9 


2482 


6319700 


gi|63 19700|ret]NP_009753. 1| Ybr224wp 
[Saccharomyces cerevisiae] sp|P38320(YB74_YEAST 
HYPOTHETICAL 19.3 KDA PROTEIN IN FAT2- 
MCX1 INTERGENIC REGION PRECURSOR 
pirj|o*roiuu proDaDic niciii Diane proLCin x Dj\zz*tw ~ 
yeast (Saccharomyces cerevisiae) emb|CAA85187.1( 
(Z36092) ORF YBR224w [Saccharomyces cerevisiae] 


2.4 


2496 


7503173 


gi[7503173|pir||T31884 hypothetical protein F41E6.14 - 
Caenorhabditis elegans gb|AAB65962.1| (AF016448) 
weak similarity to several acyltransferases 
[Caenorhabditis elegans] 


9.7 


2498 


11352621 


gi| 1 1 35262 1 |pir||E83 1 87 tetrahydrodipicolinate, 
succinylase PA3666 [imported] - Pseudomonas 
aeruginosa (strain PAOl) dbj|BAA75911.1| 
(AB024601) tetrahydrodipicolinate N- 
succinyletransferase [Pseudomonas aeruginosa] 
gb|AAG07054. 1|AE004786_6 (AE004786) 

+^'l"i"cili\//1t*r»Hi'r^'ir*r»1t'tiJi+f* ciT/VMtTi/lsicf* 1 PcpiiHaiti Ann c 

LCtlCLUyUl UUipilAJJJLUcLU? o UvAy illy 1 do C> |_X ovUU-UlllAJIlClo 

aeruginosa] 


3.3 


2500 


7662214- 

1 UUx.Z< JL*T 


gi|7662214|ref|NPJ)55604.1| KIAA0628 gene product 
[Homo sapiens] ref|XP_005044.3| KIAA0628 gene 
product [Homo sapiens] dbj|BAA31603.1| (ABO 14528) 

TTTA A C\f\J R r»rrrf"f»in rHrwno <sanif*n<5l 

JNJLrVrVV/U^O JJi \J twill \i- JAJlllVJ oapi^uoj 


3 


2506 


7508531 


gi|7508531|pir||T25325 hypothetical protein T26H2.7 - 
Caenorhabditis elegans embICAB04848.1| (Z82055) 
contains similarity to Pfam domain: PF01757 (Domain 
of unknown function) Score— 543 5 E-value=4 6e-160 
N=l [Caenorhabditis elegans] 


9.2 


2507 


14730527 


gill4730527lrefpff_051896.ll phospholipase A2, group 
IVA (cytosolic, calcium-dependent) (Homo sapiens] 
ref|XP_05 1897.1| phospholipase A2, group IVA 
(cytosolic, calcium-dependent) (Homo sapiens] 


5E-26 
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SEQID 
NO 


ACCESS 

JN 


lJJioL/lvlr 1 HJJN 


P VAT TTF 


2510 


138394 


gi|138394|sp|P27330|VHEL LSV PROBABLE 
HELICASE (ORF 2) emb|CAA33398.1| (X15343) 
25kD protein [Lily symptomless virus] 


i n 


2519 


13938116 


gi|13938116|gb|AAH07173.1|AAH07173 (BC007173) 
Unknown (protein for IMAGE:3493 127) [Mus 
musculus] 


8E-22 


2520 


7486371 


gi|7486371|pir||T00661 hypothetical protein F3I6.24 - 
Arabidopsisthalianagb|AAC00591.1| (AC002396) 
Unknown protein [Arabidopsis thaliana] 




2523 


11358141 


gi|l 1358141|pui|T48198 hypothetical protein 
T20L15.40 - Arabidopsis thaliana emb|CAB82747. 1| 
(AL162351) putative protein [Arabidopsis thaliana] 


9.1 


ZDZO 


14485227 


•n a Moroni— L.I A a T/z!onT7 1 1 ATJOO/lOTI O / A "C2 0/1*2 TO \ 

gi|14485227|gb|AAK62977.1|Ar3543 Jl_3 (Ari©43 /z) 
surface antigen [Hepatitis B virus] 


2.6 


ZjZ / 


6594617 


gi|6594617[gb|AAF18559.1|u4235U__l (u4z3oU) 
aminopeptidase [Aplysia californica] 


5.3 


2529 


14193306 


gi|14193306|gb|AAK55890. 1|AF26721 1_2 (AF26721 1) 
ATP synthase gamma subunit [Candidatus Carsonella 
ruddii] 


3.5 




3319680 


gi|3319680|emb|CAA76509.1| (Y 1 /614) IMS protein 
[Medicago truncatula] 


1.5 




13811987 


gi|1381 1987|ref|NP_l 13 1 16. 1| DNA repair helicase 
component of transcription factor b [Guillardia theta] 
gb|AAK39689.1|AF083031_46 (AF083031) DNA 
repair helicase component of transcription factor b 
[Guillardia theta] 


4.8 


2537 


3122601 


gi|3 122601|sp|P93 107|PF20_CHLRE FLAGELLAR 
WD-REPEAT PROTEIN PF20 pir||T08180 PF20 
protein, microtubule-associated - Chlamydomonas 
reinhardtii gb|AAB41727.1| (U78547) PF20 
[Chlamydomonas reinhardtii] 


1.1 


2540 


7460001 


gi|7460001[pir|[G71079 hypothetical protein PH0903 - 
Pyrococcus horikoshii dbj|BAA29997.1| (AP000004) 
141aa long hypothetical protein [Pyrococcus horikoshii] 


6.3 


2541 


12330702 


gi|12330702|gb|AAG52889.1|AF333769_l (AF333769) 
cell recognition molecule CASPR3 [Homo sapiens] 


2E-28 
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SEQID 
NO 


ACCESS 

J/N 


TIFSCPTPTTON 


p VALUE 


2547 


6754652 


gi|6754652|ref|NP_034904.1| methyl-CpG binding 
domain protein 4 [Mus musculus] gb|AAC6S878.1| 
(AF072249) methyl-CpG binding protein MBD4 [Mus 
museums] gbjAAD3o5y3. i|Ar izuyyo_i (Ar izuyyoj 
methyl-CpG binding protein 4 [Mus musculus 
domesticus] 


7.1 




12655370 


gi|l2o3 5 d /U|eniD|UAx>D / J44.j| /us; pncKie pK 
isoform [Drosophila melanogaster] 


7.7 


Zjj 1 


12853631 


^|12o53o31|ClDJ|i>Al5ZyoUU.l| {AJS\)lo555) putanve 
Mus musculus] 


0.61 


2554 


11466216 


gi|1146621o|rei|NFjUoo53y.l| naem lyase [JNaeglena 
gruberi] gb|AAG17817.1|AF288092_42 (AF288092) 
haem lyase [Naegleria gruberi] 


1.5 


ZJJ / 


7321597 


gi|7izl 5y /|gD|AAA^2uyy.z| (LfZoo//^ unicnown 
Tetrahymena pyriformis] 


3 


2560 




gi|7304202|gb|AAF59238.1| (AE003840) CG1602gene 
jroQUCi LL/rosopmia meianogasusr j 


1.6 




336831 


glp30o31|gD|AAoUZZol.l| ^IVlj/yiU^ IN/VJL/rl 

dehydrogenase subunit 2 [Drosophila melanogaster] 


5.5 


2566 


qr i r«5 i 

7515231 


gi|7515231(pir|[T13518 hypothetical protein 29 - 
Bacillus phage phi-105 dbj|BAA36635.1| (AB016282) 
wivr zy [oactenopnage pm- 1 ud j 


7 7 


2568 


12847263 


gi|12847263|dbj|BAB27500.1| (AK011258) putative 
(Mus musculus] 


6.6 


2569 


9964395 


gi|9964395|ref|NP_064863.1| AMV081 [Amsacta 
moorei entomopoxvirus] gb|AAG02787.1|AF250284_81 
(AF250284) AMV081 [Amsacta moorei 
entomopoxvirus] 


2.4 


2573 


12060849 


gi|12060849|gb|AAG48266.1|AF308299_l (AF308299) 
serologically defined breast cancer antigen NY-BR-85 
[Homo sapiens] 


0.0006 


2575 


9633381 


gi|9633381|ref|NPJ)50485.1| DSL protein [variola 
minor virus] pir||H72173 DSL protein - variola minor 
virus (strain Garcia-1966) gb|AAA69395.1| (U18339) 
D4L [Variola virus] emb|CAA50966.1| (X72086) 

fYRT?l*JT • T^l^T in nffatirm P31 WnrirAa viniol 
wivr jl^jLj, jd i ol in ciutuoii |jj |^v<uiui<i vixuoj 

emb|CAB54786.1| (Y16780) D5L protein [variola 

minor virus] 


7.5 


2576 


7407131 


gi|7407131|gb|AAF61923.1| (AF228524) SanE 
[Streptomyces ansochromogenes] 


0.53 


2577 


14760789 


gi|14760789|ref|XP_044332.1| piwi (Drosophila)-like 1 
[Homo sapiens] 


9E-42 
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&±*t\£ ,11/ 

NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


2584 


2764800 


gl|Z / 04oUU|CinD|v^i\/\«>H- 1.3.5.1 1 (-rV/O/JOj IZS glODUIin 

Avena sativa] 


1.4 




12860337 


^11 OO AO OTl^KilXi ATI'S 1 OOO 11 / A FH1 OOOQ^ nii+Qtiiro 

gi|izoou3J /|aDj|jDAi5Jiyzj.i| (AJvUiyyzyj putative 
Mus musculus] 


5.3 


2586 


HH£LSl A QA 


gi|77684o4)emD|CAB90775.1| (AL353o3Z) putative 
mitochondrial carrier protein [Schizosaccharomyces 
pombe] 


2 8 


2587 




gi|12620098|gb|AAG60558. 1| (AF250768) BioA-like 
protein [uncuiturea oacxenurn pi^osroij 




2591 


2281181 


gi|2281181|gb|AAB66275.1| (U58587) maturase 
[Lagoecia cuminoides] 


3.8 


2595 


6573736 


gi|6573736|gb|AAF17656.1|AC009398__5 (AC009398) 
F20B24.10 [Arabidopsis thaliana] 


6.7 


2596 


31155 


gi|31155|emb|CAA24999.1| (X00176) preproenkephalin 
part 1 [Homo sapiens] 


9.5 


2600 


11278033 


gi|11278033|pir||C81832 transferrin-binding protein A 
NMA2024 [imported] - Neisseria meningitidis (group A 
strain Z2491) gb|AAC13726.1| (AF058689) transferrin 
binding protein A precursor [Neisseria meningitidis] 
emb|CAB85243.1| (AL162757) transferrin-binding 
protein A [Neisseria meningitidis Z2491] 


5 


2613 




gi|7493138|pir||T37964 probable ubiquitin ligase - 
fission yeast (Schizosaccharomyces pombe) 
emb|CAB 16714.1| (Z99531) putative ubiquitin ligase 
[Schizosaccharomyces pombe] 


0 21 


261 5 


14625275 


gi|l4ozjZ /D|gD|AAAoU3ou.Z| ^ujyo**4j nypotneucax 
protein T10E10.4 [Caenorhabditis elegans] 


5.3 


2617 


7515479 


gi|7515479|ptr||o7229© hypothetical proteni yl - 
Plasmodium falciparum plastid emb|CAA64588.1| 
(X95276) ORF91 [Plasmodium falciparum] 


0.66 


2619 


14l)yio3D 


gi|14091855|gb|AAK53858.1|AC016781_12 
(ACUlo /o l) riypotneticai protem [uryza sanvaj 




2621 


8393165 


gi|8393 165|ref|NP_035661 ,2| transiently-expressed 

avonal crlvrnnroteiTi PMus muscnlii^l 


0.11 


2624 


6754242 


gi|6754242|ref[NP_034603.1| histidine rich calcium 
binding protein [Mus musculus] 
gb|AAD55250.1|AF158597_l (AF158597) histidine- 
rich Ca2+ binding protein [Mus musculus] 


8.1 


2625 


7959261 


gi|7959261|dbj|BAA96024.1| (AB040933) KIAA1500 
protein [Homo sapiens] 


1E-36 
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cpn TO 

NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


2626 


5508828 


gi|5508828|gb|AAD43995.1| (U59485) AttU 
Agrobacterium tumefeciens] 


0.27 


2627 


3643603 


gi|3643603|gb|AAC42250.1| (AC005395) unknown 
protein [Arabidopsis thaliana] 


4.7 


2630 


5453906 


gi|5453906|ref]NP_0063 10. 1 1 CDP-diacylglycerol- 
inositol 3-phospbatidyltransferase (phosphatidylinositol 
synthase) [Homo sapiens] ref|XP_008065.1| CDP- 
diacylglycerol— inositol 3-phosphatidyltransferase j 
(phosphatidylinositol synthase) [Homo sapiens] 
reflXP_043951.1| CDP-diacylglycerol-inositol 3- 
phosphandyltransferase (phosphatidylinositol synthase) 
[Homo sapiens] sp|014735|PIS_HUMAN CDP- 
DIACYLGLYCEROL-INOSITOL 3- 
PHOSPHATIDYLTRANSFERASE 
(PHOSPHATIDYLINOSITOL SYNTHASE) (PTDINS 
SYNTHASE) (PI SYNTHASE) gb|AAB94860.1| 
(AF014807) phosphatidylinositol synthase [Homo 
sapiens] gb|AAH01444.1|AAH01444 (BC001444) CDP- 
diacylglycerol— inositol 3-phosphatidyltransferase 
(phosphatidylinositol synthase) [Homo sapiens] 


7 


2631 


7515479 


gi|75 15479|pirl|S72298 hypothetical protein 91 - 
Plasmodium falciparum plastid emb|CAA64588.1| 
(X95276)ORF91 [Plasmodium felciparum] 


0.71 


2636 


1 1467075 


gi|11467075|ref|NP_04255Ll| ribosomal protein L5 
[Acanthamoeba castellanu] sp|P46764|RM05_ACACA 
MITOCHONDRIAL 60S RIBOSOMAL PROTEIN L5 
pir||S53852 ribosomal protein L5 - Acanthamoeba 
castellanii mitochondrion gb|AADl 1844.1| (U12386) 
ribosomal protein L5 [Acanthamoeba castellanii] 


9 Q ' 


2639 


1352549 


gi|1352549|sp|P48906|NU2M_HANWI NADH- 
UBIQUINONE OXIDOREDUCTASE CHAIN 2 


4.1 




13111580 


gi|13111580|gb|AAK12385.1|AF296091_l (AF296091) 
polyprotein [Porcine teschovirus] 


2.4 


2646 


1314734 


rrill Q 1 A11A\rrU\ A A AQQRfhl 11 ni^A.&A'W 990 VHsi cillr 

protein [Chironomus thummi] 


1.9 


2651 


7001374 


gi|7001374|gb|AAF34871. 1|AF1 12184 _1 (AF1 12184) 
serine/threonine kinase NKIATRE alpha [Rattus 
norvegicus] 


2.9 


2653 


2687582 


gi|2687582|gb|AAB88853.1| (AF032875) protein kinase 
[Mus musculus] 


7.2 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


2654 


10636263 


gi|10636263|emb|CAC10528.1| (AJ293919) putative 

inncitol 1 A S-tTiQnViocWImtf* TPnj^rstnr ff^siPiifirhaHdifci^ 

briggsae] 


5 


2663 


5902891 


gi|5902891|dbj|BAA84474.1| (AB032367) type I 
polyketide synthase AVES 1 [Streptomyces avermitilis] 


0.087 


2664 


4567214 


gi|4567214|gb|AAD23629. 1|AC0071 13_2 (AC0071 13) 
putative villin [Arabidopsis thaliana] 


6.4 


2665 




gi|2326816|emb|CAA99382.1| (Z75081) ORF 
YOR 1 79.w rSaccharomvcfi^ cerevi^iael 

i WXVl / 4-i W ^UdvvUCU vl**j vvO vvlv Y10XHVJ 


0.46 | 


2667 j 


14285987 


gi|14285987|sp|083933|Y967_TREPA 

hypothetical protein TP0967 - syphilis spirochete 
gb|AAC65925.1| (AE001264) T. pallidum predicted 
coding region TP0967 [Treponema pallidum] 


6.9 


2669 


6678247 


gi|6678247|ref|NP_033358.1| transcription factor 7-like 
1 rMii«a miiOTiliicl emhIPA Al 1 070 11 CAT223069I TCF- 
3 protein [Mus musculus] 


3.4 


2671 


7491697 


gi|7491697|pir||T40527 hypothetical protein 

CDPP^^fl 1 1 p _ -PJ co-inn x/f^act ( Qr\\ i "rrvca r*pV> p rr»m\7f*.f*c. 

pombe) emb|CAA19177.1| (AL023634) putative 
transcriptional regulator [S chizosaccharomyces pombe] 


4.9 


2673 


5459308 


gi|5459308|emb|CAB50693.1| (AJ238951) CE9 protein 
[Canis familiaris] 


8.8 


2675 


7959261 


oi(70S0761 WhilRAAQ6024 11 fABQ40933i KIAA1500 
protein (Homo sapiens] 


1E-36 


2680 


14520279 


gi|14520279|re£|NP_125754.1| hypothetical protein 
[ryrococcus aoyssij pir||D ij ivz nypometicai proicin 
PAB2304 - Pyrococcus abyssi (strain Orsay) 
emb|CAB48985.1| (AJ248283) hypothetical protein 
[Pyrococcus abyssi] 


6.7 


2681 


12654531 


gi|12654531|gb|AAH01098.1|AAH01098(BC001098) 
Unknown (protein for IMAGE:3508043) [Homo 
sapiens] 


2.8 


2682 


12657687 


gi|12657687|gb|AAK01000.1| (AF178873) NADH 
dehydrogenase subunit 1 [Archiearis parthenias] 


0.68 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


2686 i 


6319517 


gi|6319517|reflNP_009599.1| Ybr043cp 
[Saccharomyces cerevisiae] sp|P38227|YBP3_YEAST 
HYPOTHETICAL 77.3 KDA PROTEIN IN FAT1- 
TCM62 ESfTERGENIC REGION pir||S45901 probable 
membrane protein YBR043c - yeast (Saccharomyces 
cerevisiae) emb|CAA84985.1| (Z35912) ORF YBR043c 

rSacnViarntnvnftS! cftrevi^iapl 

^t^civviuiii/iuyv/wo vvl w ViwiOvl 


7 2 


2691 


/U17JUJ 


gi|7019565|ref|NP_037528.1| ubiquitin specific protease 
25; ubiquitin specific protease USP25 [Homo sapiens] 
ref|XP_047738. 1| ubiquitin specific protease 25 [Homo 
sapiens] gblAAF24998.1|AF134213_l (AF134213) 

nV^imiifrn— cnwifi/* tmyM^qc^ FH/"*tn/"\ canipnol 
il UlLJUlLlll oL/ piULCcioC [JT1UJ-I1U octpiCUoJ 


TP -90 


2693 


2130214 


gi)2130214|pir)|S67381 tubulin-folding cofactor D 
homolog - fission yeast (Schizosaccharomyces pombe) 
pir||T393 19 tubulin-folding cofactor D homolog - fission 
yeast (Schizosaccharomyces pombe) emb|CAA20686.1| 

f AT 0*31 ^9R\ +iitm1in_f/VMfncr irtfbrfrvr A 

\±\ju\J3 ljzoj tuouiin-ioiuing coxactur o. 
[Schizosaccharomyces pombe] 


3.9 


2696 




gi|2 12060 l|pirjlJC6030 3-oxo-5alpha-steroid 4- 
dehydrogenase (EC 1.3.99.5) - Comamonas testosteroni 
gb|AAB085 17.1| (L23428) delta 4, 5-alpha steroid 
uenyurogciiasc [^oinamoiias icsLOSicronij 


A £ 


2697 


4263524 


gi|4zojJz4|gD|/VALii joz>u.1| {j\\^\)\}**\) e W) nypotneticai 
protein [Arabidopsis thaliana] emb|CAB77761.1| 
(AL161495) hypothetical protein [Arabidopsis thaliana] 


5.8 


2698 


5453072 


gil^Hjou i z,\g&\t\j\\\j^j^z,\j. i| ^Ajru/:>y/ oiiacioiy 
receptor [Mus musculus domesticus] 


1.1 


2701 


7298915 


gi|7298915|gb|AAF54120.1| (AE003675) CG10267 
gene product [Drosophila melanogaster] 


0.86 


2703 


158148 


gi|158148jgb|AAA28827.1| (M19537) RNA polymerase 
II largest subunit ( EC 2 7 7 6} HDrosoohila 
melanogaster] 


0.95 


2706 


7492334 


gi|7492334|pir||T37965 probable 40s ribosomal protein - 
fission yeast (Schizosaccharomyces pombe) 
emb|CAB16715.1| (Z99531) putative component of U3 
snoRNP, required for pre-18S rRNA processing 
[Schizosaccharomyces pombe] 


6.8 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


2709 


9366627 


gi|9366627|emb|CAB95389.1| (AL359782) conserved 
lypothetical protein, CHR1.84 [Trypanosoma brucei] 


9.2 


2713 


7436828 


gi|7436828|pir||T00129 hypothetical protein 7 - 
Leptospira interrogans (fragment) dbj|BAA24376. 1| 
(ABO 10203) ORF7; putative [Leptospira interrogans] 


7.2 


2715 


380^770 


gi|3805770|gb|AAC69148.1| (U78721) auxin response 
transcription factor 3 (ETTIN/ARF3) [Arabidopsis 
thaliana]gb|AAG53998.1|AF336917 1 (AF336917) 
ARF3 [Arabidopsis thaliana] 
gblAAK26023.1|AF360313_l (AF360313) ARF3 

FArsiVnHrHvnc! tlialianal 


0.65 


2716 


J'tJtIJO 


gi|5454158|reflNP_006286.1| valyl-tRNA synthetase 2 
[Homo sapiens] sp|P26640|SYV2_HUMAN VALYL- 
TRNA SYNTHETASE 2 (VALINE-TRNA LIGASE 
2) (VALRS 2) gb|AAD21819.1| (AF134726) G7A 

n-Ir\Yr»f\ csitvi f*r»cl 


1.6 


2721 


/Ul ^~/\JJ 


gi| /0iy5oj|ret|iNir_UJ /j/o. i| uoiquitin specitic protease 
25; ubiquitin specific protease USP25 [Homo sapiens] 
ref|XP_047738.1| ubiquitin specific protease 25 [Homo 
sapiens] gb|AAF24998.11AF134213JL (AF134213) 
litSinnitin-^neci'fic nrotease fHomo saniensl 


8E-21 


2722 


13186335 


gi|13186335|gb|AAK15378.1| (AF211133) tryptophanyl- 
fRN A synthetase [Carsonella ruddii] 


8.9 


2726 


7514607 


gi|75146071pirllA71301 conserved hypothetical protein 
TP0636 - syphilis spirochete gb|AAC65610.1| 
(AE001238) conserved hypothetical protein [Treponema 
pallidum] 


9.4 


2727 


13129046 


gi|13129046|ref|NPJ)76972.1| hypothetical protein 

MfTr*971R rUnmn <5anipn<!l rpflYP 017742 21 

hypothetical protein MGC2718 [Homo sapiens] 
ref]XP_043040.1| hypothetical protein MGC2718 
[Homo sapiens] gb|AAH01076.1|AAH01076 
(BC001076) Unknown (protein for MGC:2718) [Homo 
sapiens] gb|AAH05121.1|AAH05121 (BC005121) 
hypothetical protein MGC2718 [Homo sapiens] 


3.2 
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SEQDD 
NO 


ACCESS 
N 
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2730 


11466216 


gi|l 1466216|ref|NP_066539.1| haem lyase [Naegleria 
gruberi] gb|AAG17817.1|AF288092_42 (AF288092) 
haem lyase [Naegleria gruberi] 


1.4 


2734 


14701844 


gi|14701844|gb|AAK72251.1| (AF378136) MB2 

"Plasmodium "folcvnaruml 

X IdOlllUUlUlll JXUV/XJL/CU 14-1. H J 


3.3 


2739 


7486489 


gi|7486489|pir||T00671 hypothetical protein F6E13.4 - 
Arabidopsis thaliana gb|AAC23400.1| (AC004005) 
putative methyl chloride transferase [Arabidopsis 
thaliana] gb|AAK73255.1| (AY044314) putative methyl 
chloride transferase [Arabidopsis thaliana] 


8.1 


2747 


/ JU77*tO 


gi|7509946|pir||T26972 hypothetical protein Y47H9C.4 - 
Caenorhabdras elegans emb|CAA21739.1| (AL032657) 
contains similarity to Pram domain: Pr UUUUe (kur-uke 
domain), Score=76.2, E*-value=2.2e-19, N=17~cDNA 
EST yk20a5.3 comes from this gene-cDNA EST 
yk20a5.5 comes from this gene-cDNA EST yk299al2.3 
comes rrom this gene^cjLJrNrv no x yK*ro /go.o tomcis ^ 
gb|AAG60061.1|AF332568_l (AF332568) CED-1 

[ v_/ clCIUJl LLcLU KM Llo GlOgcUlDj 


1.6 


2749 


8569100 


gi|8569100|gblAAF76445.1|AC015445_12 (AC015445) 
Contains Ribosomal S17 PF|00366 and DLH PF|01738 
domains. [Arabidopsis thaliana] 


9.5 


2750 


7293625 


gi|7293625|gb|AAF48997.1| (AE003512) CG14223 
gene product [Drosophila melanogaster] 


2.9 


2751 


5454050 


gi|5454050|ref|NP_006369.1| sema domain, 
immunoglobulin domain (Ig), transmembrane domain 
(TM) and short cytoplasmic domain, (semaphorin) 4D; 
sema domain, immunoglobulin domain (Ig), 
transmembrane domain (TM) and short cytoplasmic 
domain, 4D [Homo sapiens] 

_„j/^no /l iCli/l/IT^ TTT TTV /T A XT CT7A/T A T51-JOI> TXT AT\ 

Spjv2"^oj4(bM41J_rlUMAJN oJbJVLAJ'nLIKJJN H\J 

PRECURSOR (LEUKOCYTE ACTIVATION 
ANTIGEN CD100) (BB18) (A8) (GR3) 
gb|AAC50810.1| (U60800) semaphorin [Homo sapiens] 


9 


2761 


9964076 


gi|9964076|gb|AAG098 12. 1|AF275943_1 (AF275943) 
avermectin polyketide synthase [Streptomyces 
avermitihs] 


2.9 


2762 


4234794 


gi|4234794|gblAAD 12962.1] (AF078135) unknown 
[Leptospira borgpetersenii] 


1.3 
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SEQID 
NO 


ACCESS 

"M 


TYP SCRTPTTON 


P VALUE i 


2764 


11131078 


gi|11131078|sp|O15072|ATS3_HUMAN ADAM-TS 3 

METALLOPROTEINASE WITH 
THROMBOSPONDIN MOTIFS 3) (ADAMTS-3) 

^AliAM-TQ^^ HHirRA A9089 1 11 f A"R009364^ 

CIAA0366 [Homo sapiens] 


4E-17 


2765 


6449069 


ml£/lAQn£Qlc*hl A ATTnaftn^ 1 1 AF1Q77dR 1 ^AF1Q974R^ 

synapsin lb [Lampetra fluviatilis] 


0.008 


2769 


7300358 


gi|7300358|gb|AAF55517.1) (AE003721) CG8045 gene 
jroQUCi [oil zj [urosopmia meianugaoierj 
gb|AAB34837.2| (S78747) RK2 [Drosophila sp.] 


3 


2772 


1 ATX 1 AAA 




6E-20 


7774 


4927134 


gii4927 134|gblAAD33018. 1|AF13 1999_1 (AF 131999) 
mutative eiytnrocyie Dincung proiein CDb-i 
Plasmodium falciparum] 


8.6 ! 


977S 


11994465 


gi|11994465|dbj|BAB02467.1| (AB025624) contains 
similarity to late emoryogenesis aounciani 
protein-gene id:MLD14.16 [Arabidopsis thaliana] 


7.5 


A. 1 / t> 


12852706 


gi|12oj2/Uo|aoj|iJAi>zy3Uo.i| jaivu l^foy I) puiativc 
[Mus musculus] 


1 


2777 
x. / / / 


9972373 


r^lOO^O^^^IftKIA Arrin/;9^ 1IAPH99S91 1 f AP099'521^ 

Unknown protein [Arabidopsis thaliana] 


3.2 


2779 


izojo /oy 


giil2838769idbj|BAB24323.1| (AK005931) putative 
[lvius museums j 


7.6 


2780 


14743085 


gi|14743085|ref|XP_050026.1| similar to 
immunoglobulin superfamily containing leucine-rich 
repeat (H. sapiens) [Homo sapiens] 


3.4 


2781 


4581140 


gi|4581 140|gb|AAD24624.1|AC006919_4 (AC006919) 
unknown protein [Arabidopsis thaliana] 


2.8 


2792 


7661684 


gi|7661684|reflNP_056277.1| DKFZP586L0724 protein 
[Homo sapiens] ref|XP_038194.1| DKFZP586L0724 
protein [Homo sapiens] ref|XPJ)38195.1| 19734 [Homo 

SapiCIloJ pJLl|| 1 1H / 07 IiypuuiCllWal jJUJlOUl 

DKFZp586L0724.1 - human emb|CAB53709.1| 
(AL1 10271) hypothetical protein [Homo sapiens] 
gb|AAH01726.1|AAH01726 (BC001726) Similar to 
DKFZP586L0724 protein (Homo sapiens] 


7 


2793 


13624635 


gi|13624635|emblCAA10856.2| (AJ222584) maturase- 
like protein [Euglena viridis] 


1.3 
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SEQED 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


9 son 


9622888 


gi|yo22o5&|gb|AAFo99oG.l|AF200532_l (AF200532) 
cellulose synthase-8 [Zea mays] 


3.8 


2803 


7499161 


gi|7499161|pir||T25690 hypothetical protein F15A8.6 - 
Caenorhabditis elegans gb[AAB52848.1| (U97549) 
strong similarity to the type-B carboxylesterase/lipase 
family [Caenorhabditis elegans] 


3.1 


2806 


14746401 


gi| 1474640 l|ref|XP_03 1955.1] ring finger protein 27 
[Homo sapiens] 


0.3 


2809 


14731173 


gi|1473 1 173|ref|XP_017730.2| IQ motif containing 
GTPase activating protein 2 (Homo sapiens] 


2E-11 


2813 


4567214 


gi|4567214|gb|AAD23629.1|AC007113_2(AC007113) 
putative villin [Arabidopsis thaliana] 


3.5 


2817 


2500866 


gi|2500866|sp|Q2041 1|SA1 1_CAEEL SRA-1 1 
PROTEIN pir||T22192 hypothetical protein F44F4.13 - 
Caenorhabditis elegans emb|CAA85461.1| (Z37092) 
contains similarity to Pfem domain: PF021 17 (C.elegans 
Sra family integral membrane protein), Score=675.6, E- 
value=8.3e-200, N=l [Caenorfiabditis elegans] 


3.1 


2818 


15011503 


gi| 1 50 1 1 503|gb|AAK77598 . 1|AF396436_38 
(AF396436)ymf71 [Tetrahymena thermophila] 


5.9 


2823 


1 1278033 


gi|l 1278033|pir||C81832 transferrin-binding protein A 
NMA2024 [imported] - Neisseria meningitidis (group A 
strain Z2491) gb|AAC13726. 1| (AF058689) transferrin 
binding protein A precursor [Neisseria meningitidis] 
emb|CAB85243.1| (AL162757) transferrin-binding 
protein A [Neisseria meningitidis Z249 1] 


5 


• 

2825 




gi|3913143|sp|023913|AXlB ARATH 
ALTERNATIVE OXIDASE IB PRECURSOR 
dbj|BAA22624.1| (D89875) alternative oxidase 
[Arabidopsis thaliana] dbj|BAB01774.1| (AB022215) 
alternative oxidase lb precursor [Arabidopsis thaliana] 


L.Z 


2827 


12842679 


gi|12842679|dbj|BAB25689.1| (AK008476) putative 
[Mus musculus] 


7.8 


2835 


12848636 


gi|12848636|dbj|BAB28031.1| (AK012100) putative 
[Mus musculus] 


0.083 


2839 


7292152 


gi|7292152igb|AAF47564.1| (AE003472) Dhc62B gene 
product [Drosophila melanogaster] 


4.6 
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2846 


9628099 


gi|9628099|ref|NP_042686.1| alternative tat protein 
jemDrana disease vinisj guj/v/\-f\o £ f->^-?.i| ^u^iwj; 
alternative tat protein [Jembrana disease viras] 
prf||21 16345E tat gene [Jembrana disease virus] 


3.3 


2855 


4165313 


protein [Homo sapiens] 


9.2 


2859 


14388365 


gi|14388365|dbj|BAB60739.1| (AB062957) hypothetical 
protein [Macaca fascicularis] 


0.55 


2864 


14739019 


gi|14739019|ref|XPJ)05626.3| deleted in bladder cancer 
chromosome region candidate 1 [Homo sapiens] 


6.3 


2867 


6319950 


gi|63 19950|ref|NPJ)10031.1| Transcription regulator; 
Ycrl06wp [Saccharomyces cerevisiae] 
sp|P2561 1|YCZ6_YEAST PUTATrvt 95.7 JsJJ 
TRANSCRIPTIONAL REGULATORY PROTEIN IN 

T» * A A TVO TS.T P I ' CP T7XTT/"* "D T?/"1T/"Y\T r>i»-IIQ 1 OA 1 ft 

i*A U 3 -AAD3 IN 1 iiROxbiN KiiVxllJJN pirj | o 1 y*t I o 
probable membrane protein YCR106w - yeast 
paccnaromyces cerevisiae j emD|v>^\A 4 rZrZ.jo.ij 
(X59720) hypothetical protein [Saccharomyces 
cerevisiae] 


9.1 


2870 


14724850 


gi|14724850|reflXP 050192.1| 29140 [Homo sapiens] 


2.2 


2871 


4587097 


gi!4587097|dbj|BAA76616.1| (AB019045) 
OlVrPHerfirhowlase rRhizornucor ousillusl 


9.7 


2873 


7522108 


gi|7522108|pir||T29097 pro-pol-dUTPase polyprotein - 
murine endogenous retrovirus ERV-L (fragment) 
emb|CAA7325Ll| (Y127 13) protease; reverse 
vranscriptasc, ivln aacrr, iiilc^icu>c, uuir oaw, jriwr-xvu 

U.VJ X Jr ooC LHJ1 V Ui Ulwiii | ivxLAo iiiuaviuuDj 


0.003 


"7875 


7482073 


gi|7482073|pir||B69010 conserved hypothetical protein 
JV1 i rxiu / o - JVietnanoDacterium mermoauToiropincum 
(strain Delta H) gb|AAB85567.1| (AE000879) 
conserved proxein [ivieuianouiciiiiuDaoLci 
thermautotrophicus] 


4.6 


2876 


11360401 


gill 1360401|pir||T42759 Muncl3-3 protein - rat 


0.000000005 


2879 


5091521 


gi|5091521|dbj|BAA78756.1| (AB023482) Hypothetical 
protein [Oryza sativa] 


2.1 


2881 


13810543 


gi|13810543|dbj|BAB43950.1| (AB051633) ookinete 
surface protein Pos28-2 [Plasmodium ovale] 


3 


2882 


5091521 


gi|5091521|dbj|BAA78756.1| (AB023482) Hypothetical 
protein [Oryza sativa] 


2.1 
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zoo / 


6723244 


gi|6723244|dbj|BAA89640.1| (AB036666) similar to 
terminase large subunit of phage lambda [Wolbachia sp. 
wKue] 


7.1 


Z007 


7494347 


gi|7494347|pir||E71625 variant-specific surface protein 
1 truncated homolog PFB0020c - malaria parasite 
\ Plasmodium falciparum) gb|AAC71794.1| (AE001366) 
PfEMPl fragment [Plasmodium falciparum] 


5 


OOQA 

Zoy\J 


6942188 


gi|6942188|gb|AAF32349. 1|AF220008__1 (AF220008) 
coilin p80 [Danio rerio] 


0.11 


2891 


10639397 


gi|10639397|emb|CACl 1399.1| (AL445063) 
lypothetical membrane protein [Thermoplasma 
acidophilum] 


8.5 


2898 


10800417 


gi|10800417|ref|NPJ)06759.2| BRCA1 associated 
protein (Homo sapiens] 




2901 


7488089 


gi|7488089|pir||T02283 probable disease resistance 
protein T13D8.20 - Arabidopsis thaliana 
gb|AAC24071.1| (AC004473) Contains similarity to 
TMV resistance protein N homolog gb|2245048 from A. 
thaliana chromosome 4 contig gb|Z97342. [Arabidopsis 
thaliana] 


8.5 


9 OH/1 


10728394 


gi|10728394[gb|AAF45654.2| (AE003421) EG:171E4.2 
gene product [Drosophila melanogaster] 


1.4 


2905 


10956333 


gi|10956333|ref|NP_052782.1| pXOl-86 [Bacillus 
anthracis] pirj|F59101 hypothetical protein pXOl-86 - 
Bacillus anthracis virulence plasmid pXOl 
gb|AAD32390.1|AAD32390 (AF065404) pXOl-86 
[Bacillus anthracis] 


9.7 


2906 


4028153 


gi|4028153|gbjAAC961 17.1| (AF083221) putative 
neurotransmitter receptor [Takifugu rubripes] 


2.8 


2909 


12836120 


gi|12836120|dbj|BAB23511.1| (AK004729) putative 
[Mus musculus] 


5.3 


2911 


1 / /oo*f*l- 


gi|1778844|gb|AAB40929.1| (U83086) LimA 
[Uiciyosienum aiscoiaeumj 


X.J 


2913 


3192956 


gi|3192956|gb|AAC41298.1| (AF033670) T-Box protein 
4 [Gallus gallus] 


3.1 


2914 


7506147 


gi|7506147|pir||T33303 hypothetical protein R01B10.4 - 
Caenorhabditis elegans gb|AAC17768.1| (AF068718) 
R0 IB 10.4 gene product [Caenorhabditis elegans] 


0.4 
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2915 


6041793 


gi|6041793|gb|AAF021 13. 11AC009755_6 (AC009755) 
putative auxin-independent growth promoter 
"Arabidopsis thaliana] 


2.1 


2916 


5803098 


gi|58030981refINP_006757.1) zinc finger protein 220; 
Monocytic leukemia zinc finger protein [Homo sapiens] 
sp|Q92794|MOZ_HUMAlN MUNUCYIIC 
LEUKEMIA ZINC FINGER PROTEIN (ZINC 

monocle leukaemia zinc finger protein [Homo sapiens] 


0.084 


2919 


10639353 


^|l(/oJyj3J|emo|CACii^D3.i| (ai>hk>uo.3 j j 
hypothetical membrane protein [Thermoplasma 
acidophilum] 


3.1 


2920 


13235586 


gi|13235586|emb|CAC33776.1| (AJ301807) SclB 
protein [Streptococcus pyogenes] 


9.7 


2925 


/jU/Olo 


gi|7507618|pir||T33548 hypothetical protein T10D4. 8 - 
^aenornauuius cicgcixib 


4.3 


2928 


7522108 


gi|7522108|pir||T29097 pro-pol-dUTPase polyprotein - 
murine endogenous retrovirus ERV-L (fragment) 
emb|CAA73251.1| (Y12713) protease; reverse 
transcriptase; RNaseH; integrase; dUTPase; Pro-Pol- 
dUTPase polyprotein [Mus musculus] 


1.2 


2932 


13542796 


gl|13542796|gb|AAHUjoUl.l|AAnUjOUl (iSv^UUDOU 1 ) 

Similar to RIKEN cDNA 1 1 10061A19 gene [Mus 
musculus] 


6.4 


2934 


417©o9 


gi|417869|sp|P33007|TERPJPSESP TERPREDOXIN 
( I ux.) pn]\cAZy 1 1 terpreuoxin - jrseuaomonas sp 
gb|AAA25998.1| (M91440)terpredoxin [Pseudomonas 
sp.J 


3.7 


2938 


7433891 


gi|7433891|pir))T00981 flavonol 3-O-glucosyltransferase 
homolog T9J22.15 - Arabidopsis thaliana 
gb|AAC14497.1| (AC002505) putative 
glucosyltransferase [Arabidopsis thaliana] 


6.8 


2939 


15021546 


gi|15021546|gb|AAK77823.1|AF369029_154 
(AF369029) ORF154 [white spot syndrome virus] 


0.25 


2944 


11466694 


gi|11466694|ref[NP_039290.1| ORF370i [Marchantia 
polymorpha] sp|P12174|MATK_MARPO PROBABLE 
INTRON MATURASE pir||A05034 hypothetical 
protein 370i - liverwort (Marchantia polymorpha) 
chloroplast emb|CAA28076.1| (X04465) ORF370i 
[Marchantia polymorpha] 


4.3 
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2945 


7302236 


gi|7302236|gb|AAF57330.1| (AE003786) CG10416 
gene product [Drosophila melanogaster] 


1.4 


2950 




gi|2495638|sp|P7651 1|YFD0_EC0LI 
HYPOTHETICAL 14.4 KDA PROTEIN IN DMTC- 
DSDC INTERGENIC REGION pir||C65009 
lypoLneucai proicm dzj-)o - JDScnciicnia. con ^sxraui rv~ 
12) gb|AAC75417.1| (AE000324) orf, hypothetical 

nrrvt f*in f Pcr*lif*ri f*Vii n rvVH TCI 91 

piUlCUl [LjaCilCliviUd LAJ11 


4 ^ 


2955 


220578 


gi|220578|dbj|BAA00447.1| (D00570) open reading 
frame (25 1 AA) [Mus musculus] 


4.2 


2958 


12855573 


gi|12855573|dbj|BAB30384.1| (AK016695) putative 
[Mus musculus] 


0.007 


2963 


2981631 


familiaris] 


0.29 


2966 


76^74.01 


gi|7657401|refINP_056616.1| neuropathy target 
esterase; Swiss cheese [Mus musculus] 
gb|AAD51700.1|AF173829_l (AF173829) neuropathy 

trarrtpf pctpracp ti/MYif\lr\f» f \/Ti 1 c mnonilnol 
UxigCL CblCio.SC IHJIllUXOg L-LV±Ub lllUbl/UiUoj 




2967 


IHZOJJjJ 


gi| 14285535|sp|P71399|LSGl_HAEIN LSG LOCUS 1 

PTTTATTVP pi? OTT?TM 1 


4 8 


2973 


144?^7R(i 


gi|14423780|sp|O95013|O4F3_HUMAN OLFACTORY 

KiiUJiF 1 UK 4r 3 gD|AALIUD 150 . lj {AUUU4yUo J Similar 

to rat olfectory receptor OR18; similar to S29710 




2975 


14771691 


gi|14771691|reffXP_045484.1| 67354 [Homo sapiens] 


1.3 


2978 


6573738 


gi|6573738|gb|AAF17658.1|AC009398_7(AC009398) 
F20B24.13 [Arabidopsis thahana] 


8.5 


2982 


1381 1437 


gi|13811437|gblAAK40121.1| (AF354707) type H 


? 4 


2986 


7522108 


gi|7522108jpir||T29097 pro-pol-dUTPase polyprotein - 

iwiifiti** f*vi r\ r\cr(*Y\ tw 1 c rptrm/iriic P t£ \7_T / fro rmn^Tii"^ 
lULUJliC vilUAJgCllvJU-o 1 vLi U V±l US> JLvXv V L-i \±1 ct£^l 1 1C11 L J 

emb|CAA73251.1| (Y12713) protease; reverse 
transcriptase; RNaseH; integrase; dUTPase; Pro-Pol- 
dUTPase polyprotein [Mus musculus] 


0.00001 


2990 


12722927 


gi| 12722927|gb|AAK041 8 1 . 1|AE006247_3 (AE006247) 
UNKNOWN PROTEIN [Lactococcus lactis subsp. 
lactis] 


1.3 
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2994 




trill n4.671^1nirllPfX1^09 VivnntViPticnl nrntein Cil009c 

imported] - Campylobacter jejuni (strain NCTC 11168) 
emb|CAB73265.1| (AL139077) hypothetical protein 

fi 1 Oft Op r atrrnvl rvha cvfer ieiiinil 


4.1 


2995 


7321945 


ml7^9104SlahlAAP60504 91 ^68356^ action ootential 
broadening potassium channel [Aplysia sp.] 


4.2 


3003 




oill 1 ^£0^Q4lrnrl 1X497*3 1 fltrnnVirn-1 related nrotein - rat 
gb|AAA98970. 1| (U44091) atrophin-1 related protein 

"RattiiQ noTveericu^l 


7.6 


3005 


10765337 


gi|10765337|gblAAG22997.1| (AF188579) glycoprotein 
Bovine respiratory syncytial virus] 


1.9 


3006 


5803252 


gi|5803252|dbj[BAA83562.1| (AP000399) hypothetical 
protein [Oryza sativa] 


0.046 


3007 


7662688 


oil7662688l&blAAF66138 11 (L00016 1 ) urf4 fHomo 
sapiens] 


1.7 


3013 


14150037 


gi|14150037|ref]NP_l 15666.1| hypothetical protein 

hypothetical protein DKFZp761C121 [Homo sapiens] 
emhirAB66495 11 CAL136560^ hvDOthetical protein 
[Homo sapiens] 


0.00000001 


! 3014 


7243706 


gi|7243706|gb|AAF43421.1|AF233291_l (AF233291) 
epsin-like protein [Drosophila melanogaster] 


6.5 


3019 




gi|7293274|gb|AAF48655.1| (AE003503) CG9644 gene 
nrnrhirt IT)rn<3nTiViila melarioeasterl 


0.47 


3025 


11414881 


gi|11414881|dbj|BAB18568.1| (AB028173) HCCA2 
[Homo sapiens] 


8.6 


3026 


11359423 


gi|l 1359423|pir||T48729 hypothetical protein 8D4.30 

lIxTipOITeClJ IN CU-I UbUUl d Llcuoa Giiiiy|v^xvi-JOtJ»;"T*/. xj 

(AL353819) conserved hypothetical protein 
[Neurospora crassa] 


0.66 


3028 


2118405 


gi|21 18405|pir||I51018 cobra venom fector precursor - 
monocled cobra gb|AAA68989.1| (U09969) cobra 
venom factor precursor [Naja naja] 


3.1 


3029 


2739145 


gi|2739145|gb|AAC98522.1| (AF030306) envelope 
protein; ORF4 [Porcine reproductive and respiratory 
syndrome virus] 


4.3 


3030 


13549158 


gi|13549158|gb|AAK29672.1|AF353095_l (AF353095) 
protein synthesis initiation factor eIF2 beta [Arabidopsis 
thaliana] 


6.3 
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3034 


14753542 


gi|14753542|ref]XP_046011.1| 53288 [Homo sapiens] 


0.000000006 


3042 


3150471 


gipi_>u £ t/ jLfgu|A/w^i0707. i| \atuo/zii^ xiypuuicuicai 
protein B0205.2 [Caenorhabditis elegans] 


0.56 


3044 


14734012 


gi|14734012|reflXP_051005.1| KIAA1297 protein 
[Homo sapiens] 


0.16 


3047 


74Q3QQ4 


gi|7493994|pir||JC6564 cellobiose oxidase (EC 1.1.3.25) 


6 1 

v. X 


3056 




gl|114351|sp|rUo314|A112_hloVlr AJLrrlA IKAJNb- 
INDUCING FACTOR 77 KD PROTEIN pir||TNBE77 
/ / rv aipna ixctns -mciucrng protein - nuiuaii nerpes virus x 
(strain F) gb|AAA45768. 1| (M15621) alpha trans- 

itir4nr»tncr "feif^trvr 771fl*\ rVintYidti Vimt\pc\71T1'iq 11 
lUvlllwlLlg xateixjx / / 1S.U [llu-lllall xlCxpwoViiuo xj 


7 


3069 


6562755 


gi|6562755|emb|CAB62894.1| (AL035475) possible 
ribosomal protein [Plasmodium falciparum] 


6.4 


3071 


804764 


<yit9riA7/^lloKt A A A 1 1 /TV/T7787M npiitral nrrrfpsicf* 
gljOV/H / 0*r|gO|r\/VrVO J^Jzfzf. L\ \J\L£. t Oa.\jj IlCUUdl plUlCdbC 

large subunit [Homo sapiens] 


0.00003 


3079 


14768311 


frill /17£5i^l 1 l™»flYP nAR^Ofi llmpfhvl Pr»f^ Viinrlincr 

protein 2 [Homo sapiens] 


0.65 


3080 


11994604 


gi|11994604|dbj|BAB02658.1| (AP002062) 
geneJki:T22B15.1 l~unknown protein [Arabidopsis 
thaliana] 


5 


3083 


7302673 


r»il7^n7£7^lrrkl A A 17^77^ ll ( APfiO^RAfW c+an (tpn^ 
glj / 3UZO / 0|gDJAiVr3 / / . l| \/YliUU;5 OUIJJ SuJU gene 

product [alt 2] [Drosophila melanogaster] 


9.2 


3085 


7494170 


gi|7494170|pi*i|D71613 GAF domain protein (cyclic nt 
signal transduct) PFB05 lOw - malaria parasite 
(Plasmodium falciparum) gb|AAC71891.1| (AE001399) 
vr/\r uoinain protein ^cyclic nt signal irausaucL.j 
[Plasmodium falciparum] 


8.1 


3087 


14590168 


gi|14590168|ref]NP_142233.1| hypothetical protein 

1 DinwnorMic V»r\t*ilroolri-il rvi"rirF7 1 74.7 li^JT^r\tlip+i/*5k1 i%Ttf\fr^4n 

I^jt yiL)t/UvwUo iiuxiKUSiJiij pii|(.r / ±z,*t / liypumcuuii piuucixi 

PH0237 - Pyrococcus horikoshii dbj|BAA29309.1| 
(AP000001) 230aa long hypothetical protein 
[Pyrococcus horikoshii] 


2.3 ; 


3097 


14326099 


gi|14326099|gb|AAK60138. 1|AF365405_1 (AF365405) 
ribosomal protein S14 [Schizosaccharomyces pombe] 


3 
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